Yes, individual hashtags, counts, possibly some of the information located in the tweet itself (related terminology to the hashtags). User ID is probably useless to the type of study we are doing.
The only other aspect I think we would possibly use is location, but only to try and separate US and European clusters, which may not be possible at all.