I’m at the stage now where, using twarc, I’ve successfully extracted all Tweets in a particular date range for a range of hashtags.
I’ve been looking for an easy way to ‘eyeball’ the files to get a feel for their content, prior to further processing, data cleaning, analysis etc.
What I’ve done so far is to convert them to CSV format using twarc and load them up in Excel. That way, I can move around the rows and columns to see what’s going on.
But, I suspect there must be a better way than CSV → Excel.
What do other people do for this stage? Is there any software specifically for this purpose? Like a ‘Tweet viewer’? Does anyone write the data into a database and look at it there? Anything else?
Anyway, just looking for some advice because - as you can see - I’m new to this.
Thanks
Mike
1 Like
I usually do something similar - CSV → Pandas instead.
I’m planning on implementing a sort of “report” of the data that includes a bunch of summary statistics and counts over time in Counts Wishlist · Issue #2 · DocNow/twarc-statistics · GitHub this plugin. Still nothing working there at the moment.
There’s also twarc-network that can maybe help visualize things GitHub - DocNow/twarc-network: Generate network visualizations from Twitter data.
Would love to hear what others do!
1 Like
Thanks for the prompt response, Igor.
I’ll report back on how I get on.
Mike
1 Like