Hi,

I downloaded archive of tweets via tweet downloader webtool and I came across with this problem. When I export downloaded tweets to json, long integers (id, author_id, in_reply_to_user_id) are okay. But when I want to download it to csv these fields are corrupted. There are shown only first few digits and the rest is replaced by zeros. Do you have the same problem or it is only in my case.

This usually happens when opening the CSV in excel, which destroys the IDs. The best thing to do is not to use it, or import the CSV and specify each column manually as text.

1 Like

Hello, thank you, as I imported it directly to R the long integers stayed intact. Now I have to find out the best way to parse those “nested” dictionaries. Do you have by any chance some best practice advice for it? (But I guess its for another topic.
Thank you again.

1 Like

I use twarc csv as a command line tool to convert the json into flattened csv, GitHub - DocNow/twarc-csv: A plugin for twarc2 for converting tweet JSON into DataFrames and exporting to CSV. unfortunately it won’t work on another CSV file, only on json

1 Like

Thank you once again. Tried converting json to csv and works perfect.

1 Like