Dear Sir/madam
Morning
How can I scrappe twitter data from January 01, 2020 to December 30, 2022?
Thank you
I believe that you need Academic access. Otherwise, you can only retrieve Tweets from the past week.
Alternatively, if you have Tweet IDs, you can rehydrate them. Maybe I’m mistaken, but I don’t think that there is any time limit on this.
I guess this is probably not the answer you wanted. I wish you the best
1 Like
Without Academic Access, the only other alternative is to use the old v1.1 Premium API, for which you need elevated access. Search API | Twitter API | Docs | Twitter Developer Platform but this is totally different and separate to v2 Academic Access, and the old v1.1 search entirely.
1 Like
Thank you for reply. I have academic access
That’s good. You can use twarc. This should get you started:
Follow these tutorials. Try it out. Let us know if you have a more specefic question.
1 Like
Thank you for the suggestion. I am usingRSt udio. Is twarc compatible with RStudio?
twarc command line tools themselves should be run inside a terminal / command prompt - not in R, but R should be able to load the generated CSV from twarc without any issues.
so, to run twarc, I should use python, right?
Yes, you use Python. It’s very straightforward after you set up.
Try it out. If you get stuck, someone will help you
1 Like
You need python installed, but you will not need to code any python. Twarc runs as a command line tool - so you type commands into a prompt, not python.
The python looks like this. How to start?
No, this is a python prompt - you type python code in here, which is not what you want. But this is good that Python is definitely installed on your machine.
For twarc, you only need to have python installed, and use a command line to type a command in. On windows, try opening PowerShell for example.
See here for a demo:
(ignore the $ because on windows it will look different, but you can type the exact same commands into Powershell - you may also need to replace twarc2 with twarc2.exe but i don’t have Windows to double check this)
After typing (twarc), it looks like this:
Oh, sorry, no, you don’t need to type (twarc) in either - just the actual twarc2 command, like
twarc2 search
Also you are still typing this into a Python prompt, not a Command Line - it may be worth starting here: Tutorial - twarc