I am using the Streaming API to harvest geo-tagged English tweets for the Greater London area. After reading several papers, I found that their datasets contain far more data than I collect each day. For example, one study collected 41.2 million geo-tagged tweets over the whole of 2014, whereas I can only collect 7k-9k per day (although I do collect English tweets only). I am wondering whether something is wrong with my code or settings, or whether they were using commercial APIs to get that much data.
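To put the gap in numbers, here is a rough back-of-the-envelope calculation. It assumes the study's 41.2 million tweets were spread evenly over the 365 days of 2014, and it ignores that the study may have covered a different area or more languages than my setup. The variable names are just mine.

```python
# Figures quoted above; 2014 was not a leap year, so 365 days.
PAPER_TOTAL_TWEETS = 41_200_000
DAYS_IN_2014 = 365
MY_DAILY_RANGE = (7_000, 9_000)  # my observed low and high daily counts


def daily_average(total, days):
    """Average tweets per day, assuming an even spread (an assumption)."""
    return total / days


paper_daily = daily_average(PAPER_TOTAL_TWEETS, DAYS_IN_2014)

# How many times larger the study's daily average is than my low/high counts.
ratios = tuple(paper_daily / mine for mine in MY_DAILY_RANGE)

print(f"Study: about {paper_daily:,.0f} geo-tagged tweets per day")
print(f"That is roughly {ratios[1]:.1f}x to {ratios[0]:.1f}x my daily volume")
```

So the study averages over 100k tweets per day, more than ten times what I get, which is why I suspect either my setup or the data source differs.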
Can anyone share their experience? How much data do you get in one day? I would also like to know whether 7k-9k tweets per day is a reasonable number.