I’ve been collecting geotagged tweets through the streaming API for some time now. As of February, there has been a significant drop in the number of tweets received. This looks to be related to:
When I plot the number of hashtags tweeted per day as a function of time does, this is what I get:
which shows a massive drop off, both in mean and standard deviation. But is this behavior expected? Daily volume varies and so I would expect that my 1% share should also vary—at this scale, it appears mostly flat and makes me wonder if I’m being capped at a fixed total rather than a percentage. Could someone explain the massive drop, and also whether what I am seeing is still consistent with 1%? Is the “volume” that Twitter uses to determine 1% averaged over days, and thus, relatively constant?