Gardenhouse Access?


#1

Hi,

I intend to collect data for my research in Data Mining area. For this research, i need a big data access, which is around 10 million data in English.

Actually, currently I am crawling Twitter using the resource written in this link: https://github.com/lintool/twitter-tools/wiki/Sampling-the-public-Twitter-stream
With this Public Streaming method, how many tweet maximum can I get?

Or, if possible, can i get access to the gardenhouse for retrieving at least 10 million English tweet? Thank you very much.


#2

You probably don’t need any kind of elevated streaming access to accomplish this. If you leave the ~1% sample hose at [node:10390] connected for a long period of time, you’ll be able to collect many tweets in short order.