Hello guys,
A few days ago I started a project with Twitter, Hadoop and Flume.
I want to collect tweets and create a map, similar to a density map, but enriched with additional information from another source.
My Flume agent runs and collects tweets, but none of them has coordinates. Only some of them carry information about the location, but I need coordinates.
I tried adding the parameter “locations” to my configuration with the value -180,-90,180,90, as described in the API, but the problem stays the same.
Can anybody tell me what to do to get tweets with coordinates that I can use for a map? If possible, it would be great to collect only tweets that have coordinates. But I would already be happy if at least some of the tweets had coordinates.
Here you can see my configuration.
TwitterAgent.sources = Twitter
TwitterAgent.channels = MemChannel
TwitterAgent.sinks = HDFS
TwitterAgent.sources.Twitter.type = com.cloudera.flume.source.TwitterSource
TwitterAgent.sources.Twitter.channels = MemChannel
TwitterAgent.sources.Twitter.consumerKey = xxx
TwitterAgent.sources.Twitter.consumerSecret = xxx
TwitterAgent.sources.Twitter.accessToken = xxx
TwitterAgent.sources.Twitter.accessTokenSecret = xxx
TwitterAgent.sources.Twitter.keywords = hadoop, big data, apache foundation, flume, mahout
TwitterAgent.source.Twitter.locations=-180,-90,180,90
TwitterAgent.sinks.HDFS.channel = MemChannel
TwitterAgent.sinks.HDFS.type = hdfs
TwitterAgent.sinks.HDFS.hdfs.path = hdfs://localhost:9000/examples/tweets/
TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream
TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text
TwitterAgent.sinks.HDFS.hdfs.batchSize = 1000
TwitterAgent.sinks.HDFS.hdfs.rollSize = 0
TwitterAgent.sinks.HDFS.hdfs.rollCount = 10000
TwitterAgent.channels.MemChannel.type = memory
TwitterAgent.channels.MemChannel.capacity = 10000
TwitterAgent.channels.MemChannel.transactionCapacity = 100
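To make clearer what I am after, here is a minimal sketch of the filtering I have in mind. It assumes that the HDFS sink writes one tweet JSON object per line and that geotagged tweets carry a non-null `coordinates` field in GeoJSON form (a `Point` with longitude first, then latitude). The function name `tweets_with_coordinates` and the sample records are made up for illustration and are not part of Flume or the Twitter API.

```python
import json


def tweets_with_coordinates(lines):
    """Yield (longitude, latitude, tweet) for every tweet that has an exact point.

    Assumption: each input line is one tweet serialized as JSON, and geotagged
    tweets have a "coordinates" object like
    {"type": "Point", "coordinates": [lon, lat]}.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        try:
            tweet = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or corrupt records
        if not isinstance(tweet, dict):
            continue
        point = tweet.get("coordinates")
        if not isinstance(point, dict) or point.get("type") != "Point":
            continue  # tweet has no exact coordinates
        lon, lat = point["coordinates"]
        yield lon, lat, tweet


# Hypothetical sample data, only to show the expected input shape.
sample = [
    '{"id": 1, "text": "no geo", "coordinates": null}',
    '{"id": 2, "text": "geo", "coordinates": {"type": "Point", "coordinates": [13.4, 52.5]}}',
    'not json at all',
]

for lon, lat, tweet in tweets_with_coordinates(sample):
    print(tweet["id"], lon, lat)
```

This only filters after collection, so tweets without coordinates would still be downloaded and stored. What I would prefer is a way to request only geotagged tweets in the first place.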
Thanks for your help
Frank