Trouble detecting duplicate tweets in Search API


#1

How can I detect when a tweet is duplicated? Is comparing the text the only way?


#2

Those are not duplicate Tweets - they are all independent, different Tweets with different IDs. The Tweet status text is duplicated, yes (probably someone is copying and pasting the status, or using an old-style “RT” format client). You’d have to filter those out in your own code.
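
As a rough illustration of that kind of filtering, here is a minimal sketch. It assumes each search result has already been parsed from JSON into a dict with a `text` field; the helper names `normalize` and `drop_duplicate_text` are made up for this example and are not part of any Twitter library. How aggressively to normalize (case, URLs, the "RT @user:" prefix) is a judgment call for your own data.

```python
import re

def normalize(text):
    """Reduce a status text to a comparison key (hypothetical helper)."""
    # Drop an old-style "RT @user:" prefix if present.
    text = re.sub(r"^\s*RT\s+@\w+:?\s*", "", text, flags=re.IGNORECASE)
    # Drop URLs, since shortened links often differ between copies.
    text = re.sub(r"https?://\S+", "", text)
    # Lowercase and collapse runs of whitespace.
    return " ".join(text.lower().split())

def drop_duplicate_text(tweets):
    """Keep the first tweet seen for each normalized text, in order."""
    seen = set()
    unique = []
    for tweet in tweets:
        key = normalize(tweet["text"])
        if key not in seen:
            seen.add(key)
            unique.append(tweet)
    return unique
```

Calling `drop_duplicate_text(results)` on the list of statuses from one search page would keep one tweet per distinct text and leave the original order intact.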


#3

If the tweet is a retweet, you will have the retweeted_status property populated with the original tweet. Otherwise it is not a duplicate, or, as mentioned by andypiper, it is probably a copied or RT-formatted tweet.
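
A small sketch of how retweeted_status could be used to collapse native retweets, assuming each result is a dict parsed from the JSON response with an `id` field and, for retweets, a nested `retweeted_status` object. The function names `original_id` and `collapse_retweets` are hypothetical, chosen only for this example.

```python
def original_id(tweet):
    """Return the ID of the original tweet if this is a native retweet."""
    retweeted = tweet.get("retweeted_status")
    if retweeted:
        return retweeted["id"]
    return tweet["id"]

def collapse_retweets(tweets):
    """Keep one result per original tweet, preserving order."""
    seen = set()
    unique = []
    for tweet in tweets:
        key = original_id(tweet)
        if key not in seen:
            seen.add(key)
            unique.append(tweet)
    return unique
```

This only catches native retweets. Copied or old-style "RT" tweets carry no retweeted_status, so they would still need a text comparison.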


#4

I found it strange because I ran a search limited to 5 results and, to my surprise, they all looked identical to the naked eye. Excuse my English, I am using a translator.