Duplicate Tweets



I’m using full-archive search and getting duplicate Tweets: of the 61k Tweets I pulled, only 1,002 are unique. Excluding Retweets lowers the total to 30k, but that’s still way more than 1,000.
How can I get only unique tweets?

This is my code (using Python requests):

import requests

base_url = 'https://api.twitter.com/'
# access_token is the app's bearer token, obtained beforehand
search_headers = {
    'Authorization': 'Bearer {}'.format(access_token)
}
search_params = {'query': 'search phrase', 'maxResults': 500}
search_url = '{}1.1/tweets/search/fullarchive/dev.json'.format(base_url)
search_resp = requests.get(search_url, headers=search_headers, params=search_params)
tweet_data = search_resp.json()


This depends on what you mean when you say ‘unique Tweets’.
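To make the two most likely readings of "unique" concrete, here is a minimal sketch that de-duplicates a list of Tweets either by ID or by normalized text. It assumes each Tweet is a dict with 'id_str' and 'text' fields; the helper names unique_by, by_id and by_text are hypothetical and only for illustration.

```python
def unique_by(tweets, key):
    """Keep the first Tweet seen for each value returned by key(tweet)."""
    seen = set()
    unique = []
    for tweet in tweets:
        k = key(tweet)
        if k not in seen:
            seen.add(k)
            unique.append(tweet)
    return unique


def by_id(tweet):
    # Same Tweet returned more than once (assumes an 'id_str' field).
    return tweet['id_str']


def by_text(tweet):
    # Different Tweets with the same wording, ignoring case and
    # whitespace differences (assumes a 'text' field).
    return ' '.join(tweet['text'].lower().split())


sample = [
    {'id_str': '1', 'text': 'Hello world'},
    {'id_str': '1', 'text': 'Hello world'},    # exact repeat of Tweet 1
    {'id_str': '2', 'text': 'hello   world'},  # new Tweet, same wording
]
unique_ids = unique_by(sample, by_id)
unique_texts = unique_by(sample, by_text)
```

If the count by ID is much lower than the number of Tweets pulled, the same Tweets are being returned repeatedly. If only the count by text is low, the Tweets are distinct but share the same wording.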

I suggest reviewing our premium operator offerings. I’m assuming you negated is:retweet (-is:retweet) to remove Retweets. We also have is:reply and is:quote operators that can be negated, but they are only available with the PowerTrack product at this time.
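As a rough sketch of what negating is:retweet looks like in the request parameters, assuming the same 'query' and 'maxResults' parameters as in the code above (build_search_params is a hypothetical helper, not part of any library):

```python
def build_search_params(phrase, exclude_retweets=True, max_results=500):
    """Build the parameter dict for a search request.

    A leading '-' negates an operator, so '-is:retweet' drops Retweets.
    The phrase is wrapped in parentheses so the negation applies to the
    whole phrase even if it contains OR.
    """
    query = phrase
    if exclude_retweets:
        query = '({}) -is:retweet'.format(phrase)
    return {'query': query, 'maxResults': max_results}


search_params = build_search_params('search phrase')
```

The resulting dict can be passed as params to requests.get in place of the original search_params.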

If you still need help, please also include your search query for us to review.
