I’m using the full-archive search and getting duplicate tweets: out of the 61k tweets I pulled, only 1,002 are unique. Removing the retweet option lowers the number to 30k, but that’s still far more than 1,000.
How can I get only unique tweets?
This is my code (using Python requests):
import requests

base_url = 'https://api.twitter.com/'
# access_token is the app's bearer token, obtained beforehand
search_headers = {
    'Authorization': 'Bearer {}'.format(access_token)
}
search_params = {'query': 'search phrase', 'maxResults': 500}
search_url = '{}1.1/tweets/search/fullarchive/dev.json'.format(base_url)
search_resp = requests.get(search_url, headers=search_headers, params=search_params)
print(search_resp.status_code)
tweet_data = search_resp.json()
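
For reference, here is a minimal sketch of paging through the results and keeping only unique tweets. It rests on two assumptions not shown in the snippet above: that the request is repeated in a loop, and that each response carries its tweets under `results` plus a `next` token whenever more pages exist. If the `next` token is not sent back with the following request, every call would likely return the same first page, which would produce this kind of duplication. The helper names (`iter_pages`, `unique_tweets`, `make_fetcher`) are hypothetical and exist only for illustration.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Optional


def iter_pages(fetch_page: Callable[[Optional[str]], dict]) -> Iterator[dict]:
    """Yield response pages, following the `next` token until it is absent."""
    next_token = None
    while True:
        page = fetch_page(next_token)
        yield page
        next_token = page.get('next')
        if not next_token:
            break


def unique_tweets(pages: Iterable[dict]) -> List[dict]:
    """Keep the first occurrence of each tweet, keyed by its `id_str`."""
    seen: Dict[str, dict] = {}
    for page in pages:
        for tweet in page.get('results', []):
            seen.setdefault(tweet['id_str'], tweet)
    return list(seen.values())


def make_fetcher(search_url: str, search_headers: dict, search_params: dict):
    """Build a fetch_page callable around requests.get (hypothetical helper)."""
    import requests  # imported lazily so the helpers above have no dependency

    def fetch_page(next_token: Optional[str]) -> dict:
        params = dict(search_params)
        if next_token:
            # Assumption: the token is passed back as the `next` parameter.
            params['next'] = next_token
        resp = requests.get(search_url, headers=search_headers, params=params)
        resp.raise_for_status()
        return resp.json()

    return fetch_page
```

With the variables from the snippet above, this could be called as `tweets = unique_tweets(iter_pages(make_fetcher(search_url, search_headers, search_params)))`. Deduplicating by `id_str` only hides the symptom; if the count of unique tweets still stops near 1,000, the pagination itself is the more likely thing to check.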