Hello.
When I extract data using the Full Archive endpoint of the Premium API, even if the query is exactly the same, the data may differ between the first and second extractions after a period of time (about 2 days in this case).
There are two possible reasons for this, but I don’t understand one of them.
I would appreciate your advice.
(1) Differences due to the fact that some posts have already been deleted at the time of the second extraction and the extraction decreases.
(2) The difference due to the fact that there are tweets that were not taken in the first extraction and the extraction will increase during the second extraction.
Is it possible for number 2 to happen with the Full Archive endpoint of the Premium API?
Also, what are the factors that cause the data to increase in the same query?
(2) The difference due to the fact that there are tweets that were not taken in the first extraction and the extraction will increase during the second extraction.
One example of how this might happen: If a user has some tweets that match your query, but then locks their account - these tweets will no longer be retrievable. Then after some time they unlock their account, and their tweets are once again retrievable.
Depends on the size of the difference - if it’s a huge one, maybe the issue is that they updated a search index or something?
1 Like
I understand clearly now.
It seems that there are many factors, such as temporary account restrictions and public/private accounts.
Thank you very much.
1 Like
system
Closed
#4
This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.