Search API "sampling": What kind of sampling?

search

#1

Following the recent changes to the Search API documentation: https://www.apichangelog.com/changes/2cf10ce4-813c-4c18-bef7-8e585e3d03d0

@edsu highlighted a key phrase in there: “Twitter Search API searches against a sampling of recent Tweets published in the past 7 days.”

It would be great if anyone from twitter could clarify or expand on this. Having a better sense of what’s indexed will definitely help avoid pitfalls & false assumptions when working with twitter data.

What exactly is meant by “sampling”? Does it use one of these methods: https://en.wikipedia.org/wiki/Sampling_(statistics) If so - Any details?

Or does sampling mean something else more complicated or something simpler? Something like "filter_level":"low" tweets less likely to be returned in Search API?

Any clarifications / corrections or relevant links would be great!