The streaming API delivers up to roughly 1% of all the data on the Twitter firehose at any one time. If you use the filter endpoint to track a term that accounts for less than 1% of the firehose, you should receive all of the matching Tweets, although delivery can still depend on load.
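To make the 1% threshold concrete, here is a rough sketch of the arithmetic. It is a simplified model rather than how the streaming API is actually implemented: the function name `delivered_fraction`, the per-second rates, and the flat 1% cap are all assumptions chosen for illustration, and the model ignores load.

```python
def delivered_fraction(matching_per_sec: float,
                       firehose_per_sec: float,
                       cap: float = 0.01) -> float:
    """Estimate the share of matching Tweets a filtered stream delivers.

    Hypothetical model: the stream may carry at most `cap` (1%) of the
    firehose volume. Anything matching beyond that allowance is dropped.
    """
    if matching_per_sec <= 0:
        return 1.0  # nothing matches, so nothing can be missed
    allowed_per_sec = cap * firehose_per_sec
    return min(1.0, allowed_per_sec / matching_per_sec)


# Illustrative numbers only, not real Twitter volumes.
niche_term = delivered_fraction(matching_per_sec=30, firehose_per_sec=6000)
popular_term = delivered_fraction(matching_per_sec=120, firehose_per_sec=6000)
```

Under these made-up numbers the cap is 60 Tweets per second, so the niche term (30 per second) comes through in full, while the popular term (120 per second) would be cut to about half.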
The search API has a limited index and may not include, for example: Tweets that are withheld in certain jurisdictions; Tweets from very new users; and Tweets, users, hashtags, or source apps that have been marked as spammy or abusive by our algorithms.
So, it is difficult to provide a specific percentage variance based on the question you’re asking, but that hopefully clarifies the difference.
The 30-day and full-archive search options from Gnip are not subject to all of the same constraints, and neither is PowerTrack. These APIs are commercial offerings and offer complete access to the data.