Just to clarify, what do you mean by “[…] tweets are from user ids […]”? It reads a bit as if a tweet is from User 123 if ['user']['id_str']=123. However, as far as I understand the API documentation, you will also get tweets that are replies to User 123 or retweets of tweets by User 123. If you only look at ['user']['id_str'], the 50% sounds fine to me. But I assume you also checked the other fields like ['retweeted_status']['user']['id_str'] or ['in_reply_to_user_id_str'].
Admittedly, I’m not even really sure if I understand the API documentation [1] correctly. For the follow parameter it says that you get “Replies to any Tweet created by the user.” To me this is ambiguous: either the user wrote the reply, or the user wrote the tweet that someone has replied to. Looking at my data, I assume it’s the latter, since I have tweets with, for example, ['user']['id_str']=XXX and ['retweeted_status']['user']['id_str']=1234, with User 1234 being one of the users I follow.
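To make the distinction concrete, here is a minimal sketch of how one could check all three fields at once. It assumes each tweet has already been parsed from the stream's JSON into a Python dict; the names match_reasons and followed_ids are made up for illustration, and the labels it returns are arbitrary.

```python
def match_reasons(tweet, followed_ids):
    """Return the ways in which a tweet dict relates to a followed user ID.

    `followed_ids` is assumed to be a set of user IDs as strings.
    """
    reasons = []

    # Tweet (or retweet) posted by a followed user.
    author = tweet.get("user") or {}
    if author.get("id_str") in followed_ids:
        reasons.append("author")

    # Retweet of a tweet that a followed user originally wrote.
    retweeted = tweet.get("retweeted_status") or {}
    original_author = retweeted.get("user") or {}
    if original_author.get("id_str") in followed_ids:
        reasons.append("retweet_of")

    # Reply addressed to a followed user (the field is null for non-replies).
    if tweet.get("in_reply_to_user_id_str") in followed_ids:
        reasons.append("reply_to")

    return reasons


# Hypothetical tweet shaped like the example above.
example = {
    "user": {"id_str": "999"},
    "retweeted_status": {"user": {"id_str": "1234"}},
    "in_reply_to_user_id_str": None,
}
print(match_reasons(example, {"1234"}))
```

Counting how many tweets come back with an empty list would show how much of the stream matches none of the three fields.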
[1] https://dev.twitter.com/streaming/overview/request-parameters