We are having the same issues with the streaming api (with the encoding issues).
If I search for 障害, I get a lot of tweets per minute. If I start to track that word via the streaming api, I only get one or two per minute.
I’m streaming with the tracking parameter:
track=%E9%9A%9C%E5%AE%B3&language=ja
(eg: the same character set as twitter search)
I’m only receiving tweets with the given characters surrounded by spaces or newlines, like:
https://twitter.com/amaama69/status/357754192057733120
or
It looks like Twitter is filtering the hashtags, because sometimes I do receive a tweet with a hashtag:
But I never receive tweets which have the two characters in the middle of a sentence. As JA doesn’t use spaces the way latin languages do, it’s kind of a big deal.
I’m not a native JA speaker, but I seem to have the same problem with the German encoding and other languages (eg: Russian).