I have a problem with the “lang” parameter for the Twitter premium API.
My goal was to download a list of Finnish tweets, so for this reason i specified lang=fi .
However, I ended up with more or less half tweets in other languages:
- some in hungarian
- some in estonian
- some in english as well
- some with mixed finnish and something else (which i can accept)
So, do you have any suggestion here on how to improve the quality of the results? Some parameter I can add to the query maybe?
Another question: is there a way to avoid very short tweets? like for example tweets with only one or two words. I could not find a parameter for the tweet size but maybe I am wrong here.