I want to release a data set for training Twitter sentiment analysis algorithms. It consists of ~4500 sentiment-labeled tweets and would be publicly available, free of charge.
However, I’m afraid that releasing the data may violate the Twitter API ToS, specifically term 4A.
Each record in the data set contains:
- tweet text
- tweet creation date
- hand-curated tweet topic
- hand-curated sentiment label: “positive”, “neutral”, “negative”, or “irrelevant”
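To make the structure concrete, here is a rough sketch of how one record could be represented in Python. The class name and field names are made up for illustration and are not the actual file format; the values are placeholders, not real data. Only the four label values come from the list above.

```python
from dataclasses import dataclass
from datetime import datetime

# The four label values are from the list above; everything else here
# (class name, field names) is a hypothetical illustration.
SENTIMENT_LABELS = {"positive", "neutral", "negative", "irrelevant"}


@dataclass(frozen=True)
class LabeledTweet:
    text: str             # tweet text
    created_at: datetime  # tweet creation date
    topic: str            # hand-curated topic
    sentiment: str        # hand-curated sentiment label

    def __post_init__(self):
        # Reject labels outside the four allowed values.
        if self.sentiment not in SENTIMENT_LABELS:
            raise ValueError(f"unknown sentiment label: {self.sentiment!r}")


# Placeholder values only; this is not a row from the data set.
example = LabeledTweet(
    text="<tweet text goes here>",
    created_at=datetime(2011, 1, 1, 12, 0, 0),
    topic="<topic>",
    sentiment="neutral",
)
```

Note that the tweet text itself is part of every record, which is the part I am most unsure about with respect to the ToS.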
Does releasing this data set violate the ToS? Is there anyone I can contact for permission to release it?