What Can We Learn From Almost a Decade of Food Tweets

10 Jul 2020  ·  Uga Sproģis, Matīss Rikters ·

We present the Latvian Twitter Eater Corpus - a set of tweets in the narrow domain related to food, drinks, eating and drinking. The corpus has been collected over time-span of over 8 years and includes over 2 million tweets entailed with additional useful data. We also separate two sub-corpora of question and answer tweets and sentiment annotated tweets. We analyse contents of the corpus and demonstrate use-cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models using data from the corpus.

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Sentiment Analysis Latvian Twitter Eater Sentiment Dataset Naive Bayes Accuracy 61.23 # 1

Methods


No methods listed for this paper. Add relevant methods here