FinnSentiment -- A Finnish Social Media Corpus for Sentiment Polarity Annotation

4 Dec 2020  ·  Krister Lindén, Tommi Jauhiainen, Sam Hardwick ·

Sentiment analysis and opinion mining is an important task with obvious application areas in social media, e.g. when indicating hate speech and fake news. In our survey of previous work, we note that there is no large-scale social media data set with sentiment polarity annotations for Finnish. This publications aims to remedy this shortcoming by introducing a 27,000 sentence data set annotated independently with sentiment polarity by three native annotators. We had the same three annotators for the whole data set, which provides a unique opportunity for further studies of annotator behaviour over time. We analyse their inter-annotator agreement and provide two baselines to validate the usefulness of the data set.

PDF Abstract

Datasets


Introduced in the Paper:

FinnSentiment

Used in the Paper:

MPQA Opinion Corpus OpenSubtitles

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods


No methods listed for this paper. Add relevant methods here