Investigating Shallow and Deep Learning Techniques for Emotion Classification in Short Persian Texts

The identification of emotions in short texts of low-resource languages poses a significant challenge, requiring specialized frameworks and computational intelligence techniques. This paper presents a comprehensive exploration of shallow and deep learning methods for emotion detection in short Persian texts. Shallow learning methods employ feature extraction and dimension reduction to enhance classification accuracy. On the other hand, deep learning methods utilize transfer learning and word embedding, particularly BERT, to achieve high classification accuracy. A Persian dataset called "ShortPersianEmo" is introduced to evaluate the proposed methods, comprising 5472 diverse short Persian texts labeled in five main emotion classes. The evaluation results demonstrate that transfer learning and BERT-based text embedding perform better in accurately classifying short Persian texts than alternative approaches. The dataset of this study ShortPersianEmo will be publicly available online at https://github.com/vkiani/ShortPersianEmo.

PDF Abstract

Datasets


Introduced in the Paper:

ShortPersianEmo

Used in the Paper:

ArmanEmo

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Emotion Classification ArmanEmo Deep ParsBERT Macro F1 0.65 # 1
Emotion Classification ShortPersianEmo Deep ParsBERT Macro F1 0.71 # 1

Methods