Transformers are Short Text Classifiers: A Study of Inductive Short Text Classifiers on Benchmarks and Real-world Datasets

30 Nov 2022  ยท  Fabian Karl, Ansgar Scherp ยท

Short text classification is a crucial and challenging aspect of Natural Language Processing. For this reason, there are numerous highly specialized short text classifiers. However, in recent short text research, State of the Art (SOTA) methods for traditional text classification, particularly the pure use of Transformers, have been unexploited. In this work, we examine the performance of a variety of short text classifiers as well as the top performing traditional text classifier. We further investigate the effects on two new real-world short text datasets in an effort to address the issue of becoming overly dependent on benchmark datasets with a limited number of characteristics. Our experiments unambiguously demonstrate that Transformers achieve SOTA accuracy on short text classification tasks, raising the question of whether specialized short text techniques are necessary.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Text Classification MR BERT Accuracy 86.94 # 7
Text Classification MR RoBERTa Accuracy 89.42 # 5
Text Classification MR DeBERTa Accuracy 90.21 # 2
Text Classification MR ERNIE 2.0 Accuracy 88.97 # 6
Text Classification MR DistilBERT Accuracy 85.31 # 9
Text Classification MR ALBERTv2 Accuracy 86.02 # 8
Text Classification MR ERNIE 2.0 (optimized) Accuracy 89.53 # 4
Text Classification NICE-2 RoBERTa Accuracy 99.76 # 1
Text Classification NICE-45 BERT Accuracy 72.79 # 1
Text Classification R8 DeBERTa Accuracy 98.451 # 1
Text Classification R8 BERT Accuracy 98.171 # 5
Text Classification R8 ESGNN Accuracy 98.23 # 3
Text Classification R8 C-BERT (ESGNN + BERT) Accuracy 98.28 # 2
Text Classification R8 SGNN Accuracy 98.09 # 6
Text Classification R8 fastText Accuracy 96.13 # 21
Text Classification R8 WideMLP Accuracy 96.98 # 18
Text Classification R8 ALBERTv2 Accuracy 97.62 # 10
Text Classification R8 DistilBERT Accuracy 97.981 # 8
Text Classification R8 ERNIE 2.0 Accuracy 98.041 # 7
Text Classification Searchsnippets DistilBERT Accuracy 89.69 # 1
Text Classification Searchsnippets BERT Accuracy 88.2 # 2
Text Classification SST-2 BERT Accuracy 91.37 # 2
Text Classification SST-2 DeBERTa Accuracy 94.78 # 1
Text Classification STOPS-2 ERNIE 2.0 STOPS-2 99.88 # 1
Text Classification STOPS-41 DeBERTa Accuracy 89.73 # 1
Text Classification TREC-10 BERT Accuracy 99.40 # 1
Text Classification Twitter ERNIE 2.0 Accuracy 99.97 # 1
Text Classification Twitter BERT Accuracy 99.96 # 2
Text Classification Twitter DistilBERT Accuracy 99.96 # 2

Methods


No methods listed for this paper. Add relevant methods here