About

Text classification is the task of assigning a sentence or document an appropriate category. The categories depend on the chosen dataset and can range from topics.

( Image credit: Text Classification Algorithms: A Survey )

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Subtasks

Datasets

Greatest papers with code

Adversarial Training Methods for Semi-Supervised Text Classification

25 May 2016tensorflow/models

Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting.

CLASSIFICATION SEMI SUPERVISED TEXT CLASSIFICATION SEMI-SUPERVISED TEXT CLASSIFICATION SENTIMENT ANALYSIS WORD EMBEDDINGS

Semi-supervised Sequence Learning

NeurIPS 2015 tensorflow/models

In our experiments, we find that long short term memory recurrent networks after being pretrained with the two approaches are more stable and generalize better.

LANGUAGE MODELLING TEXT CLASSIFICATION

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

NeurIPS 2020 huggingface/transformers

With the success of language pretraining, it is highly desirable to develop more efficient architectures of good scalability that can exploit the abundant unlabeled data at a lower cost.

READING COMPREHENSION TEXT CLASSIFICATION

FlauBERT: Unsupervised Language Model Pre-training for French

LREC 2020 huggingface/transformers

Language models have become a key step to achieve state-of-the art results in many different Natural Language Processing (NLP) tasks.

LANGUAGE MODELLING NATURAL LANGUAGE INFERENCE TEXT CLASSIFICATION WORD SENSE DISAMBIGUATION

XLNet: Generalized Autoregressive Pretraining for Language Understanding

NeurIPS 2019 huggingface/transformers

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling.

DOCUMENT RANKING HUMOR DETECTION LANGUAGE MODELLING NATURAL LANGUAGE INFERENCE PARAPHRASE IDENTIFICATION QUESTION ANSWERING READING COMPREHENSION SENTIMENT ANALYSIS TEXT CLASSIFICATION

FastText.zip: Compressing text classification models

12 Dec 2016facebookresearch/fastText

We consider the problem of producing compact architectures for text classification, such that the full model fits in a limited amount of memory.

CLASSIFICATION QUANTIZATION TEXT CLASSIFICATION WORD EMBEDDINGS

Universal Language Model Fine-tuning for Text Classification

ACL 2018 fastai/fastai

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch.

CLASSIFICATION LANGUAGE MODELLING SENTIMENT ANALYSIS TEXT CLASSIFICATION TRANSFER LEARNING