Text Classification

486 papers with code • 37 benchmarks • 54 datasets

Text classification is the task of assigning a sentence or document an appropriate category. The categories depend on the chosen dataset and can range from topics.

( Image credit: Text Classification Algorithms: A Survey )

Greatest papers with code

FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP

zalandoresearch/flair NAACL 2019

We present FLAIR, an NLP framework designed to facilitate training and distribution of state-of-the-art sequence labeling, text classification and language models.

Text Classification

Ludwig: a type-based declarative deep learning toolbox

uber/ludwig 17 Sep 2019

In this work we present Ludwig, a flexible, extensible and easy to use toolbox which allows users to train deep learning models and use them for obtaining predictions without writing code.

Image Captioning Image Classification +12

StarSpace: Embed All The Things!

facebookresearch/ParlAI 12 Sep 2017

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Text Classification Word Embeddings

Simple Recurrent Units for Highly Parallelizable Recurrence

aymericdamien/TopDeepLearning EMNLP 2018

Common recurrent neural architectures scale poorly due to the intrinsic difficulty in parallelizing their state computations.

General Classification Machine Translation +2

LANGUAGE MODEL EMBEDDINGS IMPROVE SENTIMENT ANALYSIS IN RUSSIAN

deepmipt/DeepPavlov Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference “Dialogue 2019” 2019

In this paper we introduce pre-trained Russian language models which are used to extract embeddings (ELMo) to improve accuracy for classification of short conversational texts.

Language Modelling Sentiment Analysis +1

ERNIE-Doc: A Retrospective Long-Document Modeling Transformer

PaddlePaddle/ERNIE 31 Dec 2020

Transformers are not suited for processing long documents, due to their quadratically increasing memory and time consumption.

Document-level Language Modelling +2

Character-level Convolutional Networks for Text Classification

gaussic/text-classification-cnn-rnn NeurIPS 2015

This article offers an empirical exploration on the use of character-level convolutional networks (ConvNets) for text classification.

Classification General Classification +2

FNet: Mixing Tokens with Fourier Transforms

labmlai/annotated_deep_learning_paper_implementations 9 May 2021

We show that Transformer encoder architectures can be massively sped up, with limited accuracy costs, by replacing the self-attention sublayers with simple linear transformations that "mix" input tokens.

Linguistic Acceptability Machine Translation +5

The Natural Language Decathlon: Multitask Learning as Question Answering

salesforce/decaNLP ICLR 2019

Though designed for decaNLP, MQAN also achieves state of the art results on the WikiSQL semantic parsing task in the single-task setting.

Domain Adaptation Machine Translation +9