Text Classification

1107 papers with code • 93 benchmarks • 136 datasets

Text Classification is the task of assigning a sentence or document an appropriate category. The categories depend on the chosen dataset and can range from topics.

Text Classification problems include emotion classification, news classification, citation intent classification, among others. Benchmark datasets for evaluating text classification capabilities include GLUE, AGNews, among others.

In recent years, deep learning techniques like XLNet and RoBERTa have attained some of the biggest performance jumps for text classification problems.

( Image credit: Text Classification Algorithms: A Survey )

Libraries

Use these libraries to find Text Classification models and implementations

DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation

arumaekawa/dilm 30 Mar 2024

To address this issue, we propose a novel text dataset distillation approach, called Distilling dataset into Language Model (DiLM), which trains a language model to generate informative synthetic training samples as text data, instead of directly optimizing synthetic samples.

6
30 Mar 2024

HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification

rooooyy/hill 26 Mar 2024

Existing self-supervised methods in natural language processing (NLP), especially hierarchical text classification (HTC), mainly focus on self-supervised contrastive learning, extremely relying on human-designed augmentation rules to generate contrastive samples, which can potentially corrupt or distort the original information.

1
26 Mar 2024

LlamBERT: Large-scale low-cost data annotation in NLP

aielte-research/llambert 23 Mar 2024

Large Language Models (LLMs), such as GPT-4 and Llama 2, show remarkable proficiency in a wide range of natural language processing (NLP) tasks.

6
23 Mar 2024

SpikeGraphormer: A High-Performance Graph Transformer with Spiking Graph Attention

phd-lanyu/spikegraphormer 21 Mar 2024

In this work, we propose a novel insight into integrating SNNs with Graph Transformers and design a Spiking Graph Attention (SGA) module.

5
21 Mar 2024

SynerMix: Synergistic Mixup Solution for Enhanced Intra-Class Cohesion and Inter-Class Separability in Image Classification

wxitxy/complementary_intra-class_and_inter-class_mixup 21 Mar 2024

It also surpasses the top-performer of either Manifold MixUp or SynerMix-Intra by 0. 12% to 5. 16%, with an average gain of 1. 11%.

3
21 Mar 2024

Investigating Text Shortening Strategy in BERT: Truncation vs Summarization

mirzaalimm/truncationvssummarization 19 Mar 2024

In this study, we investigate the performance of document truncation and summarization in text classification tasks.

1
19 Mar 2024

Team Trifecta at Factify5WQA: Setting the Standard in Fact Verification with Fine-Tuning

andychiangsh/pre-cofactv3 15 Mar 2024

In this paper, we present Pre-CoFactv3, a comprehensive framework comprised of Question Answering and Text Classification components for fact verification.

3
15 Mar 2024

Defending Against Unforeseen Failure Modes with Latent Adversarial Training

thestephencasper/latent_adversarial_training 8 Mar 2024

Despite extensive diagnostics and debugging by developers, AI systems sometimes exhibit harmful unintended behaviors.

6
08 Mar 2024

RulePrompt: Weakly Supervised Text Classification with Prompting PLMs and Self-Iterative Logical Rules

miaomiaoli2/ruleprompt 5 Mar 2024

Weakly supervised text classification (WSTC), also called zero-shot or dataless text classification, has attracted increasing attention due to its applicability in classifying a mass of texts within the dynamic and open Web environment, since it requires only a limited set of seed words (label names) for each category instead of labeled data.

3
05 Mar 2024