Short Text Clustering

14 papers with code • 8 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Short Text Clustering

Dataset	Best Model	Compare
Stackoverflow	TCL	See all
Searchsnippets	SCCL	See all
Biomedical	TCL	See all
AG News	SCCL	See all
GoogleNews-TS	SCCL	See all
GoogleNews-T	SCCL	See all
GoogleNews-S	SCCL	See all
Tweet	SCCL	See all

Datasets

AG News
MNAD

Latest papers

Most implemented Social Latest No code

Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering

hmllmh/rstc • • 23 May 2023

To tackle the above issues, we propose a Robust Short Text Clustering (RSTC) model to improve robustness against imbalanced and noisy data.

23 May 2023

Paper
Code

Twin Contrastive Learning for Online Clustering

Yunfan-Li/Twin-Contrastive-Learning • • 21 Oct 2022

Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively.

21 Oct 2022

Paper
Code

EASE: Entity-Aware Contrastive Learning of Sentence Embedding

studio-ousia/ease • • NAACL 2022

We present EASE, a novel method for learning sentence embeddings via contrastive learning between sentences and their related entities.

09 May 2022

Paper
Code

DECAF: Deep Extreme Classification with Label Features

Extreme-classification/DECAF • • 1 Aug 2021

This paper develops the DECAF algorithm that addresses these challenges by learning models enriched by label metadata that jointly learn model parameters and feature representations using deep networks and offer accurate classification at the scale of millions of labels.

01 Aug 2021

Paper
Code

ECLARE: Extreme Classification with Label Graph Correlations

Extreme-classification/ECLARE • • 31 Jul 2021

This paper presents ECLARE, a scalable deep learning architecture that incorporates not only label text, but also label correlations, to offer accurate real-time predictions within a few milliseconds.

31 Jul 2021

Paper
Code

Efficient Sparse Spherical k-Means for Document Clustering

johpro/esp-kmeans • 30 Jul 2021

Spherical k-Means is frequently used to cluster document collections because it performs reasonably well in many settings and is computationally efficient.

30 Jul 2021

Paper
Code

Supporting Clustering with Contrastive Learning

makcedward/nlpaug • • NAACL 2021

Unsupervised clustering aims at discovering the semantic categories of data according to some distance measured in the representation space.

4,298

24 Mar 2021

Paper
Code

Discovering New Intents with Deep Aligned Clustering

thuiar/DeepAligned-Clustering • • 16 Dec 2020

In this work, we propose an effective method, Deep Aligned Clustering, to discover new intents with the aid of the limited known intent data.

114

16 Dec 2020

Paper
Code

Intent Mining from past conversations for conversational agent

ajaychatterjee/IntentMining • • COLING 2020

In this paper, we present an intent discovery framework that involves 4 primary steps: Extraction of textual utterances from a conversation using a pre-trained domain agnostic Dialog Act Classifier (Data Extraction), automatic clustering of similar user utterances (Clustering), manual annotation of clusters with an intent label (Labeling) and propagation of intent labels to the utterances from the previous step, which are not mapped to any cluster (Label Propagation); to generate intent training data from raw conversations.

22 May 2020

Paper
Code

Enhancement of Short Text Clustering by Iterative Classification

rashadulrakib/short-text-clustering-enhancement • 31 Jan 2020

Short text clustering is a challenging task due to the lack of signal contained in such short texts.

31 Jan 2020

Paper
Code

Short Text Clustering

Benchmarks Add a Result

Datasets

Latest papers

Content

Benchmarks

Add a Result