Short Text Clustering

14 papers with code • 8 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering

hmllmh/rstc 23 May 2023

To tackle the above issues, we propose a Robust Short Text Clustering (RSTC) model to improve robustness against imbalanced and noisy data.

7
23 May 2023

Twin Contrastive Learning for Online Clustering

Yunfan-Li/Twin-Contrastive-Learning 21 Oct 2022

Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively.

51
21 Oct 2022

EASE: Entity-Aware Contrastive Learning of Sentence Embedding

studio-ousia/ease NAACL 2022

We present EASE, a novel method for learning sentence embeddings via contrastive learning between sentences and their related entities.

54
09 May 2022

DECAF: Deep Extreme Classification with Label Features

Extreme-classification/DECAF 1 Aug 2021

This paper develops the DECAF algorithm that addresses these challenges by learning models enriched by label metadata that jointly learn model parameters and feature representations using deep networks and offer accurate classification at the scale of millions of labels.

52
01 Aug 2021

ECLARE: Extreme Classification with Label Graph Correlations

Extreme-classification/ECLARE 31 Jul 2021

This paper presents ECLARE, a scalable deep learning architecture that incorporates not only label text, but also label correlations, to offer accurate real-time predictions within a few milliseconds.

41
31 Jul 2021

Efficient Sparse Spherical k-Means for Document Clustering

johpro/esp-kmeans 30 Jul 2021

Spherical k-Means is frequently used to cluster document collections because it performs reasonably well in many settings and is computationally efficient.

2
30 Jul 2021

Supporting Clustering with Contrastive Learning

makcedward/nlpaug NAACL 2021

Unsupervised clustering aims at discovering the semantic categories of data according to some distance measured in the representation space.

4,298
24 Mar 2021

Discovering New Intents with Deep Aligned Clustering

thuiar/DeepAligned-Clustering 16 Dec 2020

In this work, we propose an effective method, Deep Aligned Clustering, to discover new intents with the aid of the limited known intent data.

114
16 Dec 2020

Intent Mining from past conversations for conversational agent

ajaychatterjee/IntentMining COLING 2020

In this paper, we present an intent discovery framework that involves 4 primary steps: Extraction of textual utterances from a conversation using a pre-trained domain agnostic Dialog Act Classifier (Data Extraction), automatic clustering of similar user utterances (Clustering), manual annotation of clusters with an intent label (Labeling) and propagation of intent labels to the utterances from the previous step, which are not mapped to any cluster (Label Propagation); to generate intent training data from raw conversations.

14
22 May 2020

Enhancement of Short Text Clustering by Iterative Classification

rashadulrakib/short-text-clustering-enhancement 31 Jan 2020

Short text clustering is a challenging task due to the lack of signal contained in such short texts.

30
31 Jan 2020