Topic Models

209 papers with code • 6 benchmarks • 12 datasets

A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for the discovery of hidden semantic structures in a text body.

Benchmarks

Add a Result

These leaderboards are used to track progress in Topic Models

Dataset	Best Model	Compare
AG News	DeTiME	See all
20NewsGroups	vONTSS	See all
20 Newsgroups	Bayesian SMM	See all
Arxiv HEP-TH citation graph	JoSH	See all
NYT	JoSH	See all
AgNews	vONTSS	See all

Libraries

Use these libraries to find Topic Models models and implementations

mind-Lab/octis

3 papers

681

YongfeiYan/Neural-Document-Modeling

3 papers

ahoho/topics

3 papers

d2klab/tomodapi

3 papers

See all 5 libraries.

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

Uncovering Latent Themes of Messaging on Social Media by Integrating LLMs: A Case Study on Climate Campaigns

no code yet • 15 Mar 2024

Furthermore, this method efficiently maps the text and the newly discovered themes, enhancing our understanding of the thematic nuances in social media messaging.

Paper
Add Code

The Geometric Structure of Topic Models

no code yet • 6 Mar 2024

We introduce and demonstrate the applicability of our approach based on a topic model derived from a corpus of scientific papers taken from 32 top machine learning venues.

Paper
Add Code

Topic Modeling as Multi-Objective Contrastive Optimization

no code yet • 12 Feb 2024

Secondly, we explicitly cast contrastive topic modeling as a gradient-based multi-objective optimization problem, with the goal of achieving a Pareto stationary solution that balances the trade-off between the ELBO and the contrastive objective.

Paper
Add Code

RankSum An unsupervised extractive text summarization based on rank fusion

no code yet • 7 Feb 2024

In this paper, we propose Ranksum, an approach for extractive text summarization of single documents based on the rank fusion of four multi-dimensional sentence features extracted for each sentence: topic information, semantic content, significant keywords, and position.

Paper
Add Code

CFTM: Continuous time fractional topic model

no code yet • 29 Jan 2024

This approach incorporates fractional Brownian motion~(fBm) to effectively identify positive or negative correlations in topic and word distribution over time, revealing long-term dependency or roughness.

Paper
Add Code

Dynamic embedded topic models and change-point detection for exploring literary-historical hypotheses

no code yet • 25 Jan 2024

We present a novel combination of dynamic embedded topic models and change-point detection to explore diachronic change of lexical semantic modality in classical and early Christian Latin.

Paper
Add Code

Topic Modelling: Going Beyond Token Outputs

no code yet • 16 Jan 2024

The output is commonly a set of topics consisting of isolated tokens that often co-occur in such documents.

Paper
Add Code

Short-Form Videos and Mental Health: A Knowledge-Guided Neural Topic Model

no code yet • 11 Jan 2024

To prevent widespread consequences, platforms are eager to predict these videos' impact on viewers' mental health.

Paper
Add Code

Discovering Significant Topics from Legal Decisions with Selective Inference

no code yet • 2 Jan 2024

We propose and evaluate an automated pipeline for discovering significant topics from legal decision texts by passing features synthesized with topic models through penalised regressions and post-selection significance tests.

Paper
Add Code

Dynamic Topic Language Model on Heterogeneous Children's Mental Health Clinical Notes

no code yet • 19 Dec 2023

However, few topic models are built for longitudinal settings, and they fail to keep consistent topics and capture temporal trajectories for each document.

Paper
Add Code

Topic Models

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result