no code implementations • 6 Jul 2023 • Weijie Xu, Jay Desai, Srinivasan Sengamedu, Xiaoyu Jiang, Francis Iannacci
Across a variety of datasets, S2vNTM outperforms existing semi-supervised topic modeling methods in classification accuracy with limited keywords provided.
no code implementations • 4 Jul 2023 • Weijie Xu, Xiaoyu Jiang, Jay Desai, Bin Han, Fuqin Yan, Francis Iannacci
In text classification tasks, fine tuning pretrained language models like BERT and GPT-3 yields competitive accuracy; however, both methods require pretraining on large text datasets.
1 code implementation • 3 Jul 2023 • Weijie Xu, Xiaoyu Jiang, Srinivasan H. Sengamedu, Francis Iannacci, Jinjin Zhao
Recently, Neural Topic Models (NTM), inspired by variational autoencoders, have attracted a lot of research interest; however, these methods have limited applications in the real world due to the challenge of incorporating human knowledge.
Ranked #1 on Topic Models on 20NewsGroups
no code implementations • 30 Jun 2023 • Weijie Xu, Jinjin Zhao, Francis Iannacci, Bo wang
Generative modeling has been used frequently in synthetic data generation.