1 code implementation • 6 Mar 2024 • Arik Reuter, Anton Thielmann, Christoph Weisser, Benjamin Säfken, Thomas Kneib
With the rise of transformers in Natural Language Processing, however, several successful models that rely on straightforward clustering approaches in transformer-based embedding spaces have emerged and consolidated the notion of topics as clusters of embedding vectors.
no code implementations • 6 Mar 2024 • Arik Reuter, Anton Thielmann, Christoph Weisser, Sebastian Fischer, Benjamin Säfken
Topic modeling seems to be almost synonymous with generating lists of top words to represent topics within large text corpora.
no code implementations • 30 Mar 2023 • Anton Thielmann, Quentin Seifert, Arik Reuter, Elisabeth Bergherr, Benjamin Säfken
We demonstrate the competitive performance of our method with a large benchmark study, and achieve superior results compared to state-of-the-art topic modeling and document clustering models.