no code implementations • 11 Nov 2017 • Anna Potapenko, Artem Popov, Konstantin Vorontsov
We consider probabilistic topic models and more recent word embedding techniques from a perspective of learning hidden semantic representations.
no code implementations • RANLP 2019 • Maksim Eremeev, Konstantin Vorontsov
We use the reference corpus of texts and the quantile approach in order to determine what words are rare, and what frequencies are abnormal.
no code implementations • LREC 2020 • Victor Bulatov, Vasiliy Alekseev, Konstantin Vorontsov, Darya Polyudova, Eugenia Veselova, Alexey Goncharov, Evgeny Egorov
This paper introduces TopicNet, a new Python module for topic modeling.
no code implementations • ACL 2020 • Eugeniia Veselova, Konstantin Vorontsov
This article proposes a new approach for building topic models on unbalanced collections in topic modelling, based on the existing methods and our experiments with such methods.