1 code implementation • EACL 2021 • Evgeny Lagutin, Daniil Gavrilov, Pavel Kalaidin
Likelihood training and maximization-based decoding result in dull and repetitive generated texts even when using powerful language models (Holtzman et al., 2019).
no code implementations • EMNLP (ALW) 2020 • Nadezhda Zueva, Madina Kabirova, Pavel Kalaidin
Toxicity has become a grave problem for many online communities and has been growing across many languages, including Russian.
no code implementations • 14 Oct 2020 • Artem Chumachenko, Daniil Gavrilov, Nikita Balagansky, Pavel Kalaidin
We also proposed a variant of Weight Squeezing called Gated Weight Squeezing, for which we combined fine-tuning of BERT-Medium model and learning mapping from BERT-Base weights.
no code implementations • 23 Jan 2019 • Daniil Gavrilov, Pavel Kalaidin, Valentin Malykh
Headline generation is a special type of text summarization task.