1 code implementation • ACL 2022 • Varvara Logacheva, Daryna Dementieva, Sergey Ustyantsev, Daniil Moskovskiy, David Dale, Irina Krotova, Nikita Semenov, Alexander Panchenko
To the best of our knowledge, these are the first parallel datasets for this task. We describe our pipeline in detail to make it fast to set up for a new language or domain, thus contributing to faster and easier development of new parallel resources. We train several detoxification models on the collected data and compare them with several baselines and state-of-the-art unsupervised approaches.
1 code implementation • ACL 2022 • Daniil Moskovskiy, Daryna Dementieva, Alexander Panchenko
This work investigates multilingual and cross-lingual detoxification and the behavior of large multilingual models in this setting.
1 code implementation • 9 Jan 2024 • Alena Fenogenova, Artem Chervyakov, Nikita Martynov, Anastasia Kozlova, Maria Tikhonova, Albina Akhmetgareeva, Anton Emelyanov, Denis Shevelev, Pavel Lebedev, Leonid Sinev, Ulyana Isaeva, Katerina Kolomeytseva, Daniil Moskovskiy, Elizaveta Goncharova, Nikita Savushkin, Polina Mikhailova, Denis Dimitrov, Alexander Panchenko, Sergei Markov
To address these issues, we introduce an open Multimodal Evaluation of Russian-language Architectures (MERA), a new instruction benchmark for evaluating foundation models oriented towards the Russian language.
no code implementations • 23 Nov 2023 • Daryna Dementieva, Daniil Moskovskiy, David Dale, Alexander Panchenko
Text detoxification is the task of transferring the style of text from toxic to neutral.
1 code implementation • 5 Jun 2022 • Daniil Moskovskiy, Daryna Dementieva, Alexander Panchenko
However, models are not able to perform cross-lingual detoxification and direct fine-tuning on exact language is inevitable.
3 code implementations • 19 May 2021 • Daryna Dementieva, Daniil Moskovskiy, Varvara Logacheva, David Dale, Olga Kozlova, Nikita Semenov, Alexander Panchenko
We introduce the first study of automatic detoxification of Russian texts to combat offensive language.