Search Results for author: Varvara Logacheva

Found 35 papers, 11 papers with code

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company’s Reputation

no code implementations • EACL (BSNLP) 2021 • Nikolay Babakov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labelling a dataset for appropriateness.

Evaluation of Taxonomy Enrichment on Diachronic WordNet Versions

no code implementations • EACL (GWC) 2021 • Irina Nikishina, Natalia Loukachevitch, Varvara Logacheva, Alexander Panchenko

The vast majority of the existing approaches for taxonomy enrichment apply word embeddings as they have proven to accumulate contexts (in a broad sense) extracted from texts which are sufficient for attaching orphan words to the taxonomy.

Word Embeddings

A Study on Manual and Automatic Evaluation for Text Style Transfer: The Case of Detoxification

no code implementations • HumEval (ACL) 2022 • Varvara Logacheva, Daryna Dementieva, Irina Krotova, Alena Fenogenova, Irina Nikishina, Tatiana Shavrina, Alexander Panchenko

It is often difficult to reliably evaluate models which generate text.

Style Transfer Text Style Transfer

RuPAWS: A Russian Adversarial Dataset for Paraphrase Identification

1 code implementation • LREC 2022 • Nikita Martynov, Irina Krotova, Varvara Logacheva, Alexander Panchenko, Olga Kozlova, Nikita Semenov

We compare it to the largest available dataset for Russian, ParaPhraser, and show that the best available paraphrase identifiers for the Russian language fail on the RuPAWS dataset.

Paraphrase Identification

A large-scale computational study of content preservation measures for text style transfer and paraphrase generation

1 code implementation • ACL 2022 • Nikolay Babakov, David Dale, Varvara Logacheva, Alexander Panchenko

In both tasks, the system is supposed to generate a text which should be semantically similar to the input text.

Paraphrase Generation Semantic Similarity +3

ParaDetox: Detoxification with Parallel Data

1 code implementation • ACL 2022 • Varvara Logacheva, Daryna Dementieva, Sergey Ustyantsev, Daniil Moskovskiy, David Dale, Irina Krotova, Nikita Semenov, Alexander Panchenko

To the best of our knowledge, these are the first parallel datasets for this task. We describe our pipeline in detail to make it fast to set up for a new language or domain, thus contributing to faster and easier development of new parallel resources. We train several detoxification models on the collected data and compare them with several baselines and state-of-the-art unsupervised approaches.

Sentence

Studying the role of named entities for content preservation in text style transfer

2 code implementations • 20 Jun 2022 • Nikolay Babakov, David Dale, Varvara Logacheva, Irina Krotova, Alexander Panchenko

Text style transfer techniques are gaining popularity in Natural Language Processing, finding various applications such as text detoxification, sentiment, or formality transfer.

Style Transfer Text Style Transfer

Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language

no code implementations • 4 Mar 2022 • Nikolay Babakov, Varvara Logacheva, Alexander Panchenko

Toxicity on the Internet, such as hate speech, offenses towards particular users or groups of people, or the use of obscene words, is an acknowledged problem.

Chatbot Cultural Vocal Bursts Intensity Prediction

Taxonomy Enrichment with Text and Graph Vector Representations

no code implementations • 21 Jan 2022 • Irina Nikishina, Mikhail Tikhomirov, Varvara Logacheva, Yuriy Nazarov, Alexander Panchenko, Natalia Loukachevitch

With the rapid growth of lexical resources for specific domains, the problem of automatic extension of the existing knowledge bases with new words is becoming more and more widespread.

Knowledge Graphs Word Embeddings

Text Detoxification using Large Pre-trained Neural Models

1 code implementation • EMNLP 2021 • David Dale, Anton Voronov, Daryna Dementieva, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We compare our models with a number of methods for style transfer.

Style Transfer

SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection

no code implementations • SEMEVAL 2021 • David Dale, Igor Markov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We show that fine-tuning a RoBERTa model for this problem is a strong baseline.

Sentence Toxic Spans Detection

Methods for Detoxification of Texts for the Russian Language

3 code implementations • 19 May 2021 • Daryna Dementieva, Daniil Moskovskiy, Varvara Logacheva, David Dale, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We introduce the first study of automatic detoxification of Russian texts to combat offensive language.

Style Transfer

Which is Better for Deep Learning: Python or MATLAB? Answering Comparative Questions in Natural Language

no code implementations • EACL 2021 • Viktoriia Chekalina, Alexander Bondarenko, Chris Biemann, Meriem Beloucif, Varvara Logacheva, Alexander Panchenko

in natural language.

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

1 code implementation • 9 Mar 2021 • Nikolay Babakov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labeling a dataset for appropriateness.

Studying Taxonomy Enrichment on Diachronic WordNet Versions

1 code implementation • COLING 2020 • Irina Nikishina, Alexander Panchenko, Varvara Logacheva, Natalia Loukachevitch

Ontologies, taxonomies, and thesauri are used in many NLP tasks.

RUSSE'2020: Findings of the First Taxonomy Enrichment Task for the Russian language

no code implementations • 22 May 2020 • Irina Nikishina, Varvara Logacheva, Alexander Panchenko, Natalia Loukachevitch

This paper describes the results of the first shared task on taxonomy enrichment for the Russian language.

Word Sense Disambiguation for 158 Languages using Word Embeddings Only

no code implementations • LREC 2020 • Varvara Logacheva, Denis Teslenko, Artem Shelmanov, Steffen Remus, Dmitry Ustalov, Andrey Kutuzov, Ekaterina Artemova, Chris Biemann, Simone Paolo Ponzetto, Alexander Panchenko

We use this method to induce a collection of sense inventories for 158 languages on the basis of the original pre-trained fastText word embeddings by Grave et al. (2018), enabling WSD in these languages.

Word Embeddings Word Sense Disambiguation

MIPT System for World-Level Quality Estimation

no code implementations • WS 2019 • Mikhail Mosyagin, Varvara Logacheva

We explore different model architectures for the WMT 19 shared task on word-level quality estimation of automatic translation.

Translation

The Second Conversational Intelligence Challenge (ConvAI2)

2 code implementations • 31 Jan 2019 • Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W. Black, Alexander Rudnicky, Jason Williams, Joelle Pineau, Mikhail Burtsev, Jason Weston

We describe the setting and results of the ConvAI2 NeurIPS competition that aims to further the state-of-the-art in open-domain chatbots.

Few-shot classification in Named Entity Recognition Task

1 code implementation • 14 Dec 2018 • Alexander Fritzler, Varvara Logacheva, Maksim Kretov

For many natural language processing (NLP) tasks the amount of annotated data is limited.

Classification General Classification +7

Robust Word Vectors: Context-Informed Embeddings for Noisy Texts

no code implementations • WS 2018 • Valentin Malykh, Varvara Logacheva, Taras Khakhulin

We suggest a new language-independent architecture of robust word vectors (RoVe).

Morphological Analysis Word Embeddings

Findings of the WMT 2018 Shared Task on Quality Estimation

no code implementations • WS 2018 • Lucia Specia, Frédéric Blain, Varvara Logacheva, Ramón Astudillo, André F. T. Martins

We report the results of the WMT18 shared task on Quality Estimation, i.e., the task of predicting the quality of the output of machine translation systems at various granularity levels: word, phrase, sentence and document.

Machine Translation Sentence +1

DeepPavlov: Open-Source Library for Dialogue Systems

no code implementations • ACL 2018 • Mikhail Burtsev, Alexander Seliverstov, Rafael Airapetyan, Mikhail Arkhipov, Dilyara Baymurzina, Nickolay Bushkov, Olga Gureenkova, Taras Khakhulin, Yuri Kuratov, Denis Kuznetsov, Alexey Litinsky, Varvara Logacheva, Alexey Lymar, Valentin Malykh, Maxim Petrov, Vadim Polulyakh, Leonid Pugachev, Alexey Sorokin, Maria Vikhreva, Marat Zaynutdinov

It supports modular as well as end-to-end approaches to implementation of conversational agents.

General Classification intent-classification +5

Findings of the 2017 Conference on Machine Translation (WMT17)

no code implementations • WS 2017 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Shu-Jian Huang, Matthias Huck, Philipp Koehn, Qun Liu, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Raphael Rubino, Lucia Specia, Marco Turchi

Automatic Post-Editing Multimodal Machine Translation +1

Metrics for Evaluation of Word-level Machine Translation Quality Estimation

no code implementations • ACL 2016 • Varvara Logacheva, Michal Lukasik, Lucia Specia

Machine Translation Translation

Findings of the 2016 Conference on Machine Translation

no code implementations • WS 2016 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri

Automatic Post-Editing Multimodal Machine Translation +1

USFD's Phrase-level Quality Estimation Systems

no code implementations • WS 2016 • Varvara Logacheva, Frédéric Blain, Lucia Specia

Machine Translation

Phrase Level Segmentation and Labelling of Machine Translation Errors

no code implementations • LREC 2016 • Frédéric Blain, Varvara Logacheva, Lucia Specia

This paper presents our work towards a novel approach for Quality Estimation (QE) of machine translation based on sequences of adjacent words, the so-called phrases.

Machine Translation Sentence +1

MARMOT: A Toolkit for Translation Quality Estimation at the Word Level

1 code implementation • LREC 2016 • Varvara Logacheva, Chris Hokamp, Lucia Specia

The tool has a set of state-of-the-art features for QE, and new features can easily be added.

Machine Translation Sentence +1

SHEF-NN: Translation Quality Estimation with Neural Networks

no code implementations • WS 2015 • Kashif Shah, Varvara Logacheva, Gustavo Paetzold, Frederic Blain, Daniel Beck, Fethi Bougares, Lucia Specia

Feature Engineering Language Modelling +4

Findings of the 2015 Workshop on Statistical Machine Translation

no code implementations • WS 2015 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, Marco Turchi

Automatic Post-Editing Translation

Data enhancement and selection strategies for the word-level Quality Estimation

no code implementations • WS 2015 • Varvara Logacheva, Chris Hokamp, Lucia Specia

Machine Translation

The role of artificially generated negative data for quality estimation of machine translation

no code implementations • WS 2015 • Varvara Logacheva, Lucia Specia

Machine Translation Translation

A Quality-based Active Sample Selection Strategy for Statistical Machine Translation

no code implementations • LREC 2014 • Varvara Logacheva, Lucia Specia

Our approach is based on a quality estimation technique which involves a wider range of features of the source text, automatic translation, and machine translation system compared to previous work.

Active Learning Machine Translation +3

Confidence-based Active Learning Methods for Machine Translation

no code implementations • WS 2014 • Varvara Logacheva, Lucia Specia

Active Learning Domain Adaptation +2
