Search Results for author: David Dale

Found 17 papers, 11 papers with code

ParaDetox: Detoxification with Parallel Data

1 code implementation ACL 2022 Varvara Logacheva, Daryna Dementieva, Sergey Ustyantsev, Daniil Moskovskiy, David Dale, Irina Krotova, Nikita Semenov, Alexander Panchenko

To the best of our knowledge, these are the first parallel datasets for this task. We describe our pipeline in detail to make it fast to set up for a new language or domain, thus contributing to faster and easier development of new parallel resources. We train several detoxification models on the collected data and compare them with several baselines and state-of-the-art unsupervised approaches.

Sentence

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector

1 code implementation10 Jan 2024 Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alex Mourachko, Christophe Ropers, Carleigh Wood

Research in toxicity detection in natural language processing for the speech modality (audio-based) is quite limited, particularly for languages other than English.

SpeechAlign: a Framework for Speech Translation Alignment Evaluation

no code implementations20 Sep 2023 Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale, Marta R. Costa-jussà

To contribute to these fields, we present SpeechAlign, a framework to evaluate the underexplored field of source-target alignment in speech models.

Speech-to-Text Translation Translation

Don't lose the message while paraphrasing: A study on content preserving style transfer

1 code implementation17 Aug 2023 Nikolay Babakov, David Dale, Ilya Gusev, Irina Krotova, Alexander Panchenko

Text style transfer techniques are gaining popularity in natural language processing allowing paraphrasing text in the required form: from toxic to neural, from formal to informal, from old to the modern English language, etc.

Style Transfer Text Style Transfer

The first neural machine translation system for the Erzya language

1 code implementation FieldMatters (COLING) 2022 David Dale

We present the first neural machine translation system for translation between the endangered Erzya language and Russian and the dataset collected by us to train and evaluate it.

Language Identification Machine Translation +3

Studying the role of named entities for content preservation in text style transfer

2 code implementations20 Jun 2022 Nikolay Babakov, David Dale, Varvara Logacheva, Irina Krotova, Alexander Panchenko

Text style transfer techniques are gaining popularity in Natural Language Processing, finding various applications such as text detoxification, sentiment, or formality transfer.

Style Transfer Text Style Transfer

Cannot find the paper you are looking for? You can Submit a new open access paper.