Search Results for author: Elena Voita

Found 20 papers, 13 papers with code

BPE-Dropout: Simple and Effective Subword Regularization

7 code implementations • ACL 2020 • Ivan Provilkov, Dmitrii Emelianenko, Elena Voita

Subword segmentation is widely used to address the open vocabulary problem in machine translation.

Ranked #1 on Machine Translation on IWSLT2017 French-English

Machine Translation, Segmentation (+1 more)

9,441

Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned

1 code implementation • ACL 2019 • Elena Voita, David Talbot, Fedor Moiseev, Rico Sennrich, Ivan Titov

Multi-head self-attention is a key component of the Transformer, a state-of-the-art architecture for neural machine translation.

Machine Translation, Translation

280

Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation

1 code implementation • ACL 2021 • Elena Voita, Rico Sennrich, Ivan Titov

We find that models trained with more data tend to rely more on source information and to have sharper token contributions; the training process is non-monotonic, with several stages of different nature.

Language Modelling, Machine Translation (+2 more)

280

LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models

1 code implementation • 10 Apr 2024 • Igor Tufanov, Karen Hambardzumyan, Javier Ferrando, Elena Voita

We present the LM Transparency Tool (LM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models.

Decision Making

249

Information Flow Routes: Automatically Interpreting Language Models at Scale

1 code implementation • 27 Feb 2024 • Javier Ferrando, Elena Voita

These routes can be represented as graphs where nodes correspond to token representations and edges to operations inside the network.

249

HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation

1 code implementation • 19 May 2023 • David Dale, Elena Voita, Janice Lam, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Loïc Barrault, Marta R. Costa-jussà

Hallucinations in machine translation are translations that contain information completely unrelated to the input.

Hallucination, Machine Translation (+2 more)

238

When a Good Translation is Wrong in Context: Context-Aware Machine Translation Improves on Deixis, Ellipsis, and Lexical Cohesion

1 code implementation • ACL 2019 • Elena Voita, Rico Sennrich, Ivan Titov

Though machine translation errors caused by the lack of context beyond one sentence have long been acknowledged, the development of context-aware NMT systems is hampered by several problems.

Machine Translation, NMT (+2 more)

Context-Aware Monolingual Repair for Neural Machine Translation

1 code implementation • IJCNLP 2019 • Elena Voita, Rico Sennrich, Ivan Titov

For training, the DocRepair model requires only monolingual document-level data in the target language.

Automatic Post-Editing, NMT (+2 more)

Information-Theoretic Probing with Minimum Description Length

2 code implementations • EMNLP 2020 • Elena Voita, Ivan Titov

Instead, we propose an alternative to the standard probes, information-theoretic probing with minimum description length (MDL).

Embedding Words in Non-Vector Space with Unsupervised Graph Learning

1 code implementation • EMNLP 2020 • Max Ryabinin, Sergei Popov, Liudmila Prokhorenkova, Elena Voita

We adopt a recent method learning a representation of data in the form of a differentiable weighted graph and use it to modify the GloVe training algorithm.

Graph Learning, Word Embeddings (+1 more)

A Large-Scale Test Set for the Evaluation of Context-Aware Pronoun Translation in Neural Machine Translation

1 code implementation • WS 2018 • Mathias Müller, Annette Rios, Elena Voita, Rico Sennrich

We show that, while gains in BLEU are moderate for those systems, they outperform baselines by a large margin in terms of accuracy on our contrastive test set.

Machine Translation, Sentence (+1 more)

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation

2 code implementations • 10 Aug 2022 • Nuno M. Guerreiro, Elena Voita, André F. T. Martins

Although the problem of hallucinations in neural machine translation (NMT) has received some attention, research on this highly pathological phenomenon lacks solid ground.

Machine Translation, NMT

Sequence Modeling with Unconstrained Generation Order

1 code implementation • NeurIPS 2019 • Dmitrii Emelianenko, Elena Voita, Pavel Serdyukov

The dominant approach to sequence generation is to produce a sequence in some predefined order, e.g., left to right.

Image Captioning, Machine Translation (+1 more)

Context-Aware Neural Machine Translation Learns Anaphora Resolution

no code implementations • ACL 2018 • Elena Voita, Pavel Serdyukov, Rico Sennrich, Ivan Titov

Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence.

Machine Translation, Translation

The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives

no code implementations • IJCNLP 2019 • Elena Voita, Rico Sennrich, Ivan Titov

In this work, we use canonical correlation analysis and mutual information estimators to study how information flows across Transformer layers and how this process depends on the choice of learning objective.

Language Modelling, Machine Translation (+2 more)

Unsupervised Discovery of Interpretable Latent Manipulations in Language VAEs

no code implementations • 1 Jan 2021 • Max Ryabinin, Artem Babenko, Elena Voita

In this work, we make the first step towards unsupervised discovery of interpretable directions in language latent spaces.

Sentence, Text Generation

Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT

no code implementations • EMNLP 2021 • Elena Voita, Rico Sennrich, Ivan Titov

Unlike traditional statistical MT, which decomposes the translation task into distinct, separately learned components, neural machine translation uses a single neural network to model the entire translation process.

Language Modelling, Machine Translation (+4 more)

Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better

no code implementations • 16 Dec 2022 • David Dale, Elena Voita, Loïc Barrault, Marta R. Costa-jussà

We propose to use a method that evaluates the percentage of the source contribution to a generated translation.

Machine Translation, Sentence (+2 more)

Neurons in Large Language Models: Dead, N-gram, Positional

no code implementations • 9 Sep 2023 • Elena Voita, Javier Ferrando, Christoforos Nalmpantis

Specifically, we focus on the OPT family of models ranging from 125m to 66b parameters and rely only on whether an FFN neuron is activated or not.

Position

Know When To Stop: A Study of Semantic Drift in Text Generation

no code implementations • 8 Apr 2024 • Ava Spataru, Eric Hambro, Elena Voita, Nicola Cancedda

Overall, our methods generalize and can be applied to any long-form text generation to produce more reliable information, by balancing trade-offs between factual accuracy, information quantity and computational cost.
