Search Results for author: Elena Voita

Found 20 papers, 13 papers with code

Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation

1 code implementation ACL 2021 Elena Voita, Rico Sennrich, Ivan Titov

We find that models trained with more data tend to rely on source information more and to have more sharp token contributions; the training process is non-monotonic with several stages of different nature.

Language Modelling Machine Translation +2

LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models

1 code implementation10 Apr 2024 Igor Tufanov, Karen Hambardzumyan, Javier Ferrando, Elena Voita

We present the LM Transparency Tool (LM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models.

Decision Making

Information Flow Routes: Automatically Interpreting Language Models at Scale

1 code implementation27 Feb 2024 Javier Ferrando, Elena Voita

These routes can be represented as graphs where nodes correspond to token representations and edges to operations inside the network.

When a Good Translation is Wrong in Context: Context-Aware Machine Translation Improves on Deixis, Ellipsis, and Lexical Cohesion

1 code implementation ACL 2019 Elena Voita, Rico Sennrich, Ivan Titov

Though machine translation errors caused by the lack of context beyond one sentence have long been acknowledged, the development of context-aware NMT systems is hampered by several problems.

Machine Translation NMT +2

Information-Theoretic Probing with Minimum Description Length

2 code implementations EMNLP 2020 Elena Voita, Ivan Titov

Instead, we propose an alternative to the standard probes, information-theoretic probing with minimum description length (MDL).

Embedding Words in Non-Vector Space with Unsupervised Graph Learning

1 code implementation EMNLP 2020 Max Ryabinin, Sergei Popov, Liudmila Prokhorenkova, Elena Voita

We adopt a recent method learning a representation of data in the form of a differentiable weighted graph and use it to modify the GloVe training algorithm.

Graph Learning Word Embeddings +1

A Large-Scale Test Set for the Evaluation of Context-Aware Pronoun Translation in Neural Machine Translation

1 code implementation WS 2018 Mathias Müller, Annette Rios, Elena Voita, Rico Sennrich

We show that, while gains in BLEU are moderate for those systems, they outperform baselines by a large margin in terms of accuracy on our contrastive test set.

Machine Translation Sentence +1

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation

2 code implementations10 Aug 2022 Nuno M. Guerreiro, Elena Voita, André F. T. Martins

Although the problem of hallucinations in neural machine translation (NMT) has received some attention, research on this highly pathological phenomenon lacks solid ground.

Machine Translation NMT

Sequence Modeling with Unconstrained Generation Order

1 code implementation NeurIPS 2019 Dmitrii Emelianenko, Elena Voita, Pavel Serdyukov

The dominant approach to sequence generation is to produce a sequence in some predefined order, e. g. left to right.

Image Captioning Machine Translation +1

Context-Aware Neural Machine Translation Learns Anaphora Resolution

no code implementations ACL 2018 Elena Voita, Pavel Serdyukov, Rico Sennrich, Ivan Titov

Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence.

Machine Translation Translation

The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives

no code implementations IJCNLP 2019 Elena Voita, Rico Sennrich, Ivan Titov

In this work, we use canonical correlation analysis and mutual information estimators to study how information flows across Transformer layers and how this process depends on the choice of learning objective.

Language Modelling Machine Translation +2

Unsupervised Discovery of Interpretable Latent Manipulations in Language VAEs

no code implementations1 Jan 2021 Max Ryabinin, Artem Babenko, Elena Voita

In this work, we make the first step towards unsupervised discovery of interpretable directions in language latent spaces.

Sentence Text Generation

Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT

no code implementations EMNLP 2021 Elena Voita, Rico Sennrich, Ivan Titov

Differently from the traditional statistical MT that decomposes the translation task into distinct separately learned components, neural machine translation uses a single neural network to model the entire translation process.

Language Modelling Machine Translation +4

Neurons in Large Language Models: Dead, N-gram, Positional

no code implementations9 Sep 2023 Elena Voita, Javier Ferrando, Christoforos Nalmpantis

Specifically, we focus on the OPT family of models ranging from 125m to 66b parameters and rely only on whether an FFN neuron is activated or not.

Position

Know When To Stop: A Study of Semantic Drift in Text Generation

no code implementations8 Apr 2024 Ava Spataru, Eric Hambro, Elena Voita, Nicola Cancedda

Overall, our methods generalize and can be applied to any long-form text generation to produce more reliable information, by balancing trade-offs between factual accuracy, information quantity and computational cost.

Semantic Similarity Semantic Textual Similarity +1

Cannot find the paper you are looking for? You can Submit a new open access paper.