Search Results for author: Eleftheria Briakou

Found 21 papers, 8 papers with code

Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation

no code implementations 16 Apr 2025 Julia Kreutzer, Eleftheria Briakou, Sweta Agrawal, Marzieh Fadaee, Tom Kocmi

Generation capabilities and language coverage of multilingual large language models (mLLMs) are advancing rapidly.

Machine Translation

WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

1 code implementation 18 Feb 2025 Daniel Deutsch, Eleftheria Briakou, Isaac Caswell, Mara Finkelstein, Rebecca Galor, Juraj Juraska, Geza Kovacs, Alison Lui, Ricardo Rei, Jason Riesa, Shruti Rijhwani, Parker Riley, Elizabeth Salesky, Firas Trabelsi, Stephanie Winkler, Biao Zhang, Markus Freitag

As large language models (LLMs) become more and more capable in languages other than English, it is important to collect benchmark datasets in order to evaluate their multilingual performance, including on tasks like machine translation (MT).

Machine Translation

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation

no code implementations 30 Jan 2025 Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch, Jiaming Luo, Colin Cherry, Markus Freitag

Data contamination -- the accidental consumption of evaluation examples within the pre-training data -- can undermine the validity of evaluation benchmarks.

Machine Translation

Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts

no code implementations 10 Sep 2024 Eleftheria Briakou, Jiaming Luo, Colin Cherry, Markus Freitag

In this paper we present a step-by-step approach to long-form text translation, drawing on established processes in translation studies.

Form Machine Translation +1

Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability

no code implementations 17 May 2023 Eleftheria Briakou, Colin Cherry, George Foster

We investigate the role of incidental bilingualism -- the unintentional consumption of bilingual signals, including translation examples -- in explaining the translation capabilities of large language models, taking the Pathways Language Model (PaLM) as a case study.

Language Modeling Language Modelling +2

Can Synthetic Translations Improve Bitext Quality?

no code implementations ACL 2022 Eleftheria Briakou, Marine Carpuat

Synthetic translations have been used for a wide range of NLP tasks primarily as a means of data augmentation.

Data Augmentation NMT

A Review of Human Evaluation for Style Transfer

1 code implementation ACL (GEM) 2021 Eleftheria Briakou, Sweta Agrawal, Ke Zhang, Joel Tetreault, Marine Carpuat

However, in style transfer papers, we find that protocols for human evaluations are often underspecified and not standardized, which hampers the reproducibility of research in this field and progress toward better human and automatic evaluation methods.

Style Transfer

Olá, Bonjour, Salve! XFORMAL: A Benchmark for Multilingual Formality Style Transfer

no code implementations NAACL 2021 Eleftheria Briakou, Di Lu, Ke Zhang, Joel Tetreault

We take the first step towards multilingual style transfer by creating and releasing XFORMAL, a benchmark of multiple formal reformulations of informal text in Brazilian Portuguese, French, and Italian.

Formality Style Transfer Style Transfer

Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation

2 code implementations ACL 2021 Eleftheria Briakou, Marine Carpuat

While it has been shown that Neural Machine Translation (NMT) is highly sensitive to noisy parallel training samples, prior work treats all types of mismatches between source and target as noise.

Machine Translation NMT +1

XFORMAL: A Benchmark for Multilingual Formality Style Transfer

1 code implementation 8 Apr 2021 Eleftheria Briakou, Di Lu, Ke Zhang, Joel Tetreault

We take the first step towards multilingual style transfer by creating and releasing XFORMAL, a benchmark of multiple formal reformulations of informal text in Brazilian Portuguese, French, and Italian.

Formality Style Transfer Style Transfer

Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

1 code implementation EMNLP 2020 Eleftheria Briakou, Marine Carpuat

Detecting fine-grained differences in content conveyed in different languages matters for cross-lingual NLP and multilingual corpora analysis, but it is a challenging machine learning problem since annotation is expensive and hard to scale.

Learning-To-Rank Sentence

Cross-topic distributional semantic representations via unsupervised mappings

no code implementations NAACL 2019 Eleftheria Briakou, Nikos Athanasiou, Alexandros Potamianos

In traditional Distributional Semantic Models (DSMs) the multiple senses of a polysemous word are conflated into a single vector space representation.

Word Similarity
