Search Results for author: Nuno M. Guerreiro

Found 13 papers, 11 papers with code

IST-Unbabel 2021 Submission for the Explainable Quality Estimation Shared Task

1 code implementation • EMNLP (Eval4NLP) 2021 • Marcos Treviso, Nuno M. Guerreiro, Ricardo Rei, André F. T. Martins

Paper
Code

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

1 code implementation • 27 Feb 2024 • Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task.

Language Modelling Large Language Model +1

Paper
Code

Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation

no code implementations • 20 Feb 2024 • Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M. Guerreiro

Hallucinated translations pose significant threats and safety concerns when it comes to the practical deployment of machine translation systems.

Hallucination Machine Translation +1

Paper
Add Code

CroissantLLM: A Truly Bilingual French-English Language Model

1 code implementation • 1 Feb 2024 • Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António Loison, Duarte M. Alves, Caio Corro, Nicolas Boizard, João Alves, Ricardo Rei, Pedro H. Martins, Antoni Bigata Casademunt, François Yvon, André F. T. Martins, Gautier Viaud, Céline Hudelot, Pierre Colombo

We introduce CroissantLLM, a 1. 3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware.

Language Modelling Large Language Model

Paper
Code

Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

no code implementations • 20 Oct 2023 • Duarte M. Alves, Nuno M. Guerreiro, João Alves, José Pombal, Ricardo Rei, José G. C. de Souza, Pierre Colombo, André F. T. Martins

Experiments on 10 language pairs show that our proposed approach recovers the original few-shot capabilities while keeping the added benefits of finetuning.

In-Context Learning Machine Translation +1

Paper
Add Code

xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection

1 code implementation • 16 Oct 2023 • Nuno M. Guerreiro, Ricardo Rei, Daan van Stigt, Luisa Coheur, Pierre Colombo, André F. T. Martins

Widely used learned metrics for machine translation evaluation, such as COMET and BLEURT, estimate the quality of a translation hypothesis by providing a single sentence-level score.

Machine Translation Sentence +1

393

Paper
Code

Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

1 code implementation • 21 Sep 2023 • Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2).

Sentence Task 2

393

Paper
Code

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

1 code implementation • 26 May 2023 • Marcos Treviso, Alexis Ross, Nuno M. Guerreiro, André F. T. Martins

Selective rationales and counterfactual examples have emerged as two effective, complementary classes of interpretability methods for analyzing and training NLP models.

counterfactual Data Augmentation +2

Paper
Code

The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics

1 code implementation • 19 May 2023 • Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie, André F. T. Martins

Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU.

Decision Making Machine Translation +2

393

Paper
Code

Hallucinations in Large Multilingual Translation Models

1 code implementation • 28 Mar 2023 • Nuno M. Guerreiro, Duarte Alves, Jonas Waldendorf, Barry Haddow, Alexandra Birch, Pierre Colombo, André F. T. Martins

Large-scale multilingual machine translation systems have demonstrated remarkable ability to translate directly between numerous languages, making them increasingly appealing for real-world applications.

Language Modelling Large Language Model +2

Paper
Code

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

1 code implementation • 19 Dec 2022 • Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida, André F. T. Martins

Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications.

Hallucination Machine Translation +2

Paper
Code

CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

1 code implementation • 13 Sep 2022 • Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE).

Sentence

393

Paper
Code

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation

2 code implementations • 10 Aug 2022 • Nuno M. Guerreiro, Elena Voita, André F. T. Martins

Although the problem of hallucinations in neural machine translation (NMT) has received some attention, research on this highly pathological phenomenon lacks solid ground.

Machine Translation NMT

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.