Search Results for author: Raúl Vázquez

Found 10 papers, 3 papers with code

Multilingual NMT with a language-independent attention bridge

1 code implementation • WS 2019 • Raúl Vázquez, Alessandro Raganato, Jörg Tiedemann, Mathias Creutz

In this paper, we propose a multilingual encoder-decoder architecture capable of obtaining multilingual sentence representations by incorporating an intermediate "attention bridge" that is shared across all languages.
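
The core idea can be sketched in a few lines of numpy: a small set of learned, language-independent attention heads reads a variable-length sequence of encoder states and emits a fixed-size representation. This is an illustrative toy (the names `W1`, `W2` and all shapes are assumptions, not the paper's actual parameterization), showing only why the bridge yields comparable representations across languages.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_bridge(H, W1, W2):
    """Map variable-length encoder states H (n x d) to a fixed-size
    representation (k x d) using k attention heads. W1 (d x d_a) and
    W2 (d_a x k) are shared across all languages, so every encoder
    projects into the same space."""
    A = softmax(W2.T @ np.tanh(W1.T @ H.T), axis=-1)  # (k x n) attention weights
    return A @ H                                       # (k x d) fixed-size output

rng = np.random.default_rng(0)
d, d_a, k = 8, 16, 4
W1 = rng.normal(size=(d, d_a))
W2 = rng.normal(size=(d_a, k))

# Sentences of different lengths map to the same fixed shape.
M_short = attention_bridge(rng.normal(size=(3, d)), W1, W2)
M_long = attention_bridge(rng.normal(size=(11, d)), W1, W2)
assert M_short.shape == M_long.shape == (k, d)
```

Because the output shape is independent of sentence length and the bridge parameters are shared, any decoder can attend to the same `k x d` matrix regardless of the source language.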

Tasks: NMT, Sentence, +2

The University of Helsinki submissions to the WMT19 news translation task

no code implementations • WS 2019 • Aarne Talman, Umut Sulubacak, Raúl Vázquez, Yves Scherrer, Sami Virpioja, Alessandro Raganato, Arvi Hurskainen, Jörg Tiedemann

In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English.

Tasks: Sentence, Translation

An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation

no code implementations • EMNLP 2021 • Alessandro Raganato, Raúl Vázquez, Mathias Creutz, Jörg Tiedemann

In this paper, we investigate the benefits of an explicit alignment to language labels in Transformer-based MNMT models in the zero-shot context, by jointly training one cross attention head with word alignment supervision to stress the focus on the target language label.
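
The auxiliary supervision described here can be sketched as a cross-entropy term on a single cross-attention head. This is a hedged toy (function name `alignment_loss` and all shapes are illustrative assumptions, not the paper's implementation): for each target position, the head is pushed toward the gold-aligned source index.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def alignment_loss(attn_scores, gold_alignment):
    """Auxiliary loss on one cross-attention head: cross-entropy between
    the head's attention distribution (tgt_len x src_len scores) and gold
    word alignments (for each target position, the index of its aligned
    source token, e.g. the target-language label)."""
    probs = softmax(attn_scores, axis=-1)
    tgt_len = attn_scores.shape[0]
    picked = probs[np.arange(tgt_len), gold_alignment]
    return -np.mean(np.log(picked + 1e-9))

rng = np.random.default_rng(0)
scores = rng.normal(size=(5, 7))    # one head: 5 target x 7 source positions
gold = np.array([0, 2, 3, 4, 6])    # gold-aligned source index per target token
loss = alignment_loss(scores, gold)
assert loss > 0
```

During training, this term would simply be added to the NMT objective with some weight, so only one head is constrained while the rest of the model trains as usual.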

Tasks: Machine Translation, Translation, +1

A Closer Look at Parameter Contributions When Training Neural Language and Translation Models

no code implementations • COLING 2022 • Raúl Vázquez, Hande Celikkanat, Vinit Ravishankar, Mathias Creutz, Jörg Tiedemann

We analyze the learning dynamics of neural language and translation models using Loss Change Allocation (LCA), an indicator that enables a fine-grained analysis of parameter updates when optimizing for the loss function.
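
LCA credits each parameter with a share of every step's loss change, roughly the per-parameter gradient times the per-parameter update, so the shares sum to the total loss change. A minimal numpy sketch on a toy quadratic loss (the loss, learning rate, and midpoint-gradient choice are illustrative assumptions, not the paper's setup):

```python
import numpy as np

# Loss Change Allocation on a toy quadratic loss L(w) = 0.5 * w' A w.
# Each SGD step's loss change is allocated per parameter as
# grad_i * delta_w_i; summed over parameters and steps this tracks
# the actual loss change.
A = np.diag([1.0, 4.0])
loss = lambda w: 0.5 * w @ A @ w
grad = lambda w: A @ w

w0 = np.array([1.0, 1.0])
w = w0.copy()
lr = 0.05
lca = np.zeros_like(w)              # per-parameter loss-change allocation
for _ in range(50):
    step = -lr * grad(w)
    g_mid = grad(w + 0.5 * step)    # midpoint gradient sharpens the estimate
    lca += g_mid * step
    w = w + step

total_change = loss(w) - loss(w0)
assert abs(lca.sum() - total_change) < 1e-3
assert lca[1] < lca[0] < 0  # the steeper direction is credited with more of the drop
```

The per-parameter vector `lca` is what enables the fine-grained analysis: it shows which parameters (here, which coordinate) did the work of reducing the loss.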

Tasks: Causal Language Modeling, Language Modelling, +3

Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings

1 code implementation • 10 Oct 2023 • Timothee Mickus, Raúl Vázquez

A recent body of work has demonstrated that Transformer embeddings can be linearly decomposed into well-defined sums of factors, that can in turn be related to specific network inputs or components.
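
The property that makes such decompositions possible is the additivity of the residual stream: every sublayer adds its output to a running sum, so the final embedding is exactly the input plus one term per sublayer. A toy numpy sketch of that additivity (the sublayer functions are arbitrary placeholders; the actual papers decompose real Transformers into finer factors such as individual heads and biases):

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8

# Toy residual stack: each sublayer's output is added to the stream.
def make_sublayer():
    W = 0.1 * rng.normal(size=(d, d))
    return lambda x: np.tanh(W @ x)

sublayers = [make_sublayer() for _ in range(4)]

x0 = rng.normal(size=d)      # input embedding
x, terms = x0.copy(), []
for f in sublayers:
    t = f(x)                 # this sublayer's additive contribution
    terms.append(t)
    x = x + t

# The output embedding decomposes linearly: input + one term per sublayer.
assert np.allclose(x, x0 + np.sum(terms, axis=0))
```

Each term in the sum can then be attributed to a specific component (a head, an FFN, a bias), which is the starting point for the geometric analyses the paper examines.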

Tasks: Machine Translation, Sentence

SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

no code implementations • 12 Mar 2024 • Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki

This paper presents the results of SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate.

Tasks: Machine Translation, Paraphrase Generation
