Search Results for author: Mikel L. Forcada

Found 23 papers, 2 papers with code

A multi-source approach for Breton–French hybrid machine translation

no code implementations • EAMT 2020 • Víctor M. Sánchez-Cartagena, Mikel L. Forcada, Felipe Sánchez-Martínez

Corpus-based approaches to machine translation (MT) have difficulties when the amount of parallel corpora to use for training is scarce, especially if the languages involved in the translation are highly inflected.

Data Augmentation Machine Translation +2

Paper
Add Code

Usefulness of MT output for comprehension — an analysis from the point of view of linguistic intercomprehension

no code implementations • MTSummit 2017 • Kenneth Jordan Núñez, Mikel L. Forcada, Esteve Clua

Paper
Add Code

One-parameter models for sentence-level post-editing effort estimation

no code implementations • MTSummit 2017 • Mikel L. Forcada, Miquel Esplà-Gomis, Felipe Sánchez-Martínez, Lucia Specia

Sentence

Paper
Add Code

An English-Swahili parallel corpus and its use for neural machine translation in the news domain

no code implementations • EAMT 2020 • Felipe Sánchez-Martínez, Víctor M. Sánchez-Cartagena, Juan Antonio Pérez-Ortiz, Mikel L. Forcada, Miquel Esplà-Gomis, Andrew Secker, Susie Coleman, Julie Wall

This paper describes our approach to create a neural machine translation system to translate between English and Swahili (both directions) in the news domain, as well as the process we followed to crawl the necessary parallel corpora from the Internet.

Machine Translation Translation

Paper
Add Code

MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages

no code implementations • EAMT 2022 • Marta Bañón, Miquel Esplà-Gomis, Mikel L. Forcada, Cristian García-Romero, Taja Kuzman, Nikola Ljubešić, Rik van Noord, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Peter Rupnik, Vít Suchomel, Antonio Toral, Tobias van der Werff, Jaume Zaragoza

We introduce the project “MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages”, funded by the Connecting Europe Facility, which is aimed at building monolingual and parallel corpora for under-resourced European languages.

Paper
Add Code

MultitraiNMT Erasmus+ project: Machine Translation Training for multilingual citizens (multitrainmt.eu)

no code implementations • EAMT 2022 • Mikel L. Forcada, Pilar Sánchez-Gijón, Dorothy Kenny, Felipe Sánchez-Martínez, Juan Antonio Pérez Ortiz, Riccardo Superbo, Gema Ramírez Sánchez, Olga Torres-Hostench, Caroline Rossi

The MultitraiNMT Erasmus+ project has developed an open innovative syl-labus in machine translation, focusing on neural machine translation (NMT) and targeting both language learners and translators.

Machine Translation NMT +1

Paper
Add Code

Apertium: a free/open source platform for machine translation and basic language technology

no code implementations • EAMT 2016 • Mikel L. Forcada, Francis M. Tyers

Machine Translation Translation

Paper
Add Code

ParaCrawl: Web-Scale Acquisition of Parallel Corpora

2 code implementations • ACL 2020 • Marta Ba{\~n}{\'o}n, Pin-zhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Espl{\`a}-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ram{\'\i}rez-S{\'a}nchez, Elsa Sarr{\'\i}as, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, Jaume Zaragoza

We report on methods to create the largest publicly available parallel corpora by crawling the web, using open source software.

Machine Translation Parallel Corpus Mining +2

1,170

Paper
Code

Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality

1 code implementation • EMNLP (IWSLT) 2019 • Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia

To that end, we report experiments on a dataset with newly-collected post-editing indicators and show their usefulness when estimating post-editing effort.

Machine Translation Translation

Paper
Code

Global Under-Resourced Media Translation (GoURMET)

no code implementations • WS 2019 • Alex Birch, ra, Barry Haddow, Ivan Tito, Antonio Valerio Miceli Barone, Rachel Bawden, Felipe S{\'a}nchez-Mart{\'\i}nez, Mikel L. Forcada, Miquel Espl{\`a}-Gomis, V{\'\i}ctor S{\'a}nchez-Cartagena, Juan Antonio P{\'e}rez-Ortiz, Wilker Aziz, Andrew Secker, Peggy van der Kreeft

Translation

Paper
Add Code

UAlacant machine translation quality estimation at WMT 2018: a simple approach using phrase tables and feed-forward neural networks

no code implementations • WS 2018 • Miquel Esplà-Gomis, Felipe Sánchez-Martínez, Mikel L. Forcada

We describe the Universitat d'Alacant submissions to the word- and sentence-level machine translation (MT) quality estimation (QE) shared task at WMT 2018.

Machine Translation Sentence +1

Paper
Add Code

Findings of the WMT 2018 Shared Task on Parallel Corpus Filtering

no code implementations • WS 2018 • Philipp Koehn, Huda Khayrallah, Kenneth Heafield, Mikel L. Forcada

We posed the shared task of assigning sentence-level quality scores for a very noisy corpus of sentence pairs crawled from the web, with the goal of sub-selecting 1{\%} and 10{\%} of high-quality data to be used to train machine translation systems.

Machine Translation Outlier Detection +2

Paper
Add Code

Exploring Gap Filling as a Cheaper Alternative to Reading Comprehension Questionnaires when Evaluating Machine Translation for Gisting

no code implementations • WS 2018 • Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch

A popular application of machine translation (MT) is gisting: MT is consumed as is to make sense of text in a foreign language.

Machine Translation Reading Comprehension +2

Paper
Add Code

A Maturity Model for Public Administration as Open Translation Data Providers

no code implementations • 7 Jul 2016 • Núria Bel, Mikel L. Forcada, Asunción Gómez-Pérez

Any public administration that produces translation data can be a provider of useful reusable data to meet its own translation needs and the ones of other public organizations and private companies that work with texts of the same domain.

Machine Translation Management +1

Paper
Add Code

Stand-off Annotation of Web Content as a Legally Safer Alternative to Crawling for Distribution

no code implementations • WS 2016 • Mikel L. Forcada, Miquel Espl{\`a}-Gomis, Juan Antonio P{\'e}rez-Ortiz

Machine Translation

Paper
Add Code

A Light Sliding-Window Part-of-Speech Tagger for the Apertium Free/Open-Source Machine Translation Platform

no code implementations • 18 Sep 2015 • Gang Chen, Mikel L. Forcada

This paper describes a free/open-source implementation of the light sliding-window (LSW) part-of-speech tagger for the Apertium free/open-source machine translation platform.

Machine Translation Translation

Paper
Add Code

A general framework for minimizing translation effort: towards a principled combination of translation technologies in computer-aided translation

no code implementations • WS 2015 • Mikel L. Forcada, Felipe S{\'a}nchez-Mart{\'\i}nez

Machine Translation Translation

Paper
Add Code

Abu-MaTran: Automatic building of Machine Translation

no code implementations • EAMT 2016 • Antonio Toral, Tommi A. Pirinen, Andy Way, Gema Ram{\'\i}rez-S{\'a}nchez, Sergio Ortiz Rojas, Raphael Rubino, Miquel Espl{\`a}, Mikel L. Forcada, Vassilis Papavassiliou, Prokopis Prokopidis, Nikola Ljube{\v{s}}i{\'c}

Machine Translation Transfer Learning +1

Paper
Add Code

Using on-line available sources of bilingual information for word-level machine translation quality estimation

no code implementations • WS 2015 • Miquel Espl{\`a}-Gomis, Felipe S{\'a}nchez-Mart{\'\i}nez, Mikel L. Forcada

Machine Translation Translation

Paper
Add Code

Unsupervised training of maximum-entropy models for lexical selection in rule-based machine translation

no code implementations • WS 2015 • Francis M. Tyers, Felipe S{\'a}nchez-Mart{\'\i}nez, Mikel L. Forcada

Language Modelling Machine Translation +2

Paper
Add Code

Evaluating machine translation for assimilation via a gap-filling task

no code implementations • WS 2015 • Ekaterina Ageeva, Mikel L. Forcada, Francis M. Tyers, Juan Antonio P{\'e}rez-Ortiz

Machine Translation Translation

Paper
Add Code

Inferring Shallow-Transfer Machine Translation Rules from Small Parallel Corpora

no code implementations • 15 Jan 2014 • Felipe Sánchez-Martínez, Mikel L. Forcada

This paper describes a method for the automatic inference of structural transfer rules to be used in a shallow-transfer machine translation (MT) system from small parallel corpora.

Machine Translation Sentence +2

Paper
Add Code

UAlacant: Using Online Machine Translation for Cross-Lingual Textual Entailment

no code implementations • SEMEVAL 2012 • Miquel Espl{\`a}-Gomis, Felipe S{\'a}nchez-Mart{\'\i}nez, Mikel L. Forcada

Information Retrieval Machine Translation +4

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.