Search Results for author: Cristina España-Bonet

Found 15 papers, 1 paper with code

TweetMT: A Parallel Microblog Corpus

no code implementations LREC 2016 Iñaki San Vicente, Iñaki Alegría, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri

We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula.

Machine Translation Translation

Self-Supervised Neural Machine Translation

1 code implementation ACL 2019 Dana Ruiter, Cristina España-Bonet, Josef van Genabith

We present a simple new method where an emergent NMT system is used for simultaneously selecting training data and learning internal NMT representations.

Machine Translation NMT +1

UdS-DFKI Participation at WMT 2019: Low-Resource (en-gu) and Coreference-Aware (en-de) Systems

no code implementations WS 2019 Cristina España-Bonet, Dana Ruiter

This paper describes the UdS-DFKI submission to the WMT2019 news translation task for Gujarati–English (low-resourced pair) and German–English (document-level evaluation).

Translation

Context-Aware Neural Machine Translation Decoding

no code implementations WS 2019 Eva Martínez Garcia, Carles Creus, Cristina España-Bonet

This work presents a decoding architecture that fuses the information from a neural translation model and the context semantics enclosed in a semantic space language model based on word embeddings.

Language Modelling Machine Translation +2

Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi

no code implementations LREC 2020 Jesujoba Alabi, Kwabena Amponsah-Kaakyire, David Adelani, Cristina España-Bonet

In this paper we focus on two African languages, Yorùbá and Twi, and compare the word embeddings obtained in this way, with word embeddings obtained from curated corpora and a language-dependent processing.

Named Entity Recognition +2

Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction

no code implementations CL 2020 Marta R. Costa-jussà, Cristina España-Bonet, Pascale Fung, Noah A. Smith

We introduce the Computational Linguistics special issue on Multilingual and Interlingual Semantic Representations for Natural Language Processing.

How Human is Machine Translationese? Comparing Human and Machine Translations of Text and Speech

no code implementations WS 2020 Yuri Bizzoni, Tom S Juzek, Cristina España-Bonet, Koel Dutta Chowdhury, Josef van Genabith, Elke Teich

Some translationese features tend to appear in simultaneous interpreting with higher frequency than in human text translation, but the reasons for this are unclear.

Machine Translation Translation

Understanding Translationese in Multi-view Embedding Spaces

no code implementations COLING 2020 Koel Dutta Chowdhury, Cristina España-Bonet, Josef van Genabith

Recent studies use a combination of lexical and syntactic features to show that footprints of the source language remain visible in translations, to the extent that it is possible to predict the original source language from the translation.

Translation
