no code implementations • COLING 2020 • Koel Dutta Chowdhury, Cristina España-Bonet, Josef van Genabith
Recent studies use a combination of lexical and syntactic features to show that footprints of the source language remain visible in translations, to the extent that it is possible to predict the original source language from the translation.
no code implementations • WS 2020 • Yuri Bizzoni, Tom S Juzek, Cristina España-Bonet, Koel Dutta Chowdhury, Josef van Genabith, Elke Teich
Some translationese features tend to appear in simultaneous interpreting with higher frequency than in human text translation, but the reasons for this are unclear.
no code implementations • CL 2020 • Marta R. Costa-jussà, Cristina España-Bonet, Pascale Fung, Noah A. Smith
We introduce the Computational Linguistics special issue on Multilingual and Interlingual Semantic Representations for Natural Language Processing.
no code implementations • LREC 2020 • Jesujoba Alabi, Kwabena Amponsah-Kaakyire, David Adelani, Cristina España-Bonet
In this paper we focus on two African languages, Yorùbá and Twi, and compare the word embeddings obtained in this way with word embeddings obtained from curated corpora and language-dependent processing.
no code implementations • WS 2019 • Eva Martínez Garcia, Carles Creus, Cristina España-Bonet
This work presents a decoding architecture that fuses the information from a neural translation model and the context semantics enclosed in a semantic space language model based on word embeddings.
no code implementations • WS 2019 • Cristina España-Bonet, Dana Ruiter
This paper describes the UdS-DFKI submission to the WMT2019 news translation task for Gujarati–English (low-resourced pair) and German–English (document-level evaluation).
1 code implementation • ACL 2019 • Dana Ruiter, Cristina España-Bonet, Josef van Genabith
We present a simple new method where an emergent NMT system is used for simultaneously selecting training data and learning internal NMT representations.
no code implementations • WS 2017 • Pranava Swaroop Madhyastha, Cristina España-Bonet
We propose a simple log-bilinear softmax-based model to deal with vocabulary expansion in machine translation.
no code implementations • SEMEVAL 2017 • Cristina España-Bonet, Alberto Barrón-Cedeño
This is the Lump team's participation in SemEval 2017 Task 1 on Semantic Textual Similarity.
no code implementations • LREC 2016 • Iñaki San Vicente, Iñaki Alegría, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri
We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula.