Search Results for author: Cristina España-Bonet

Found 15 papers, 1 paper with code

Word's Vector Representations meet Machine Translation

no code implementations • WS 2014 • Eva Martínez Garcia, Jörg Tiedemann, Cristina España-Bonet, Lluís Màrquez

Machine Translation Translation

Document-Level Machine Translation with Word Vector Models

no code implementations • WS 2015 • Eva Martínez Garcia, Cristina España-Bonet, Lluís Màrquez

Document Level Machine Translation Language Modelling +3

A Factory of Comparable Corpora from Wikipedia

no code implementations • WS 2015 • Alberto Barrón-Cedeño, Cristina España-Bonet, Josu Boldoba, Lluís Màrquez

Machine Translation

TweetMT: A Parallel Microblog Corpus

no code implementations • LREC 2016 • Iñaki San Vicente, Iñaki Alegría, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri

We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula.

Machine Translation Translation

The TALP–UPC Spanish–English WMT Biomedical Task: Bilingual Embeddings and Char-based Neural Language Model Rescoring in a Phrase-based System

no code implementations • WS 2016 • Marta R. Costa-jussà, Cristina España-Bonet, Pranava Madhyastha, Carlos Escolano, José A. R. Fonollosa

Language Modelling Machine Translation +1

Learning Bilingual Projections of Embeddings for Vocabulary Expansion in Machine Translation

no code implementations • WS 2017 • Pranava Swaroop Madhyastha, Cristina España-Bonet

We propose a simple log-bilinear softmax-based model to deal with vocabulary expansion in machine translation.

Representation Learning Translation +2

Lump at SemEval-2017 Task 1: Towards an Interlingua Semantic Similarity

no code implementations • SEMEVAL 2017 • Cristina España-Bonet, Alberto Barrón-Cedeño

This is the Lump team participation at SemEval 2017 Task 1 on Semantic Textual Similarity.

Language Identification Machine Translation +3

Self-Supervised Neural Machine Translation

1 code implementation • ACL 2019 • Dana Ruiter, Cristina España-Bonet, Josef van Genabith

We present a simple new method where an emergent NMT system is used for simultaneously selecting training data and learning internal NMT representations.

Machine Translation NMT +1

UdS-DFKI Participation at WMT 2019: Low-Resource (en-gu) and Coreference-Aware (en-de) Systems

no code implementations • WS 2019 • Cristina España-Bonet, Dana Ruiter

This paper describes the UdS-DFKI submission to the WMT2019 news translation task for Gujarati–English (low-resourced pair) and German–English (document-level evaluation).

Translation

Context-Aware Neural Machine Translation Decoding

no code implementations • WS 2019 • Eva Martínez Garcia, Carles Creus, Cristina España-Bonet

This work presents a decoding architecture that fuses the information from a neural translation model and the context semantics enclosed in a semantic space language model based on word embeddings.

Language Modelling Machine Translation +2

Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi

no code implementations • LREC 2020 • Jesujoba Alabi, Kwabena Amponsah-Kaakyire, David Adelani, Cristina España-Bonet

In this paper we focus on two African languages, Yorùbá and Twi, and compare the word embeddings obtained in this way, with word embeddings obtained from curated corpora and a language-dependent processing.

named-entity-recognition Named Entity Recognition +2

Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction

no code implementations • CL 2020 • Marta R. Costa-jussà, Cristina España-Bonet, Pascale Fung, Noah A. Smith

We introduce the Computational Linguistics special issue on Multilingual and Interlingual Semantic Representations for Natural Language Processing.

How Human is Machine Translationese? Comparing Human and Machine Translations of Text and Speech

no code implementations • WS 2020 • Yuri Bizzoni, Tom S Juzek, Cristina España-Bonet, Koel Dutta Chowdhury, Josef van Genabith, Elke Teich

Some translationese features tend to appear in simultaneous interpreting with higher frequency than in human text translation, but the reasons for this are unclear.

Machine Translation Translation

Understanding Translationese in Multi-view Embedding Spaces

no code implementations • COLING 2020 • Koel Dutta Chowdhury, Cristina España-Bonet, Josef van Genabith

Recent studies use a combination of lexical and syntactic features to show that footprints of the source language remain visible in translations, to the extent that it is possible to predict the original source language from the translation.

Translation
