Search Results for author: Alexandra Birch

Found 60 papers, 26 papers with code

Neural Machine Translation of Rare Words with Subword Units

25 code implementations • ACL 2016 • Rico Sennrich, Barry Haddow, Alexandra Birch

Neural machine translation (NMT) models typically operate with a fixed vocabulary, but translation is an open-vocabulary problem.

Ranked #1 on Machine Translation on WMT2015 English-Russian

NMT Segmentation +1

29,251

Paper
Code

Improving Neural Machine Translation Models with Monolingual Data

2 code implementations • ACL 2016 • Rico Sennrich, Barry Haddow, Alexandra Birch

Neural Machine Translation (NMT) has obtained state-of-the art performance for several language pairs, while only using parallel data for training.

Ranked #3 on Cross-Lingual Bitext Mining on BUCC French-to-English

Cross-Lingual Bitext Mining Language Modelling +3

Paper
Code

Edinburgh Neural Machine Translation Systems for WMT 16

1 code implementation • WS 2016 • Rico Sennrich, Barry Haddow, Alexandra Birch

We participated in the WMT 2016 shared news translation task by building neural translation systems for four language pairs, each trained in both directions: English<->Czech, English<->German, English<->Romanian and English<->Russian.

Ranked #1 on Machine Translation on WMT2016 Czech-English

Machine Translation Translation

137

Paper
Code

HUME: Human UCCA-Based Evaluation of Machine Translation

1 code implementation • EMNLP 2016 • Alexandra Birch, Omri Abend, Ondrej Bojar, Barry Haddow

Human evaluation of machine translation normally uses sentence-level measures such as relative ranking or adequacy scales.

Machine Translation Sentence +1

Paper
Code

Predicting Target Language CCG Supertags Improves Neural Machine Translation

no code implementations • WS 2017 • Maria Nadejde, Siva Reddy, Rico Sennrich, Tomasz Dwojak, Marcin Junczys-Dowmunt, Philipp Koehn, Alexandra Birch

Our results on WMT data show that explicitly modeling target-syntax improves machine translation quality for German->English, a high-resource pair, and for Romanian->English, a low-resource pair and also several syntactic phenomena including prepositional phrase attachment.

Machine Translation NMT +2

Paper
Add Code

Nematus: a Toolkit for Neural Machine Translation

4 code implementations • EACL 2017 • Rico Sennrich, Orhan Firat, Kyunghyun Cho, Alexandra Birch, Barry Haddow, Julian Hitschler, Marcin Junczys-Dowmunt, Samuel Läubli, Antonio Valerio Miceli Barone, Jozef Mokry, Maria Nădejde

We present Nematus, a toolkit for Neural Machine Translation.

Machine Translation Translation

798

Paper
Code

Deep Architectures for Neural Machine Translation

3 code implementations • WS 2017 • Antonio Valerio Miceli Barone, Jindřich Helcl, Rico Sennrich, Barry Haddow, Alexandra Birch

It has been shown that increasing model depth improves the quality of neural machine translation.

Machine Translation Translation

798

Paper
Code

The University of Edinburgh's Neural MT Systems for WMT17

no code implementations • WS 2017 • Rico Sennrich, Alexandra Birch, Anna Currey, Ulrich Germann, Barry Haddow, Kenneth Heafield, Antonio Valerio Miceli Barone, Philip Williams

This paper describes the University of Edinburgh's submissions to the WMT17 shared news translation and biomedical translation tasks.

Machine Translation Translation

Paper
Add Code

Evaluating Discourse Phenomena in Neural Machine Translation

no code implementations • NAACL 2018 • Rachel Bawden, Rico Sennrich, Alexandra Birch, Barry Haddow

Despite gains using BLEU, multi-encoder models give limited improvement in the handling of discourse phenomena: 50% accuracy on our coreference test set and 53. 5% for coherence/cohesion (compared to a non-contextual baseline of 50%).

Machine Translation NMT +2

Paper
Add Code

Marian: Fast Neural Machine Translation in C++

2 code implementations • ACL 2018 • Marcin Junczys-Dowmunt, Roman Grundkiewicz, Tomasz Dwojak, Hieu Hoang, Kenneth Heafield, Tom Neckermann, Frank Seide, Ulrich Germann, Alham Fikri Aji, Nikolay Bogoychev, André F. T. Martins, Alexandra Birch

We present Marian, an efficient and self-contained Neural Machine Translation framework with an integrated automatic differentiation engine based on dynamic computation graphs.

Machine Translation Translation

1,170

Paper
Code

Findings of the Second Workshop on Neural Machine Translation and Generation

no code implementations • WS 2018 • Alexandra Birch, Andrew Finch, Minh-Thang Luong, Graham Neubig, Yusuke Oda

This document describes the findings of the Second Workshop on Neural Machine Translation and Generation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2018).

Data Augmentation Domain Adaptation +2

Paper
Add Code

Exploring Gap Filling as a Cheaper Alternative to Reading Comprehension Questionnaires when Evaluating Machine Translation for Gisting

no code implementations • WS 2018 • Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch

A popular application of machine translation (MT) is gisting: MT is consumed as is to make sense of text in a foreign language.

Machine Translation Reading Comprehension +2

Paper
Add Code

The University of Edinburgh's Submissions to the WMT19 News Translation Task

no code implementations • WS 2019 • Rachel Bawden, Nikolay Bogoychev, Ulrich Germann, Roman Grundkiewicz, Faheem Kirefu, Antonio Valerio Miceli Barone, Alexandra Birch

For all translation directions, we created or used back-translations of monolingual data in the target language as additional synthetic training data.

Language Modelling Translation

Paper
Add Code

On the Importance of Word Boundaries in Character-level Neural Machine Translation

1 code implementation • WS 2019 • Duygu Ataman, Orhan Firat, Mattia A. Di Gangi, Marcello Federico, Alexandra Birch

Neural Machine Translation (NMT) models generally perform translation using a fixed-size lexical vocabulary, which is an important bottleneck on their generalization capability and overall translation quality.

Machine Translation NMT +1

Paper
Code

Findings of the Third Workshop on Neural Generation and Translation

no code implementations • WS 2019 • Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019).

Machine Translation NMT +1

Paper
Add Code

A Latent Morphology Model for Open-Vocabulary Neural Machine Translation

1 code implementation • ICLR 2020 • Duygu Ataman, Wilker Aziz, Alexandra Birch

Translation into morphologically-rich languages challenges neural machine translation (NMT) models with extremely sparse vocabularies where atomic treatment of surface forms is unrealistic.

Machine Translation Morphological Inflection +2

Paper
Code

Towards Making the Most of Context in Neural Machine Translation

1 code implementation • 19 Feb 2020 • Zaixiang Zheng, Xiang Yue, Shu-Jian Huang, Jia-Jun Chen, Alexandra Birch

Document-level machine translation manages to outperform sentence level models by a small margin, but have failed to be widely adopted.

Document Level Machine Translation Machine Translation +3

Paper
Code

Bridging Linguistic Typology and Multilingual Machine Translation with Multi-View Language Representations

1 code implementation • EMNLP 2020 • Arturo Oncevay, Barry Haddow, Alexandra Birch

Sparse language vectors from linguistic typology databases and learned embeddings from tasks like multilingual machine translation have been investigated in isolation, without analysing how they could benefit from each other's language characterisation.

Clustering Machine Translation +1

Paper
Code

Language Model Prior for Low-Resource Neural Machine Translation

1 code implementation • EMNLP 2020 • Christos Baziotis, Barry Haddow, Alexandra Birch

A common solution is to exploit the knowledge of language models (LM) trained on abundant monolingual data.

Knowledge Distillation Language Modelling +2

Paper
Code

Few-shot learning through contextual data augmentation

1 code implementation • EACL 2021 • Farid Arthaud, Rachel Bawden, Alexandra Birch

Machine translation (MT) models used in industries with constantly changing topics, such as translation or news agencies, need to adapt to new data to maintain their performance over time.

Data Augmentation Few-Shot Learning +3

Paper
Code

Exploring Unsupervised Pretraining Objectives for Machine Translation

1 code implementation • Findings (ACL) 2021 • Christos Baziotis, Ivan Titov, Alexandra Birch, Barry Haddow

Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NMT), by drastically reducing the need for large parallel data.

Language Modelling Machine Translation +3

Paper
Code

Survey of Low-Resource Machine Translation

no code implementations • CL (ACL) 2022 • Barry Haddow, Rachel Bawden, Antonio Valerio Miceli Barone, Jindřich Helcl, Alexandra Birch

We present a survey covering the state of the art in low-resource machine translation research.

Machine Translation Translation

Paper
Add Code

CoPHE: A Count-Preserving Hierarchical Evaluation Metric in Large-Scale Multi-Label Text Classification

1 code implementation • EMNLP 2021 • Matúš Falis, Hang Dong, Alexandra Birch, Beatrice Alex

We propose a set of metrics for hierarchical evaluation using the depth-based representation.

Multi Label Text Classification Multi-Label Text Classification +1

Paper
Code

Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

1 code implementation • EMNLP 2021 • Nikita Moghe, Mark Steedman, Alexandra Birch

In this work, we enhance the transfer learning process by intermediate fine-tuning of pretrained multilingual models, where the multilingual models are fine-tuned with different but related data and/or tasks.

Cross-Lingual Transfer Dialogue State Tracking +2

Paper
Code

Distributionally Robust Recurrent Decoders with Random Network Distillation

no code implementations • RepL4NLP (ACL) 2022 • Antonio Valerio Miceli-Barone, Alexandra Birch, Rico Sennrich

Neural machine learning models can successfully model language that is similar to their training distribution, but they are highly susceptible to degradation under distribution shift, which occurs in many practical applications when processing out-of-domain (OOD) text.

Language Modelling Out of Distribution (OOD) Detection

Paper
Add Code

Non-Autoregressive Machine Translation: It's Not as Fast as it Seems

no code implementations • 4 May 2022 • Jindřich Helcl, Barry Haddow, Alexandra Birch

In this paper, we point out flaws in the evaluation methodology present in the literature on NAR models and we provide a fair comparison between a state-of-the-art NAR model and the autoregressive submissions to the shared task.

Machine Translation Translation

Paper
Add Code

Quantifying Synthesis and Fusion and their Impact on Machine Translation

no code implementations • NAACL 2022 • Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva

In this work, we propose to reduce the rigidity of such claims, by quantifying morphological typology at the word and segment level.

Machine Translation Translation

Paper
Add Code

Exploring Diversity in Back Translation for Low-Resource Machine Translation

1 code implementation • DeepLo 2022 • Laurie Burchell, Alexandra Birch, Kenneth Heafield

We also find evidence that lexical diversity is more important than syntactic for back translation performance.

Machine Translation Translation

Paper
Code

MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue

no code implementations • 20 Dec 2022 • Nikita Moghe, Evgeniia Razumovskaia, Liane Guillou, Ivan Vulić, Anna Korhonen, Alexandra Birch

We use MULTI3NLU++ to benchmark state-of-the-art multilingual models for the NLU tasks of intent detection and slot labelling for TOD systems in the multilingual setting.

Intent Detection Machine Translation +2

Paper
Add Code

Extrinsic Evaluation of Machine Translation Metrics

no code implementations • 20 Dec 2022 • Nikita Moghe, Tom Sherborne, Mark Steedman, Alexandra Birch

We calculate the correlation between the metric's ability to predict a good/bad translation with the success/failure on the final task for the Translate-Test setup.

Dialogue State Tracking Machine Translation +4

Paper
Add Code

Prompting Large Language Model for Machine Translation: A Case Study

no code implementations • 17 Jan 2023 • Biao Zhang, Barry Haddow, Alexandra Birch

Research on prompting has shown excellent performance with little or even no supervised training across many tasks.

Language Modelling Large Language Model +6

Paper
Add Code

Hallucinations in Large Multilingual Translation Models

1 code implementation • 28 Mar 2023 • Nuno M. Guerreiro, Duarte Alves, Jonas Waldendorf, Barry Haddow, Alexandra Birch, Pierre Colombo, André F. T. Martins

Large-scale multilingual machine translation systems have demonstrated remarkable ability to translate directly between numerous languages, making them increasingly appealing for real-world applications.

Language Modelling Large Language Model +2

Paper
Code

When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale

no code implementations • 23 May 2023 • Christos Baziotis, Biao Zhang, Alexandra Birch, Barry Haddow

Next, we analyze the impact of scale (from 90M to 1. 6B parameters) and find it is important for both methods, particularly DAE.

Denoising Machine Translation +1

Paper
Add Code

An Open Dataset and Model for Language Identification

1 code implementation • 23 May 2023 • Laurie Burchell, Alexandra Birch, Nikolay Bogoychev, Kenneth Heafield

We achieve this by training on a curated dataset of monolingual data, the reliability of which we ensure by auditing a sample from each source and each language manually.

Language Identification

Paper
Code

Towards Effective Disambiguation for Machine Translation with Large Language Models

no code implementations • 20 Sep 2023 • Vivek Iyer, Pinzhen Chen, Alexandra Birch

Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation.

Benchmarking In-Context Learning +3

Paper
Add Code

Assessing the Reliability of Large Language Model Knowledge

1 code implementation • 15 Oct 2023 • Weixuan Wang, Barry Haddow, Alexandra Birch, Wei Peng

Large language models (LLMs) have been treated as knowledge bases due to their strong performance in knowledge probing tasks.

Hallucination Knowledge Probing +3

Paper
Code

Cache & Distil: Optimising API Calls to Large Language Models

no code implementations • 20 Oct 2023 • Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov

To curtail the frequency of these calls, one can employ a smaller language model -- a student -- which is continuously trained on the responses of the LLM.

Active Learning Language Modelling +1

Paper
Add Code

Code-Switching with Word Senses for Pretraining in Neural Machine Translation

no code implementations • 21 Oct 2023 • Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Z. Pan, Roberto Navigli

Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022).

Denoising Machine Translation +2

Paper
Add Code

Large Language Model Inference with Lexical Shortlisting

no code implementations • 16 Nov 2023 • Nikolay Bogoychev, Pinzhen Chen, Barry Haddow, Alexandra Birch

Large language model (LLM) inference is computation and memory intensive, so we adapt lexical shortlisting to it hoping to improve both.

Language Modelling Large Language Model +1

Paper
Add Code

Retrieval-augmented Multilingual Knowledge Editing

1 code implementation • 20 Dec 2023 • Weixuan Wang, Barry Haddow, Alexandra Birch

Knowledge represented in Large Language Models (LLMs) is quite often incorrect and can also become obsolete over time.

knowledge editing Retrieval

Paper
Code

Question Translation Training for Better Multilingual Reasoning

1 code implementation • 15 Jan 2024 • Wenhao Zhu, ShuJian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch

A typical solution is to translate instruction data into all languages of interest, and then train on the resulting multilingual data, which is called translate-training.

Mathematical Reasoning Translation

Paper
Code

Can GPT-3.5 Generate and Code Discharge Summaries?

1 code implementation • 24 Jan 2024 • Matúš Falis, Aryo Pradipta Gema, Hang Dong, Luke Daines, Siddharth Basetti, Michael Holder, Rose S Penfold, Alexandra Birch, Beatrice Alex

Neural coding models were trained on baseline and augmented data and evaluated on a MIMIC-IV test set.

Data Augmentation

Paper
Code

Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

1 code implementation • 29 Jan 2024 • Nikita Moghe, Arnisa Fazla, Chantal Amrhein, Tom Kocmi, Mark Steedman, Alexandra Birch, Rico Sennrich, Liane Guillou

We benchmark metric performance, assess their incremental performance over successive campaigns, and measure their sensitivity to a range of linguistic phenomena.

Benchmarking Machine Translation +3

Paper
Code

Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases

no code implementations • 1 Feb 2024 • Giulio Zhou, Tsz Kin Lam, Alexandra Birch, Barry Haddow

While there has been a growing interest in developing direct speech translation systems to avoid propagating errors and losing non-verbal content, prior work in direct S2TT has struggled to conclusively establish the advantages of integrating the acoustic signal directly into the translation process.

speech-recognition Speech Recognition +2

Paper
Add Code

Code-Switched Language Identification is Harder Than You Think

1 code implementation • 2 Feb 2024 • Laurie Burchell, Alexandra Birch, Robert P. Thompson, Kenneth Heafield

Code switching (CS) is a very common phenomenon in written and spoken communication but one that is handled poorly by many natural language processing applications.

Language Identification Sentence

Paper
Code

Compact Speech Translation Models via Discrete Speech Units Pretraining

no code implementations • 29 Feb 2024 • Tsz Kin Lam, Alexandra Birch, Barry Haddow

In this paper, we leverage the SSL models by pretraining smaller models on their Discrete Speech Units (DSU).

Self-Supervised Learning Translation

Paper
Add Code

Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation

no code implementations • 27 Mar 2024 • Mateusz Klimaszewski, Piotr Andruszkiewicz, Alexandra Birch

Moreover, we propose a method that allows the transfer of modules between incompatible PLMs without any change in the inference complexity.

Domain Adaptation Knowledge Distillation +6

Paper
Add Code

No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement

no code implementations • 24 Apr 2024 • Mateusz Klimaszewski, Piotr Andruszkiewicz, Alexandra Birch

Modular deep learning is the state-of-the-art solution for lifting the curse of multilinguality, preventing the impact of negative interference and enabling cross-lingual performance in Multilingual Pre-trained Language Models.

Paper
Add Code

Improving Translation of Out Of Vocabulary Words using Bilingual Lexicon Induction in Low-Resource Machine Translation

no code implementations • AMTA 2022 • Jonas Waldendorf, Alexandra Birch, Barry Hadow, Antonio Valerio Micele Barone

Dictionary-based data augmentation techniques have been used in the field of domain adaptation to learn words that do not appear in the parallel training data of a machine translation model.

Bilingual Lexicon Induction Data Augmentation +4

Paper
Add Code

The Samsung and University of Edinburgh’s submission to IWSLT17

no code implementations • IWSLT 2017 • Pawel Przybysz, Marcin Chochowski, Rico Sennrich, Barry Haddow, Alexandra Birch

This paper describes the joint submission of Samsung Research and Development, Warsaw, Poland and the University of Edinburgh team to the IWSLT MT task for TED talks.

Domain Adaptation Translation

Paper
Add Code

Samsung and University of Edinburgh’s System for the IWSLT 2019

no code implementations • EMNLP (IWSLT) 2019 • Joanna Wetesko, Marcin Chochowski, Pawel Przybysz, Philip Williams, Roman Grundkiewicz, Rico Sennrich, Barry Haddow, None Barone, Valerio Miceli, Alexandra Birch

This paper describes the joint submission to the IWSLT 2019 English to Czech task by Samsung RD Institute, Poland, and the University of Edinburgh.

Paper
Add Code

The University of Edinburgh’s English-Tamil and English-Inuktitut Submissions to the WMT20 News Translation Task

no code implementations • WMT (EMNLP) 2020 • Rachel Bawden, Alexandra Birch, Radina Dobreva, Arturo Oncevay, Antonio Valerio Miceli Barone, Philip Williams

We describe the University of Edinburgh’s submissions to the WMT20 news translation shared task for the low resource language pair English-Tamil and the mid-resource language pair English-Inuktitut.

Language Modelling Machine Translation +1

Paper
Add Code

Surprise Language Challenge: Developing a Neural Machine Translation System between Pashto and English in Two Months

no code implementations • MTSummit 2021 • Alexandra Birch, Barry Haddow, Antonio Valerio Miceli Barone, Jindrich Helcl, Jonas Waldendorf, Felipe Sánchez Martínez, Mikel Forcada, Víctor Sánchez Cartagena, Juan Antonio Pérez-Ortiz, Miquel Esplà-Gomis, Wilker Aziz, Lina Murady, Sevi Sariisik, Peggy van der Kreeft, Kay Macquarrie

We find that starting from an existing large model pre-trained on 50languages leads to far better BLEU scores than pretraining on one high-resource language pair with a smaller model.

Machine Translation Transfer Learning +1

Paper
Add Code

Non-Autoregressive Machine Translation: It’s Not as Fast as it Seems

no code implementations • NAACL 2022 • Jindřich Helcl, Barry Haddow, Alexandra Birch

Machine Translation Translation

Paper
Add Code

GoURMET – Machine Translation for Low-Resourced Languages

no code implementations • EAMT 2022 • Peggy van der Kreeft, Alexandra Birch, Sevi Sariisik, Felipe Sánchez-Martínez, Wilker Aziz

The GoURMET project, funded by the European Commission’s H2020 program (under grant agreement 825299), develops models for machine translation, in particular for low-resourced languages.

Machine Translation Translation

Paper
Add Code

Horses to Zebras: Ontology-Guided Data Augmentation and Synthesis for ICD-9 Coding

no code implementations • BioNLP (ACL) 2022 • Matúš Falis, Hang Dong, Alexandra Birch, Beatrice Alex

We propose data augmentation and synthesis techniques in order to address these scenarios.

Data Augmentation

Paper
Add Code

The University of Edinburgh’s English-German and English-Hausa Submissions to the WMT21 News Translation Task

1 code implementation • WMT (EMNLP) 2021 • Pinzhen Chen, Jindřich Helcl, Ulrich Germann, Laurie Burchell, Nikolay Bogoychev, Antonio Valerio Miceli Barone, Jonas Waldendorf, Alexandra Birch, Kenneth Heafield

This paper presents the University of Edinburgh’s constrained submissions of English-German and English-Hausa systems to the WMT 2021 shared task on news translation.

Translation

1,170

Paper
Code

A Neural Verb Lexicon Model with Source-side Syntactic Context for String-to-Tree Machine Translation

no code implementations • IWSLT 2016 • Maria Nădejde, Alexandra Birch, Philipp Koehn

String-to-tree MT systems translate verbs without lexical or syntactic context on the source side and with limited target-side context.

Machine Translation Re-Ranking +2

Paper
Add Code

The University of Edinburgh’s systems submission to the MT task at IWSLT

no code implementations • IWSLT 2016 • Marcin Junczys-Dowmunt, Alexandra Birch

This paper describes the submission of the University of Edinburgh team to the IWSLT MT task for TED talks.

Domain Adaptation Translation

Paper
Add Code

Samsung and University of Edinburgh’s System for the IWSLT 2018 Low Resource MT Task

no code implementations • IWSLT (EMNLP) 2018 • Philip Williams, Marcin Chochowski, Pawel Przybysz, Rico Sennrich, Barry Haddow, Alexandra Birch

This paper describes the joint submission to the IWSLT 2018 Low Resource MT task by Samsung R&D Institute, Poland, and the University of Edinburgh.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.