Search Results for author: Alexandra Birch

Found 60 papers, 26 papers with code

Neural Machine Translation of Rare Words with Subword Units

25 code implementations ACL 2016 Rico Sennrich, Barry Haddow, Alexandra Birch

Neural machine translation (NMT) models typically operate with a fixed vocabulary, but translation is an open-vocabulary problem.

NMT Segmentation +1

Edinburgh Neural Machine Translation Systems for WMT 16

1 code implementation WS 2016 Rico Sennrich, Barry Haddow, Alexandra Birch

We participated in the WMT 2016 shared news translation task by building neural translation systems for four language pairs, each trained in both directions: English<->Czech, English<->German, English<->Romanian and English<->Russian.

Machine Translation Translation

HUME: Human UCCA-Based Evaluation of Machine Translation

1 code implementation EMNLP 2016 Alexandra Birch, Omri Abend, Ondrej Bojar, Barry Haddow

Human evaluation of machine translation normally uses sentence-level measures such as relative ranking or adequacy scales.

Machine Translation Sentence +1

Predicting Target Language CCG Supertags Improves Neural Machine Translation

no code implementations WS 2017 Maria Nadejde, Siva Reddy, Rico Sennrich, Tomasz Dwojak, Marcin Junczys-Dowmunt, Philipp Koehn, Alexandra Birch

Our results on WMT data show that explicitly modeling target-syntax improves machine translation quality for German->English, a high-resource pair, and for Romanian->English, a low-resource pair and also several syntactic phenomena including prepositional phrase attachment.

Machine Translation NMT +2

Evaluating Discourse Phenomena in Neural Machine Translation

no code implementations NAACL 2018 Rachel Bawden, Rico Sennrich, Alexandra Birch, Barry Haddow

Despite gains using BLEU, multi-encoder models give limited improvement in the handling of discourse phenomena: 50% accuracy on our coreference test set and 53. 5% for coherence/cohesion (compared to a non-contextual baseline of 50%).

Machine Translation NMT +2

Marian: Fast Neural Machine Translation in C++

2 code implementations ACL 2018 Marcin Junczys-Dowmunt, Roman Grundkiewicz, Tomasz Dwojak, Hieu Hoang, Kenneth Heafield, Tom Neckermann, Frank Seide, Ulrich Germann, Alham Fikri Aji, Nikolay Bogoychev, André F. T. Martins, Alexandra Birch

We present Marian, an efficient and self-contained Neural Machine Translation framework with an integrated automatic differentiation engine based on dynamic computation graphs.

Machine Translation Translation

Findings of the Second Workshop on Neural Machine Translation and Generation

no code implementations WS 2018 Alexandra Birch, Andrew Finch, Minh-Thang Luong, Graham Neubig, Yusuke Oda

This document describes the findings of the Second Workshop on Neural Machine Translation and Generation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2018).

Data Augmentation Domain Adaptation +2

On the Importance of Word Boundaries in Character-level Neural Machine Translation

1 code implementation WS 2019 Duygu Ataman, Orhan Firat, Mattia A. Di Gangi, Marcello Federico, Alexandra Birch

Neural Machine Translation (NMT) models generally perform translation using a fixed-size lexical vocabulary, which is an important bottleneck on their generalization capability and overall translation quality.

Machine Translation NMT +1

Findings of the Third Workshop on Neural Generation and Translation

no code implementations WS 2019 Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019).

Machine Translation NMT +1

A Latent Morphology Model for Open-Vocabulary Neural Machine Translation

1 code implementation ICLR 2020 Duygu Ataman, Wilker Aziz, Alexandra Birch

Translation into morphologically-rich languages challenges neural machine translation (NMT) models with extremely sparse vocabularies where atomic treatment of surface forms is unrealistic.

Machine Translation Morphological Inflection +2

Towards Making the Most of Context in Neural Machine Translation

1 code implementation19 Feb 2020 Zaixiang Zheng, Xiang Yue, Shu-Jian Huang, Jia-Jun Chen, Alexandra Birch

Document-level machine translation manages to outperform sentence level models by a small margin, but have failed to be widely adopted.

Document Level Machine Translation Machine Translation +3

Bridging Linguistic Typology and Multilingual Machine Translation with Multi-View Language Representations

1 code implementation EMNLP 2020 Arturo Oncevay, Barry Haddow, Alexandra Birch

Sparse language vectors from linguistic typology databases and learned embeddings from tasks like multilingual machine translation have been investigated in isolation, without analysing how they could benefit from each other's language characterisation.

Clustering Machine Translation +1

Few-shot learning through contextual data augmentation

1 code implementation EACL 2021 Farid Arthaud, Rachel Bawden, Alexandra Birch

Machine translation (MT) models used in industries with constantly changing topics, such as translation or news agencies, need to adapt to new data to maintain their performance over time.

Data Augmentation Few-Shot Learning +3

Exploring Unsupervised Pretraining Objectives for Machine Translation

1 code implementation Findings (ACL) 2021 Christos Baziotis, Ivan Titov, Alexandra Birch, Barry Haddow

Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NMT), by drastically reducing the need for large parallel data.

Language Modelling Machine Translation +3

Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

1 code implementation EMNLP 2021 Nikita Moghe, Mark Steedman, Alexandra Birch

In this work, we enhance the transfer learning process by intermediate fine-tuning of pretrained multilingual models, where the multilingual models are fine-tuned with different but related data and/or tasks.

Cross-Lingual Transfer Dialogue State Tracking +2

Distributionally Robust Recurrent Decoders with Random Network Distillation

no code implementations RepL4NLP (ACL) 2022 Antonio Valerio Miceli-Barone, Alexandra Birch, Rico Sennrich

Neural machine learning models can successfully model language that is similar to their training distribution, but they are highly susceptible to degradation under distribution shift, which occurs in many practical applications when processing out-of-domain (OOD) text.

Language Modelling Out of Distribution (OOD) Detection

Non-Autoregressive Machine Translation: It's Not as Fast as it Seems

no code implementations4 May 2022 Jindřich Helcl, Barry Haddow, Alexandra Birch

In this paper, we point out flaws in the evaluation methodology present in the literature on NAR models and we provide a fair comparison between a state-of-the-art NAR model and the autoregressive submissions to the shared task.

Machine Translation Translation

MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue

no code implementations20 Dec 2022 Nikita Moghe, Evgeniia Razumovskaia, Liane Guillou, Ivan Vulić, Anna Korhonen, Alexandra Birch

We use MULTI3NLU++ to benchmark state-of-the-art multilingual models for the NLU tasks of intent detection and slot labelling for TOD systems in the multilingual setting.

Intent Detection Machine Translation +2

Extrinsic Evaluation of Machine Translation Metrics

no code implementations20 Dec 2022 Nikita Moghe, Tom Sherborne, Mark Steedman, Alexandra Birch

We calculate the correlation between the metric's ability to predict a good/bad translation with the success/failure on the final task for the Translate-Test setup.

Dialogue State Tracking Machine Translation +4

Prompting Large Language Model for Machine Translation: A Case Study

no code implementations17 Jan 2023 Biao Zhang, Barry Haddow, Alexandra Birch

Research on prompting has shown excellent performance with little or even no supervised training across many tasks.

Language Modelling Large Language Model +6

Hallucinations in Large Multilingual Translation Models

1 code implementation28 Mar 2023 Nuno M. Guerreiro, Duarte Alves, Jonas Waldendorf, Barry Haddow, Alexandra Birch, Pierre Colombo, André F. T. Martins

Large-scale multilingual machine translation systems have demonstrated remarkable ability to translate directly between numerous languages, making them increasingly appealing for real-world applications.

Language Modelling Large Language Model +2

When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale

no code implementations23 May 2023 Christos Baziotis, Biao Zhang, Alexandra Birch, Barry Haddow

Next, we analyze the impact of scale (from 90M to 1. 6B parameters) and find it is important for both methods, particularly DAE.

Denoising Machine Translation +1

An Open Dataset and Model for Language Identification

1 code implementation23 May 2023 Laurie Burchell, Alexandra Birch, Nikolay Bogoychev, Kenneth Heafield

We achieve this by training on a curated dataset of monolingual data, the reliability of which we ensure by auditing a sample from each source and each language manually.

Language Identification

Towards Effective Disambiguation for Machine Translation with Large Language Models

no code implementations20 Sep 2023 Vivek Iyer, Pinzhen Chen, Alexandra Birch

Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation.

Benchmarking In-Context Learning +3

Assessing the Reliability of Large Language Model Knowledge

1 code implementation15 Oct 2023 Weixuan Wang, Barry Haddow, Alexandra Birch, Wei Peng

Large language models (LLMs) have been treated as knowledge bases due to their strong performance in knowledge probing tasks.

Hallucination Knowledge Probing +3

Cache & Distil: Optimising API Calls to Large Language Models

no code implementations20 Oct 2023 Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov

To curtail the frequency of these calls, one can employ a smaller language model -- a student -- which is continuously trained on the responses of the LLM.

Active Learning Language Modelling +1

Code-Switching with Word Senses for Pretraining in Neural Machine Translation

no code implementations21 Oct 2023 Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Z. Pan, Roberto Navigli

Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022).

Denoising Machine Translation +2

Large Language Model Inference with Lexical Shortlisting

no code implementations16 Nov 2023 Nikolay Bogoychev, Pinzhen Chen, Barry Haddow, Alexandra Birch

Large language model (LLM) inference is computation and memory intensive, so we adapt lexical shortlisting to it hoping to improve both.

Language Modelling Large Language Model +1

Retrieval-augmented Multilingual Knowledge Editing

1 code implementation20 Dec 2023 Weixuan Wang, Barry Haddow, Alexandra Birch

Knowledge represented in Large Language Models (LLMs) is quite often incorrect and can also become obsolete over time.

knowledge editing Retrieval

Question Translation Training for Better Multilingual Reasoning

1 code implementation15 Jan 2024 Wenhao Zhu, ShuJian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch

A typical solution is to translate instruction data into all languages of interest, and then train on the resulting multilingual data, which is called translate-training.

Mathematical Reasoning Translation

Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

1 code implementation29 Jan 2024 Nikita Moghe, Arnisa Fazla, Chantal Amrhein, Tom Kocmi, Mark Steedman, Alexandra Birch, Rico Sennrich, Liane Guillou

We benchmark metric performance, assess their incremental performance over successive campaigns, and measure their sensitivity to a range of linguistic phenomena.

Benchmarking Machine Translation +3

Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases

no code implementations1 Feb 2024 Giulio Zhou, Tsz Kin Lam, Alexandra Birch, Barry Haddow

While there has been a growing interest in developing direct speech translation systems to avoid propagating errors and losing non-verbal content, prior work in direct S2TT has struggled to conclusively establish the advantages of integrating the acoustic signal directly into the translation process.

speech-recognition Speech Recognition +2

Code-Switched Language Identification is Harder Than You Think

1 code implementation2 Feb 2024 Laurie Burchell, Alexandra Birch, Robert P. Thompson, Kenneth Heafield

Code switching (CS) is a very common phenomenon in written and spoken communication but one that is handled poorly by many natural language processing applications.

Language Identification Sentence

Compact Speech Translation Models via Discrete Speech Units Pretraining

no code implementations29 Feb 2024 Tsz Kin Lam, Alexandra Birch, Barry Haddow

In this paper, we leverage the SSL models by pretraining smaller models on their Discrete Speech Units (DSU).

Self-Supervised Learning Translation

Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation

no code implementations27 Mar 2024 Mateusz Klimaszewski, Piotr Andruszkiewicz, Alexandra Birch

Moreover, we propose a method that allows the transfer of modules between incompatible PLMs without any change in the inference complexity.

Domain Adaptation Knowledge Distillation +6

No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement

no code implementations24 Apr 2024 Mateusz Klimaszewski, Piotr Andruszkiewicz, Alexandra Birch

Modular deep learning is the state-of-the-art solution for lifting the curse of multilinguality, preventing the impact of negative interference and enabling cross-lingual performance in Multilingual Pre-trained Language Models.

Improving Translation of Out Of Vocabulary Words using Bilingual Lexicon Induction in Low-Resource Machine Translation

no code implementations AMTA 2022 Jonas Waldendorf, Alexandra Birch, Barry Hadow, Antonio Valerio Micele Barone

Dictionary-based data augmentation techniques have been used in the field of domain adaptation to learn words that do not appear in the parallel training data of a machine translation model.

Bilingual Lexicon Induction Data Augmentation +4

The Samsung and University of Edinburgh’s submission to IWSLT17

no code implementations IWSLT 2017 Pawel Przybysz, Marcin Chochowski, Rico Sennrich, Barry Haddow, Alexandra Birch

This paper describes the joint submission of Samsung Research and Development, Warsaw, Poland and the University of Edinburgh team to the IWSLT MT task for TED talks.

Domain Adaptation Translation

The University of Edinburgh’s English-Tamil and English-Inuktitut Submissions to the WMT20 News Translation Task

no code implementations WMT (EMNLP) 2020 Rachel Bawden, Alexandra Birch, Radina Dobreva, Arturo Oncevay, Antonio Valerio Miceli Barone, Philip Williams

We describe the University of Edinburgh’s submissions to the WMT20 news translation shared task for the low resource language pair English-Tamil and the mid-resource language pair English-Inuktitut.

Language Modelling Machine Translation +1

Non-Autoregressive Machine Translation: It’s Not as Fast as it Seems

no code implementations NAACL 2022 Jindřich Helcl, Barry Haddow, Alexandra Birch

In this paper, we point out flaws in the evaluation methodology present in the literature on NAR models and we provide a fair comparison between a state-of-the-art NAR model and the autoregressive submissions to the shared task.

Machine Translation Translation

GoURMET – Machine Translation for Low-Resourced Languages

no code implementations EAMT 2022 Peggy van der Kreeft, Alexandra Birch, Sevi Sariisik, Felipe Sánchez-Martínez, Wilker Aziz

The GoURMET project, funded by the European Commission’s H2020 program (under grant agreement 825299), develops models for machine translation, in particular for low-resourced languages.

Machine Translation Translation

Samsung and University of Edinburgh’s System for the IWSLT 2018 Low Resource MT Task

no code implementations IWSLT (EMNLP) 2018 Philip Williams, Marcin Chochowski, Pawel Przybysz, Rico Sennrich, Barry Haddow, Alexandra Birch

This paper describes the joint submission to the IWSLT 2018 Low Resource MT task by Samsung R&D Institute, Poland, and the University of Edinburgh.

Cannot find the paper you are looking for? You can Submit a new open access paper.