Search Results for author: Felix Stahlberg

Found 34 papers, 7 papers with code

Data Strategies for Low-Resource Grammatical Error Correction

no code implementations EACL (BEA) 2021 Simon Flachs, Felix Stahlberg, Shankar Kumar

We investigate how best to take advantage of existing data sources for improving GEC systems for languages with limited quantities of high quality training data.

Grammatical Error Correction

Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models

no code implementations20 Oct 2023 Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke wu

One challenge in speech translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations.

Hallucination Translation

Improved Long-Form Spoken Language Translation with Large Language Models

no code implementations19 Dec 2022 Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Axel H. Ng

A challenge in spoken language translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations.

Language Modelling Large Language Model +1

Text Generation with Text-Editing Models

no code implementations NAACL (ACL) 2022 Eric Malmi, Yue Dong, Jonathan Mallinson, Aleksandr Chuklin, Jakub Adamek, Daniil Mirylenka, Felix Stahlberg, Sebastian Krause, Shankar Kumar, Aliaksei Severyn

Text-editing models have recently become a prominent alternative to seq2seq models for monolingual text-generation tasks such as grammatical error correction, simplification, and style transfer.

Grammatical Error Correction Hallucination +2

Transformer-based Models of Text Normalization for Speech Applications

no code implementations1 Feb 2022 Jae Hun Ro, Felix Stahlberg, Ke wu, Shankar Kumar

Text normalization, or the process of transforming text into a consistent, canonical form, is crucial for speech applications such as text-to-speech synthesis (TTS).

Sentence Speech Synthesis +1

Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models

1 code implementation EACL (BEA) 2021 Felix Stahlberg, Shankar Kumar

Synthetic data generation is widely known to boost the accuracy of neural grammatical error correction (GEC) systems, but existing methods often lack diversity or are too simplistic to generate the broad range of grammatical errors made by human writers.

Grammatical Error Correction Sentence +2

Seq2Edits: Sequence Transduction Using Span-level Edit Operations

1 code implementation EMNLP 2020 Felix Stahlberg, Shankar Kumar

For text normalization, sentence fusion, and grammatical error correction, our approach improves explainability by associating each edit operation with a human-readable tag.

Grammatical Error Correction Sentence +3

Using Context in Neural Machine Translation Training Objectives

no code implementations ACL 2020 Danielle Saunders, Felix Stahlberg, Bill Byrne

We find that each of these lines of research has a clear space in it for the other, and propose merging them with a scheme that allows a document-level evaluation metric to be used in the NMT training objective.

Grammatical Error Correction Machine Translation +3

Neural Machine Translation: A Review and Survey

2 code implementations4 Dec 2019 Felix Stahlberg

The field of machine translation (MT), the automatic translation of written text from one natural language into another, has experienced a major paradigm shift in recent years.

Machine Translation NMT +3

CUED@WMT19:EWC\&LMs

no code implementations WS 2019 Felix Stahlberg, Danielle Saunders, Adri{\`a} de Gispert, Bill Byrne

Two techniques provide the fabric of the Cambridge University Engineering Department{'}s (CUED) entry to the WMT19 evaluation campaign: elastic weight consolidation (EWC) and different forms of language modelling (LMs).

Language Modelling Sentence

The CUED's Grammatical Error Correction Systems for BEA-2019

no code implementations WS 2019 Felix Stahlberg, Bill Byrne

We describe two entries from the Cambridge University Engineering Department to the BEA 2019 Shared Task on grammatical error correction.

Grammatical Error Correction Machine Translation +1

Cued@wmt19:ewc&lms

no code implementations11 Jun 2019 Felix Stahlberg, Danielle Saunders, Adria de Gispert, Bill Byrne

Two techniques provide the fabric of the Cambridge University Engineering Department's (CUED) entry to the WMT19 evaluation campaign: elastic weight consolidation (EWC) and different forms of language modelling (LMs).

Language Modelling Sentence

Domain Adaptive Inference for Neural Machine Translation

no code implementations ACL 2019 Danielle Saunders, Felix Stahlberg, Adria de Gispert, Bill Byrne

We investigate adaptive ensemble weighting for Neural Machine Translation, addressing the case of improving performance on a new and potentially unknown domain without sacrificing performance on the original domain.

Machine Translation NMT +1

Neural Models of Text Normalization for Speech Applications

no code implementations CL 2019 Hao Zhang, Richard Sproat, Axel H. Ng, Felix Stahlberg, Xiaochang Peng, Kyle Gorman, Brian Roark

One problem that has been somewhat resistant to effective machine learning solutions is text normalization for speech applications such as text-to-speech synthesis (TTS).

BIG-bench Machine Learning Speech Synthesis +1

Simple Fusion: Return of the Language Model

1 code implementation WS 2018 Felix Stahlberg, James Cross, Veselin Stoyanov

Neural Machine Translation (NMT) typically leverages monolingual data in training through backtranslation.

Language Modelling Machine Translation +3

An Operation Sequence Model for Explainable Neural Machine Translation

1 code implementation WS 2018 Felix Stahlberg, Danielle Saunders, Bill Byrne

We propose to achieve explainable neural machine translation (NMT) by changing the output representation to explain itself.

Machine Translation NMT +3

The University of Cambridge's Machine Translation Systems for WMT18

no code implementations WS 2018 Felix Stahlberg, Adria de Gispert, Bill Byrne

The University of Cambridge submission to the WMT18 news translation task focuses on the combination of diverse models of translation.

Machine Translation Translation

Why not be Versatile? Applications of the SGNMT Decoder for Machine Translation

no code implementations WS 2018 Felix Stahlberg, Danielle Saunders, Gonzalo Iglesias, Bill Byrne

SGNMT is a decoding platform for machine translation which allows paring various modern neural models of translation with different kinds of constraints and symbolic models.

Machine Translation Translation

A Comparison of Neural Models for Word Ordering

1 code implementation WS 2017 Eva Hasler, Felix Stahlberg, Marcus Tomalin, Adri`a de Gispert, Bill Byrne

We compare several language models for the word-ordering task and propose a new bag-to-sequence neural model based on attention-based sequence-to-sequence models.

Unfolding and Shrinking Neural Machine Translation Ensembles

no code implementations EMNLP 2017 Felix Stahlberg, Bill Byrne

Ensembling is a well-known technique in neural machine translation (NMT) to improve system performance.

Machine Translation NMT +1

Neural Machine Translation by Minimising the Bayes-risk with Respect to Syntactic Translation Lattices

no code implementations EACL 2017 Felix Stahlberg, Adrià De Gispert, Eva Hasler, Bill Byrne

This makes our approach much more flexible than $n$-best list or lattice rescoring as the neural decoder is not restricted to the SMT search space.

Machine Translation NMT +1

Cannot find the paper you are looking for? You can Submit a new open access paper.