1 code implementation • WMT (EMNLP) 2021 • Pinzhen Chen, Jindřich Helcl, Ulrich Germann, Laurie Burchell, Nikolay Bogoychev, Antonio Valerio Miceli Barone, Jonas Waldendorf, Alexandra Birch, Kenneth Heafield
This paper presents the University of Edinburgh’s constrained submissions of English-German and English-Hausa systems to the WMT 2021 shared task on news translation.
1 code implementation • 2 Feb 2024 • Laurie Burchell, Alexandra Birch, Robert P. Thompson, Kenneth Heafield
Code switching (CS) is a very common phenomenon in written and spoken communication but one that is handled poorly by many natural language processing applications.
1 code implementation • 23 May 2023 • Laurie Burchell, Alexandra Birch, Nikolay Bogoychev, Kenneth Heafield
We achieve this by training on a curated dataset of monolingual data, the reliability of which we ensure by auditing a sample from each source and each language manually.
1 code implementation • 20 Oct 2022 • Faheem Kirefu, Vivek Iyer, Pinzhen Chen, Laurie Burchell
For subtask 1 we explored the effects of constrained decoding on English and transliterated subwords in order to produce Hinglish.
1 code implementation • DeepLo 2022 • Laurie Burchell, Alexandra Birch, Kenneth Heafield
We also find evidence that lexical diversity is more important than syntactic for back translation performance.
1 code implementation • COLING (LAW) 2020 • Laurie Burchell, Jie Chi, Tom Hosking, Nina Markl, Bonnie Webber
Multi-sentence questions (MSQs) are sequences of questions connected by relations which, unlike sequences of standalone questions, need to be answered as a unit.