no code implementations • WMT (EMNLP) 2020 • Rebecca Knowles, Darlene Stewart, Samuel Larkin, Patrick Littell
We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut-English shared task on news translation at the Fifth Conference on Machine Translation (WMT20).
no code implementations • WMT (EMNLP) 2020 • Chi-kiu Lo, Samuel Larkin
We present a study on using YiSi-2 with massive multilingual pretrained language models for machine translation (MT) reference-less evaluation.
no code implementations • WMT (EMNLP) 2021 • Rebecca Knowles, Samuel Larkin
We describe our neural machine translation systems for the 2021 shared task on Unsupervised and Very Low Resource Supervised MT, translating between Upper Sorbian and German (low-resource) and between Lower Sorbian and German (unsupervised).
no code implementations • MTSummit 2021 • Samuel Larkin, Michel Simard, Rebecca Knowles
We revisit the topic of translation direction in the data used for training neural machine translation systems, focusing on a real-world scenario with known translation direction and imbalances in translation direction: the Canadian Hansard.
no code implementations • NAACL (AmericasNLP) 2021 • Rebecca Knowles, Darlene Stewart, Samuel Larkin, Patrick Littell
We describe the NRC-CNRC systems submitted to the AmericasNLP shared task on machine translation.
no code implementations • WMT (EMNLP) 2020 • Rebecca Knowles, Samuel Larkin, Darlene Stewart, Patrick Littell
We describe the National Research Council of Canada (NRC) neural machine translation systems for the German-Upper Sorbian supervised track of the 2020 shared task on Unsupervised MT and Very Low Resource Supervised MT.
no code implementations • LREC 2020 • Eric Joanis, Rebecca Knowles, Roland Kuhn, Samuel Larkin, Patrick Littell, Chi-kiu Lo, Darlene Stewart, Jeffrey Micher
This paper describes a newly released sentence-aligned Inuktitut–English corpus based on the proceedings of the Legislative Assembly of Nunavut, covering sessions from April 1999 to June 2017.
no code implementations • WS 2019 • Patrick Littell, Chi-kiu Lo, Samuel Larkin, Darlene Stewart
We describe the neural machine translation (NMT) system developed at the National Research Council of Canada (NRC) for the Kazakh-English news translation task of the Fourth Conference on Machine Translation (WMT19).
no code implementations • WS 2018 • Chi-kiu Lo, Michel Simard, Darlene Stewart, Samuel Larkin, Cyril Goutte, Patrick Littell
We present our semantic textual similarity approach in filtering a noisy web-crawled parallel corpus using YiSi, a novel semantic machine translation evaluation metric.
no code implementations • WS 2018 • Patrick Littell, Samuel Larkin, Darlene Stewart, Michel Simard, Cyril Goutte, Chi-kiu Lo
The WMT18 shared task on parallel corpus filtering (Koehn et al., 2018b) challenged teams to score sentence pairs from a large high-recall, low-precision web-scraped parallel corpus (Koehn et al., 2018a).
no code implementations • WS 2017 • Boxing Chen, Colin Cherry, George Foster, Samuel Larkin
We compare cost weighting to two traditional domain adaptation techniques developed for statistical machine translation: data selection and sub-corpus weighting.