no code implementations • AMTA 2016 • Rebecca Knowles, Philipp Koehn
We present an interactive translation prediction method based on neural machine translation.
no code implementations • WMT (EMNLP) 2020 • Rebecca Knowles, Darlene Stewart, Samuel Larkin, Patrick Littell
We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut-English shared task on news translation at the Fifth Conference on Machine Translation (WMT20).
no code implementations • NAACL (AmericasNLP) 2021 • Rebecca Knowles, Darlene Stewart, Samuel Larkin, Patrick Littell
We describe the NRC-CNRC systems submitted to the AmericasNLP shared task on machine translation.
no code implementations • WMT (EMNLP) 2020 • Rebecca Knowles, Samuel Larkin, Darlene Stewart, Patrick Littell
We describe the National Research Council of Canada (NRC) neural machine translation systems for the German-Upper Sorbian supervised track of the 2020 shared task on Unsupervised MT and Very Low Resource Supervised MT.
no code implementations • MTSummit 2021 • Samuel Larkin, Michel Simard, Rebecca Knowles
We revisit the topic of translation direction in the data used for training neural machine translation systems and focusing on a real-world scenario with known translation direction and imbalances in translation direction: the Canadian Hansard.
no code implementations • LREC 2022 • Rebecca Knowles, Patrick Littell
Low-resource machine translation research often requires building baselines to benchmark estimates of progress in translation quality.
no code implementations • WMT (EMNLP) 2021 • Rebecca Knowles, Samuel Larkin
We describe our neural machine translation systems for the 2021 shared task on Unsupervised and Very Low Resource Supervised MT, translating between Upper Sorbian and German (low-resource) and between Lower Sorbian and German (unsupervised).
1 code implementation • WMT (EMNLP) 2021 • Rebecca Knowles
The current approach to collecting human judgments of machine translation quality for the news translation task at WMT – segment rating with document context – is the most recent in a sequence of changes to WMT human annotation protocol.
no code implementations • COLING 2020 • Roland Kuhn, Fineen Davis, Alain D{\'e}silets, Eric Joanis, Anna Kazantseva, Rebecca Knowles, Patrick Littell, Delaney Lothian, Aidan Pine, Caroline Running Wolf, Eddie Santos, Darlene Stewart, Gilles Boulianne, Vishwa Gupta, Brian Maracle Owennat{\'e}kha, Akwirat{\'e}kha{'} Martin, Christopher Cox, Marie-Odile Junker, Olivia Sammons, Delasie Torkornoo, Nathan Thanyeht{\'e}nhas Brinklow, Sara Child, Beno{\^\i}t Farley, David Huggins-Daines, Daisy Rosenblum, Heather Souter
This paper surveys the first, three-year phase of a project at the National Research Council of Canada that is developing software to assist Indigenous communities in Canada in preserving their languages and extending their use.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 11 May 2020 • Lane Schwartz, Francis Tyers, Lori Levin, Christo Kirov, Patrick Littell, Chi-kiu Lo, Emily Prud'hommeaux, Hyunji Hayley Park, Kenneth Steimel, Rebecca Knowles, Jeffrey Micher, Lonny Strunk, Han Liu, Coleman Haley, Katherine J. Zhang, Robbie Jimmerson, Vasilisa Andriyanets, Aldrian Obaja Muis, Naoki Otani, Jong Hyuk Park, Zhisong Zhang
In the literature, languages like Finnish or Turkish are held up as extreme examples of complexity that challenge common modelling assumptions.
no code implementations • LREC 2020 • Eric Joanis, Rebecca Knowles, Rol Kuhn, , Samuel Larkin, Patrick Littell, Chi-kiu Lo, Darlene Stewart, Jeffrey Micher
This paper describes a newly released sentence-aligned Inuktitut{--}English corpus based on the proceedings of the Legislative Assembly of Nunavut, covering sessions from April 1999 to June 2017.
no code implementations • LREC 2020 • Graham Neubig, Shruti Rijhwani, Alexis Palmer, Jordan MacKenzie, Hilaria Cruz, Xinjian Li, Matthew Lee, Aditi Chaudhary, Luke Gessler, Steven Abney, Shirley Anugrah Hayati, Antonios Anastasopoulos, Olga Zamaraeva, Emily Prud'hommeaux, Jennette Child, Sara Child, Rebecca Knowles, Sarah Moeller, Jeffrey Micher, Yiyuan Li, Sydney Zink, Mengzhou Xia, Roshan S Sharma, Patrick Littell
Despite recent advances in natural language processing and other language technology, the application of such technology to language documentation and conservation has been limited.
no code implementations • IJCNLP 2019 • Brian Thompson, Rebecca Knowles, Xuan Zhang, Huda Khayrallah, Kevin Duh, Philipp Koehn
Bilingual lexicons are valuable resources used by professional human translators.
no code implementations • EMNLP 2018 • Rebecca Knowles, Philipp Koehn
In this work, we show that they learn to copy words based on both the context in which the words appear as well as features of the words themselves.
no code implementations • WS 2018 • Sachith Sri Ram Kothur, Rebecca Knowles, Philipp Koehn
It is common practice to adapt machine translation systems to novel domains, but even a well-adapted system may be able to perform better on a particular document if it were to learn from a translator{'}s corrections within the document itself.
no code implementations • WS 2017 • Philipp Koehn, Rebecca Knowles
We explore six challenges for neural machine translation: domain mismatch, amount of training data, rare words, long sentences, word alignment, and beam search.
no code implementations • EACL 2017 • Christo Kirov, John Sylak-Glassman, Rebecca Knowles, Ryan Cotterell, Matt Post
A traditional claim in linguistics is that all human languages are equally expressive{---}able to convey the same wide range of meanings.