2 code implementations • SIGUL (LREC) 2022 • Patrick Littell, Eric Joanis, Aidan Pine, Marc Tessier, David Huggins-Daines, Delasie Torkornoo
While the alignment of audio recordings and text (often termed “forced alignment”) is often treated as a solved problem, in practice the process of adapting an alignment system to a new, under-resourced language comes with significant challenges, requiring experience and expertise that many outside of the speech community lack.
no code implementations • ComputEL (ACL) 2022 • Aidan Pine, Patrick William Littell, Eric Joanis, David Huggins-Daines, Christopher Cox, Fineen Davis, Eddie Antonio Santos, Shankhalika Srikanth, Delasie Torkornoo, Sabrina Yu
This paper describes the motivation and implementation details for a rule-based, index-preserving grapheme-to-phoneme engine ‘G_i2P_i’ implemented in pure Python and released under the open-source MIT license.
1 code implementation • ACL 2022 • Aidan Pine, Dan Wells, Nathan Brinklow, Patrick Littell, Korin Richmond
This paper describes the motivation and development of speech synthesis systems for the purposes of language revitalization.
1 code implementation • 13 Jun 2024 • Cheng Gong, Erica Cooper, Xin Wang, Chunyu Qiang, Mengzhe Geng, Dan Wells, Longbiao Wang, Jianwu Dang, Marc Tessier, Aidan Pine, Korin Richmond, Junichi Yamagishi
Self-supervised learning (SSL) representations from massively multilingual models offer a promising solution for low-resource language speech tasks.
no code implementations • COLING 2020 • Roland Kuhn, Fineen Davis, Alain Désilets, Eric Joanis, Anna Kazantseva, Rebecca Knowles, Patrick Littell, Delaney Lothian, Aidan Pine, Caroline Running Wolf, Eddie Santos, Darlene Stewart, Gilles Boulianne, Vishwa Gupta, Brian Maracle Owennatékha, Akwiratékha’ Martin, Christopher Cox, Marie-Odile Junker, Olivia Sammons, Delasie Torkornoo, Nathan Thanyehténhas Brinklow, Sara Child, Benoît Farley, David Huggins-Daines, Daisy Rosenblum, Heather Souter
This paper surveys the first, three-year phase of a project at the National Research Council of Canada that is developing software to assist Indigenous communities in Canada in preserving their languages and extending their use.
Automatic Speech Recognition (ASR)
no code implementations • COLING 2018 • Patrick Littell, Anna Kazantseva, Roland Kuhn, Aidan Pine, Antti Arppe, Christopher Cox, Marie-Odile Junker
In this article, we discuss which text, speech, and image technologies have been developed, and would be feasible to develop, for the approximately 60 Indigenous languages spoken in Canada.
Optical Character Recognition (OCR)
no code implementations • COLING 2018 • Anna Kazantseva, Owennatekha Brian Maracle, Ronkwe’tiyóhstha Josiah Maracle, Aidan Pine
In this paper we describe preliminary work on Kawennón:nis, a verb conjugator for Kanyen’kéha (Ohsweken dialect).