no code implementations • 22 Mar 2023 • Chris Chinenye Emezue, Sanchit Gandhi, Lewis Tunstall, Abubakar Abid, Josh Meyer, Quentin Lhoest, Pete Allen, Patrick von Platen, Douwe Kiela, Yacine Jernite, Julien Chaumond, Merve Noyan, Omar Sanseviero
The advancement of speech technologies has been remarkable, yet its integration with African languages remains limited due to the scarcity of African speech corpora.
1 code implementation • 7 Jul 2022 • Josh Meyer, David Ifeoluwa Adelani, Edresson Casanova, Alp Öktem, Daniel Whitenack, Julian Weber, Salomon Kabongo, Elizabeth Salesky, Iroro Orife, Colin Leong, Perez Ogayo, Chris Emezue, Jonathan Mukiibi, Salomey Osei, Apelete Agbolo, Victor Akinode, Bernard Opoku, Samuel Olanrewaju, Jesujoba Alabi, Shamsuddeen Muhammad
BibleTTS is a large, high-quality, open speech dataset for ten languages spoken in Sub-Saharan Africa.
no code implementations • LREC 2022 • Jonathan Mukiibi, Andrew Katumba, Joyce Nakatumba-Nabende, Ali Hussein, Josh Meyer
Building a usable radio monitoring automatic speech recognition (ASR) system is a challenging task for under-resourced languages and yet this is paramount in societies where radio is the main medium of public communication and discussions.
Automatic Speech Recognition (ASR) +1
no code implementations • 10 May 2021 • Francis M. Tyers, Josh Meyer
This technical report describes the methods and results of a three-week sprint to produce deployable speech recognition models for 31 under-served languages of the Common Voice project.
2 code implementations • 3 Apr 2021 • Mark Mazumder, Colby Banbury, Josh Meyer, Pete Warden, Vijay Janapa Reddi
With just five training examples, we fine-tune the embedding model for keyword spotting and achieve an average F1 score of 0.75 on keyword classification for 180 new keywords unseen by the embedding model in these nine languages.
no code implementations • LREC 2020 • Josh Meyer, Lindy Rauchenstein, Joshua D. Eisenberg, Nicholas Howell
We describe the creation of the Artie Bias Corpus, an English dataset of expert-validated <audio, transcript> pairs with demographic tags for age, gender, accent.
Automatic Speech Recognition (ASR) +1
2 code implementations • LREC 2020 • Rosana Ardila, Megan Branson, Kelly Davis, Michael Henretty, Michael Kohler, Josh Meyer, Reuben Morais, Lindsay Saunders, Francis M. Tyers, Gregor Weber
To our knowledge this is the largest audio corpus in the public domain for speech recognition, both in terms of number of hours and number of languages.
Automatic Speech Recognition (ASR) +3