Search Results for author: Josh Meyer

Found 7 papers, 3 papers with code

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

no code implementations • 22 Mar 2023 • Chris Chinenye Emezue, Sanchit Gandhi, Lewis Tunstall, Abubakar Abid, Josh Meyer, Quentin Lhoest, Pete Allen, Patrick von Platen, Douwe Kiela, Yacine Jernite, Julien Chaumond, Merve Noyan, Omar Sanseviero

The advancement of speech technologies has been remarkable, yet its integration with African languages remains limited due to the scarcity of African speech corpora.

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

1 code implementation • 7 Jul 2022 • Josh Meyer, David Ifeoluwa Adelani, Edresson Casanova, Alp Öktem, Daniel Whitenack, Julian Weber, Salomon Kabongo, Elizabeth Salesky, Iroro Orife, Colin Leong, Perez Ogayo, Chris Emezue, Jonathan Mukiibi, Salomey Osei, Apelete Agbolo, Victor Akinode, Bernard Opoku, Samuel Olanrewaju, Jesujoba Alabi, Shamsuddeen Muhammad

BibleTTS is a large, high-quality, open speech dataset for ten languages spoken in Sub-Saharan Africa.

Vocal Bursts Intensity Prediction

The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition

no code implementations • LREC 2022 • Jonathan Mukiibi, Andrew Katumba, Joyce Nakatumba-Nabende, Ali Hussein, Josh Meyer

Building a usable radio monitoring automatic speech recognition (ASR) system is a challenging task for under-resourced languages and yet this is paramount in societies where radio is the main medium of public communication and discussions.

Automatic Speech Recognition (ASR)

What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice

no code implementations • 10 May 2021 • Francis M. Tyers, Josh Meyer

This technical report describes the methods and results of a three-week sprint to produce deployable speech recognition models for 31 under-served languages of the Common Voice project.

Speech Recognition

Few-Shot Keyword Spotting in Any Language

2 code implementations • 3 Apr 2021 • Mark Mazumder, Colby Banbury, Josh Meyer, Pete Warden, Vijay Janapa Reddi

With just five training examples, we fine-tune the embedding model for keyword spotting and achieve an average F1 score of 0.75 on keyword classification for 180 new keywords unseen by the embedding model in these nine languages.

Keyword Spotting, Transfer Learning

Artie Bias Corpus: An Open Dataset for Detecting Demographic Bias in Speech Applications

no code implementations • LREC 2020 • Josh Meyer, Lindy Rauchenstein, Joshua D. Eisenberg, Nicholas Howell

We describe the creation of the Artie Bias Corpus, an English dataset of expert-validated <audio, transcript> pairs with demographic tags for age, gender, accent.

Automatic Speech Recognition (ASR)

Common Voice: A Massively-Multilingual Speech Corpus

2 code implementations • LREC 2020 • Rosana Ardila, Megan Branson, Kelly Davis, Michael Henretty, Michael Kohler, Josh Meyer, Reuben Morais, Lindsay Saunders, Francis M. Tyers, Gregor Weber

To our knowledge this is the largest audio corpus in the public domain for speech recognition, both in terms of number of hours and number of languages.

Automatic Speech Recognition (ASR)
