no code implementations • 28 Jul 2018 • Emre Yilmaz, Astik Biswas, Ewald van der Westhuizen, Febe De Wet, Thomas Niesler
We present our first efforts towards building a single multilingual automatic speech recognition (ASR) system that can process code-switching (CS) speech in five languages spoken within the same population.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 16 Apr 2019 • Ryan Eloff, André Nortje, Benjamin van Niekerk, Avashna Govender, Leanne Nortje, Arnu Pretorius, Elan van Biljon, Ewald van der Westhuizen, Lisa van Staden, Herman Kamper
For our submission to the ZeroSpeech 2019 challenge, we apply discrete latent-variable neural networks to unlabelled speech and use the discovered units for speech synthesis.
no code implementations • 20 Jun 2019 • Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
Furthermore, because English is common to all language pairs in our data, it dominates when training a unified language model, leading to improved English ASR performance at the expense of the other languages.
no code implementations • 6 Jul 2019 • Astik Biswas, Raghav Menon, Ewald van der Westhuizen, Thomas Niesler
The automatic transcriptions from the best performing pass were used for language model augmentation.
no code implementations • 14 Nov 2018 • Raghav Menon, Herman Kamper, Ewald van der Westhuizen, John Quinn, Thomas Niesler
We compare features for dynamic time warping (DTW) when used to bootstrap keyword spotting (KWS) in an almost zero-resource setting.
no code implementations • LREC 2020 • Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
This paper reports on the semi-supervised development of acoustic and language models for under-resourced, code-switched speech in five South African languages.
no code implementations • LREC 2020 • Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
We present an analysis of semi-supervised acoustic and language model training for English-isiZulu code-switched (CS) ASR using soap opera speech.
no code implementations • LREC 2020 • Nick Wilkinson, Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
Automatic segmentation was applied in combination with automaticspeaker diarization.
1 code implementation • 31 Oct 2020 • Trideba Padhi, Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
In this work, we explore the benefits of using multilingual bottleneck features (mBNF) in acoustic modelling for the automatic speech recognition of code-switched (CS) speech in African languages.
no code implementations • 13 Aug 2021 • Ewald van der Westhuizen, Herman Kamper, Raghav Menon, John Quinn, Thomas Niesler
We show that, using these features, the CNN-DTW keyword spotter performs almost as well as the DTW keyword spotter while outperforming a baseline CNN trained only on the keyword templates.
no code implementations • 13 Aug 2021 • Ewald van der Westhuizen, Trideba Padhi, Thomas Niesler
We find that, although maximising the training pool by including all six additional languages provides improved speech recognition in both target languages, substantially better performance can be achieved by a more judicious choice.