no code implementations • 19 Jan 2024 • Christiaan Jacobs
This research addresses the challenge of developing speech applications for zero-resource languages that lack labelled data.
no code implementations • 5 Jul 2023 • Christiaan Jacobs, Herman Kamper
Acoustic word embeddings (AWEs) are fixed-dimensional vector representations of speech segments that encode phonetic content so that different realisations of the same word have similar embeddings.
no code implementations • 1 Jun 2023 • Christiaan Jacobs, Nathanaël Carraz Rakotonirina, Everlyn Asiko Chimoto, Bruce A. Bassett, Herman Kamper
But in an in-the-wild test on Swahili radio broadcasts with actual hate speech keywords, the AWE model (using one minute of template data) is more robust, giving similar performance to an ASR system trained on 30 hours of labelled data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
2 code implementations • 24 Jun 2021 • Christiaan Jacobs, Herman Kamper
Through finer-grained analysis, we show that training on even just a single related language gives the largest gain.
2 code implementations • 19 Mar 2021 • Christiaan Jacobs, Yevgen Matusevych, Herman Kamper
We consider how a recent contrastive learning loss can be used in both the purely unsupervised and multilingual transfer settings.