Search Results for author: Christiaan Jacobs

Found 5 papers, 2 papers with code

Multilingual acoustic word embeddings for zero-resource languages

no code implementations • 19 Jan 2024 • Christiaan Jacobs

This research addresses the challenge of developing speech applications for zero-resource languages that lack labelled data.

Hate Speech Detection Keyword Spotting +1

Paper
Add Code

Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings

no code implementations • 5 Jul 2023 • Christiaan Jacobs, Herman Kamper

Acoustic word embeddings (AWEs) are fixed-dimensional vector representations of speech segments that encode phonetic content so that different realisations of the same word have similar embeddings.

Word Embeddings Word Similarity

Paper
Add Code

Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili

no code implementations • 1 Jun 2023 • Christiaan Jacobs, Nathanaël Carraz Rakotonirina, Everlyn Asiko Chimoto, Bruce A. Bassett, Herman Kamper

But in an in-the-wild test on Swahili radio broadcasts with actual hate speech keywords, the AWE model (using one minute of template data) is more robust, giving similar performance to an ASR system trained on 30 hours of labelled data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4