no code implementations • COLING 2022 • Éric Le Ferrand, Steven Bird, Laurent Besacier
An increasing number of papers have been addressing issues related to low-resource languages and the transcription bottleneck paradigm.
no code implementations • 11 Jun 2021 • Éric Le Ferrand, Steven Bird, Laurent Besacier
We investigate the efficiency of two very different spoken term detection approaches for transcription when the available data is insufficient to train a robust ASR system.
no code implementations • COLING 2020 • Éric Le Ferrand, Steven Bird, Laurent Besacier
We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment.
1 code implementation • LREC 2020 • Marcely Zanon Boito, William N. Havard, Mahault Garnerin, Éric Le Ferrand, Laurent Besacier
However, the fact that the source content (the Bible) is the same for all the languages is not exploited to date. Therefore, this article proposes to add multilingual links between speech segments in different languages, and shares a large and clean dataset of 8,130 parallel spoken utterances across 8 languages (56 language pairs).
Automatic Speech Recognition (ASR)