3 code implementations • 8 Nov 2022 • Juan Zuluaga-Gomez, Karel Veselý, Igor Szöke, Alexander Blatt, Petr Motlicek, Martin Kocour, Mickael Rigault, Khalid Choukri, Amrutha Prasad, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Claudia Cevenini, Pavel Kolčárek, Allan Tart, Jan Černocký, Dietrich Klakow
In this paper, we introduce the ATCO2 corpus, a dataset that aims at fostering research on the challenging ATC field, which has lagged behind due to lack of annotated data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +6
1 code implementation • 31 Oct 2021 • Martin Kocour, Kateřina Žmolíková, Lucas Ondel, Ján Švec, Marc Delcroix, Tsubasa Ochiai, Lukáš Burget, Jan Černocký
We modify the acoustic model to predict joint state posteriors for all speakers, enabling the network to express uncertainty about the attribution of parts of the speech signal to the speakers.
1 code implementation • 15 Aug 2022 • Ján Švec, Kateřina Žmolíková, Martin Kocour, Marc Delcroix, Tsubasa Ochiai, Ladislav Mošner, Jan Černocký
One of the factors causing such degradation may be intrinsic speaker variability, such as emotions, occurring commonly in realistic speech.
no code implementations • 29 Jan 2021 • Martin Kocour, Guillermo Cámbara, Jordi Luque, David Bonet, Mireia Farrús, Martin Karafiát, Karel Veselý, Jan ''Honza'' Ĉernocký
This paper describes joint effort of BUT and Telef\'onica Research on development of Automatic Speech Recognition systems for Albayzin 2020 Challenge.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 6 Apr 2021 • Igor Szoke, Santosh Kesiraju, Ondrej Novotny, Martin Kocour, Karel Vesely, Jan "Honza" Cernocky
The proposed English Language Detection (ELD) system is based on the embeddings from Bayesian subspace multinomial model.
no code implementations • 8 Apr 2021 • Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Karel Veselý, Martin Kocour, Igor Szöke
Results show that `unseen domains' (e. g. data from airports not present in the supervised training data) are further aided by contextual SSL when compared to standalone SSL.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 22 Oct 2021 • Lucas Ondel, Léa-Marie Lam-Yee-Mui, Martin Kocour, Caio Filippo Corro, Lukáš Burget
We propose to express the forward-backward algorithm in terms of operations between sparse matrices in a specific semiring.
no code implementations • 13 Apr 2022 • Alexander Blatt, Martin Kocour, Karel Veselý, Igor Szöke, Dietrich Klakow
The introduced data augmentation adds additional performance on high WER transcripts and allows the adaptation of the model to unseen airspaces.
no code implementations • LEGAL (LREC) 2022 • Mickaël Rigault, Claudia Cevenini, Khalid Choukri, Martin Kocour, Karel Veselý, Igor Szoke, Petr Motlicek, Juan Pablo Zuluaga-Gomez, Alexander Blatt, Dietrich Klakow, Allan Tart, Pavel Kolčárek, Jan Černocký
In this paper the authors detail the various legal and ethical issues faced during the ATCO2 project.
no code implementations • 21 May 2023 • Karel Beneš, Martin Kocour, Lukáš Burget
Furthermore, we show that utilizing Hystoc in fusion of multiple e2e ASR systems increases the gains from the fusion by up to 1\,\% WER absolute on Spanish RTVE2020 dataset.