3 code implementations • 8 Nov 2022 • Juan Zuluaga-Gomez, Karel Veselý, Igor Szöke, Alexander Blatt, Petr Motlicek, Martin Kocour, Mickael Rigault, Khalid Choukri, Amrutha Prasad, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Claudia Cevenini, Pavel Kolčárek, Allan Tart, Jan Černocký, Dietrich Klakow
In this paper, we introduce the ATCO2 corpus, a dataset that aims at fostering research on the challenging ATC field, which has lagged behind due to lack of annotated data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +6
no code implementations • 13 Apr 2022 • Alexander Blatt, Martin Kocour, Karel Veselý, Igor Szöke, Dietrich Klakow
The introduced data augmentation adds additional performance on high WER transcripts and allows the adaptation of the model to unseen airspaces.
no code implementations • 8 Apr 2021 • Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Karel Veselý, Martin Kocour, Igor Szöke
Results show that `unseen domains' (e. g. data from airports not present in the supervised training data) are further aided by contextual SSL when compared to standalone SSL.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 30 Jan 2020 • Martin Karafiát, Murali Karthick Baskar, Igor Szöke, Hari Krishna Vydana, Karel Veselý, Jan "Honza'' Černocký
The paper describes the BUT Automatic Speech Recognition (ASR) systems submitted for OpenSAT evaluations under two domain categories such as low resourced languages and public safety communications.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • MediaEval 2015 Workshop 2015 • Miroslav Skácel, Igor Szöke
All our systems are based on Dynamic Time Warping (DTW).
Ranked #19 on Keyword Spotting on QUESST
no code implementations • 16 Oct 2014 • Igor Szöke, Miroslav Skácel, Lukáš Burget
The primary system we submitted was composed of 11 subsystems as the required run.
Ranked #1 on Keyword Spotting on QUESST (MinCnxe metric)