no code implementations • 14 Jan 2024 • Sergio Duarte-Torres, Arunasish Sen, Aman Rana, Lukas Drude, Alejandro Gomez-Alanis, Andreas Schwarz, Leif Rädel, Volker Leutnant
Context cues carry information that can improve multi-turn interactions in automatic speech recognition (ASR) systems.
Automatic Speech Recognition (ASR)
no code implementations • 6 Sep 2023 • Matthias Wagner, Oliver Lang, Esmaeil Kavousi Ghafi, Andreas Schwarz, Mario Huemer
In our previous work, we proposed the homogeneity enforced calibration (HEC) approach, which circumvents this need by consecutively feeding a test signal and a scaled version of it into the ADC.
no code implementations • 23 May 2023 • Andreas Schwarz, Di He, Maarten Van Segbroeck, Mohammed Hethnawi, Ariya Rastrow
Streaming Automatic Speech Recognition (ASR) in voice assistants can utilize prefetching to partially hide the latency of response generation.
Automatic Speech Recognition (ASR)
no code implementations • 27 Oct 2022 • Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler
Also, we propose a dual-mode contextual-utterance training technique for streaming automatic speech recognition (ASR) systems.
Automatic Speech Recognition (ASR)
no code implementations • 22 Jul 2022 • Patrick Ofner, Joana Pereira, Reinmar Kobler, Andreas Schwarz, Gernot R. Müller-Putz
Bibián et al. show in their recent paper (Bibián et al. 2021) that eye and head movements can affect the EEG-based classification in a reaching motor task.
no code implementations • 15 Jun 2021 • Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin
Automatic speech recognition (ASR) in the cloud allows the use of larger models and more powerful multi-channel signal processing front-ends compared to on-device processing.
Automatic Speech Recognition (ASR)
no code implementations • 20 Nov 2020 • Andreas Schwarz, Ilya Sklyar, Simon Wiesler
We present a training scheme for streaming automatic speech recognition (ASR) based on recurrent neural network transducers (RNN-T) which allows the encoder network to learn to exploit context audio from a stream, using segmented or partially labeled sequences of the stream during training.
Automatic Speech Recognition (ASR)
1 code implementation • 12 Feb 2015 • Andreas Schwarz, Walter Kellermann
Several novel unbiased CDR estimators are proposed, and it is shown that knowledge of either the direction of arrival (DOA) of the target source or the coherence of the noise field is sufficient for unbiased CDR estimation.
Sound
no code implementations • 9 Oct 2014 • Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann
We propose a spatial diffuseness feature for deep neural network (DNN)-based automatic speech recognition to improve recognition accuracy in reverberant and noisy environments.
Automatic Speech Recognition (ASR)