no code implementations • 3 Apr 2021 • Lujun Li, Yikai Kang, Yuchen Shi, Ludwig Kürzinger, Tobias Watzel, Gerhard Rigoll
Inspired by the extensive applications of the generative adversarial networks (GANs) in speech enhancement and ASR tasks, we propose an adversarial joint training framework with the self-attention mechanism to boost the noise robustness of the ASR system.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 15 Oct 2020 • Ludwig Kürzinger, Nicolas Lindae, Palle Klewitz, Gerhard Rigoll
For this, we propose Lightweight Sinc-Convolutions (LSC) that integrate Sinc-convolutions with depthwise convolutions as a low-parameter machine-learnable feature extraction for end-to-end ASR systems.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 25 Jul 2020 • Iustina Andronic, Ludwig Kürzinger, Edgar Ricardo Chavez Rosas, Gerhard Rigoll, Bernhard U. Seeber
The present work proposes MP3 compression as a means to decrease the impact of Adversarial Noise (AN) in audio samples transcribed by ASR systems.
Audio and Speech Processing Cryptography and Security Sound
12 code implementations • 17 Jul 2020 • Ludwig Kürzinger, Dominik Winkelbauer, Lujun Li, Tobias Watzel, Gerhard Rigoll
In this work, we combine freely available corpora for German speech recognition, including yet unlabeled speech data, to a big dataset of over $1700$h of speech data.
Ranked #5 on
Speech Recognition
on TUDA
(using extra training data)
Speech Recognition
Audio and Speech Processing
no code implementations • 15 Jun 2020 • Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll
Nowadays, attention models are one of the popular candidates for speech recognition.
1 code implementation • 5 Nov 2019 • Simon Mittermaier, Ludwig Kürzinger, Bernd Waschneck, Gerhard Rigoll
Keyword Spotting (KWS) enables speech-based user interaction on smart devices.