no code implementations • 15 Sep 2023 • Peter Vieting, Simon Berger, Thilo von Neumann, Christoph Boeddeker, Ralf Schlüter, Reinhold Haeb-Umbach
This mixture encoder leverages the original overlapped speech to mitigate the effect of artifacts introduced by the speech separation.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 8 Aug 2023 • Peter Vieting, Ralf Schlüter, Hermann Ney
In this work, we study its capability to replace the standard feature extraction methods in a connectionist temporal classification (CTC) ASR model and compare it to an alternative neural FE.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 21 Jun 2023 • Simon Berger, Peter Vieting, Christoph Boeddeker, Ralf Schlüter, Reinhold Haeb-Umbach
Modular approaches separate speakers and recognize each of them with a single-speaker ASR system.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 26 Oct 2022 • Peter Vieting, Christoph Lüscher, Julian Dierkes, Ralf Schlüter, Hermann Ney
Unsupervised representation learning has recently helped automatic speech recognition (ASR) to tackle tasks with limited labeled data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 24 Oct 2022 • Christoph Lüscher, Mohammad Zeineldeen, Zijian Yang, Tina Raissi, Peter Vieting, Khai Le-Duc, Weiyue Wang, Ralf Schlüter, Hermann Ney
Language barriers present a great challenge in our increasingly connected and global world.
no code implementations • 9 Apr 2021 • Peter Vieting, Christoph Lüscher, Wilfried Michel, Ralf Schlüter, Hermann Ney
With the success of neural network based modeling in automatic speech recognition (ASR), many studies investigated acoustic modeling and learning of feature extractors directly based on the raw waveform.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2