no code implementations • 15 Sep 2023 • Peter Vieting, Simon Berger, Thilo von Neumann, Christoph Boeddeker, Ralf Schlüter, Reinhold Haeb-Umbach
This mixture encoder leverages the original overlapped speech to mitigate the effect of artifacts introduced by the speech separation.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 21 Jun 2023 • Simon Berger, Peter Vieting, Christoph Boeddeker, Ralf Schlüter, Reinhold Haeb-Umbach
Modular approaches separate speakers and recognize each of them with a single-speaker ASR system.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 28 May 2023 • Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney
Modern public ASR tools usually provide rich support for training various sequence-to-sequence (S2S) models, but rather simple support for decoding open-vocabulary scenarios only.
no code implementations • 30 Oct 2020 • Wei Zhou, Simon Berger, Ralf Schlüter, Hermann Ney
To join the advantages of classical and end-to-end approaches for speech recognition, we present a simple, novel and competitive approach for phoneme-based neural transducer modeling.