no code implementations • 14 Jul 2020 • Balázs Tarján, György Szaszák, Tibor Fegyó, Péter Mihajlik
Recently Deep Transformer models have proven to be particularly powerful in language modeling tasks for ASR.
no code implementations • 9 Jun 2020 • Balázs Tarján, György Szaszák, Tibor Fegyó, Péter Mihajlik
In our recent work we have significantly improved the online performance of a conversational speech transcription system by transferring knowledge from a Recurrent Neural Network Language Model (RNNLM) to the single pass BNLM with text generation based data augmentation.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 14 Nov 2019 • Dávid Sztahó, György Szaszák, András Beke
This paper summarizes the applied deep learning practices in the field of speaker recognition, both verification and identification.
no code implementations • 15 Jul 2019 • Balázs Tarján, György Szaszák, Tibor Fegyó, Péter Mihajlik
Recognition of Hungarian conversational telephone speech is challenging due to the informal style and morphological richness of the language.