Search Results for author: Thai-Son Nguyen

Found 18 papers, 1 papers with code

Using multi-task learning to improve the performance of acoustic-to-word and conventional hybrid models

no code implementations2 Feb 2019 Thai-Son Nguyen, Sebastian Stueker, Alex Waibel

Acoustic-to-word (A2W) models that allow direct mapping from acoustic signals to word sequences are an appealing approach to end-to-end automatic speech recognition due to their simplicity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Learning Shared Encoding Representation for End-to-End Speech Recognition Models

no code implementations31 Mar 2019 Thai-Son Nguyen, Sebastian Stueker, Alex Waibel

In this work, we learn a shared encoding representation for a multi-task neural network model optimized with connectionist temporal classification (CTC) and conventional framewise cross-entropy training criteria.

Deep Attention General Classification +2

Very Deep Self-Attention Networks for End-to-End Speech Recognition

no code implementations30 Apr 2019 Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Sebastian Stüker, Alexander Waibel

Recently, end-to-end sequence-to-sequence models for speech recognition have gained significant interest in the research community.

speech-recognition Speech Recognition

Toward Cross-Domain Speech Recognition with End-to-End Models

no code implementations9 Mar 2020 Thai-Son Nguyen, Sebastian Stüker, Alex Waibel

We show that for the hybrid models, supplying additional training data from other domains with mismatched acoustic conditions does not increase the performance on specific domains.

speech-recognition Speech Recognition

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition

no code implementations22 Mar 2020 Thai-Son Nguyen, Ngoc-Quan Pham, Sebastian Stueker, Alex Waibel

However, when it comes to performing run-on recognition on an input stream of audio data while producing recognition results in real-time and with low word-based latency, these models face several challenges.

speech-recognition Speech Recognition +1

Relative Positional Encoding for Speech Recognition and Direct Translation

no code implementations20 May 2020 Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alexander Waibel

We also show that this model is able to better utilize synthetic data than the Transformer, and adapts better to variable sentence segmentation quality for speech translation.

Position Sentence +4

Super-Human Performance in Online Low-latency Recognition of Conversational Speech

1 code implementation7 Oct 2020 Thai-Son Nguyen, Sebastian Stueker, Alex Waibel

Achieving super-human performance in recognizing human speech has been a goal for several decades, as researchers have worked on increasingly challenging tasks.

The 2016 KIT IWSLT Speech-to-Text Systems for English and German

no code implementations IWSLT 2016 Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel

For the English TED task, our best combination system has a WER of 7. 8% on the development set while our other combinations gained 21. 8% and 28. 7% WERs for the English and German MSLT tasks.

The 2017 KIT IWSLT Speech-to-Text Systems for English and German

no code implementations IWSLT 2017 Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Sebastian Stüker, Alex Waibel

For the English lecture task, our best combination system has a WER of 8. 3% on the tst2015 development set while our other combinations gained 25. 7% WER for German lecture tasks.

Multi-stage Large Language Model Correction for Speech Recognition

no code implementations17 Oct 2023 Jie Pu, Thai-Son Nguyen, Sebastian Stüker

In this paper, we investigate the usage of large language models (LLMs) to improve the performance of competitive speech recognition systems.

Language Modelling Large Language Model +2

Cannot find the paper you are looking for? You can Submit a new open access paper.