no code implementations • 7 Aug 2023 • Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel
Secondly, we compare different approaches to low-latency speech translation using this framework.
1 code implementation • 8 Jun 2023 • Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues
In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks.
1 code implementation • https://github.com/vietai/ASR 2021 • Thai Binh Nguyen
Our models are pre-trained on 13k hours of Vietnamese youtube audio (un-label data) and fine-tuned on 250 hours labeled of VLSP ASR dataset on 16kHz sampled speech audio.
Ranked #1 on Speech Recognition on VIVOS (using extra training data)
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 1 Oct 2020 • Thai Binh Nguyen, Quang Minh Nguyen, Thi Thu Hien Nguyen, Quoc Truong Do, Chi Mai Luong
The model takes input text from an ASR system and performs two tasks at the same time, producing proper text formatting that helps to improve NER performance.
no code implementations • 12 Oct 2019 • Quang Minh Nguyen, Thai Binh Nguyen, Ngoc Phuong Pham, The Loc Nguyen
Automatic Speech Recognition (ASR) systems have been evolving quickly and reaching human parity in certain cases.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 12 Oct 2019 • Thai Binh Nguyen, Quang Minh Nguyen, Thu Hien Nguyen, Ngoc Phuong Pham, The Loc Nguyen, Quoc Truong Do
Nowadays, Social network sites (SNSs) such as Facebook, Twitter are common places where people show their opinions, sentiments and share information with others.