no code implementations • IWSLT (ACL) 2022 • Ngoc-Quan Pham, Tuan Nam Nguyen, Thai-Binh Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Alexander Waibel
Pretrained models in acoustic and textual modalities can potentially improve speech translation for both Cascade and End-to-end approaches.
no code implementations • 27 Nov 2024 • Thai-Binh Nguyen, Alexander Waibel
Speaker-attributed automatic speech recognition (SA-ASR) aims to transcribe speech while assigning transcripts to the corresponding speakers accurately.
no code implementations • 15 Oct 2024 • Fevziye Irem Eyiokur, Christian Huber, Thai-Binh Nguyen, Tuan-Nam Nguyen, Fabian Retkowski, Enes Yavuz Ugan, Dogucan Yaman, Alexander Waibel
In this paper, we report on communication experiments conducted in the summer of 2022 during a deep dive to the wreck of the Titanic.
no code implementations • 24 Jun 2024 • Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues
Firstly, we refine the ASR outputs by utilizing the N-best lists generated by our system and fine-tuning the LLM to predict the transcript accurately.
no code implementations • 22 Aug 2023 • Thai-Binh Nguyen, Alexander Waibel
The model utilizes a single-channel speech enhancement module that isolates the speaker's voice from background noise (ConVoiFilter) and an ASR module.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 8 Jun 2023 • Thi-Hai-Yen Vuong, Hai-Long Nguyen, Tan-Minh Nguyen, Hoang-Trung Nguyen, Thai-Binh Nguyen, Ha-Thanh Nguyen
This paper presents the NOWJ team's approach to the COLIEE 2023 Competition, which focuses on advancing legal information processing techniques and applying them to real-world legal scenarios.
no code implementations • 11 Apr 2023 • Tan-Minh Nguyen, Thai-Binh Nguyen, Hoang-Trung Nguyen, Hai-Long Nguyen, Tam Doan Thanh, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong
Multi-document summarization is challenging because the summaries should not only describe the most important information from all documents but also provide a coherent interpretation of the documents.