Search Results for author: Tuan-Nam Nguyen

Found 8 papers, 1 papers with code

Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck

no code implementations15 Oct 2024 Fevziye Irem Eyiokur, Christian Huber, Thai-Binh Nguyen, Tuan-Nam Nguyen, Fabian Retkowski, Enes Yavuz Ugan, Dogucan Yaman, Alexander Waibel

In this paper, we report on communication experiments conducted in the summer of 2022 during a deep dive to the wreck of the Titanic.

KIT's Multilingual Speech Translation System for IWSLT 2023

1 code implementation8 Jun 2023 Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks.

Data Augmentation Retrieval +1

Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

no code implementations9 Jun 2022 Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazim Kemal Ekenel

The system is designed to combine multiple component models and produces a video of the original speaker speaking in the target language that is lip-synchronous with the target speech, yet maintains emphases in speech, voice characteristics, face video of the original speaker.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +7

Efficient Weight factorization for Multilingual Speech Recognition

no code implementations7 May 2021 Ngoc-Quan Pham, Tuan-Nam Nguyen, Sebastian Stueker, Alexander Waibel

The key idea of the method is to assign fast weight matrices for each language by decomposing each weight matrix into a shared component and a language dependent component.

speech-recognition Speech Recognition

Relative Positional Encoding for Speech Recognition and Direct Translation

no code implementations20 May 2020 Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alexander Waibel

We also show that this model is able to better utilize synthetic data than the Transformer, and adapts better to variable sentence segmentation quality for speech translation.

Position Sentence +4

Cannot find the paper you are looking for? You can Submit a new open access paper.