Search Results for author: Ngoc-Quan Pham

Found 28 papers, 2 papers with code

Effective combination of pretrained models - KIT@IWSLT2022

no code implementations IWSLT (ACL) 2022 Ngoc-Quan Pham, Tuan Nam Nguyen, Thai-Binh Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Alexander Waibel

Pretrained models in acoustic and textual modalities can potentially improve speech translation for both Cascade and End-to-end approaches.

Translation

Multilingual Speech Translation KIT @ IWSLT2021

no code implementations ACL (IWSLT) 2021 Ngoc-Quan Pham, Tuan Nam Nguyen, Thanh-Le Ha, Sebastian Stüker, Alexander Waibel, Dan He

This paper contains the description for the submission of Karlsruhe Institute of Technology (KIT) for the multilingual TEDx translation task in the IWSLT 2021 evaluation campaign.

Translation

KIT’s Multilingual Neural Machine Translation systems for IWSLT 2017

no code implementations IWSLT 2017 Ngoc-Quan Pham, Matthias Sperber, Elizabeth Salesky, Thanh-Le Ha, Jan Niehues, Alexander Waibel

For the SLT track, in addition to a monolingual neural translation system used to generate correct punctuations and true cases of the data prior to training our multilingual system, we introduced a noise model in order to make our system more robust.

Machine Translation NMT +1

Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages

no code implementations5 Aug 2024 Carlos Mullov, Ngoc-Quan Pham, Alexander Waibel

We explore how this zero-shot translation capability develops with varying number of languages seen by the encoder.

Cross-Lingual Word Embeddings Sentence +3

Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024

no code implementations24 Jun 2024 Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues

Firstly, we refine the ASR outputs by utilizing the N-best lists generated by our system and fine-tuning the LLM to predict the transcript accurately.

Action Detection Activity Detection +5

KIT's Multilingual Speech Translation System for IWSLT 2023

1 code implementation8 Jun 2023 Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks.

Data Augmentation Retrieval +1

Towards continually learning new languages

no code implementations21 Nov 2022 Ngoc-Quan Pham, Jan Niehues, Alexander Waibel

Multilingual speech recognition with neural networks is often implemented with batch-learning, when all of the languages are available before training.

speech-recognition Speech Recognition +1

Adaptive multilingual speech recognition with pretrained models

no code implementations24 May 2022 Ngoc-Quan Pham, Alex Waibel, Jan Niehues

Multilingual speech recognition with supervised learning has achieved great results as reflected in recent research.

speech-recognition Speech Recognition

Efficient Weight factorization for Multilingual Speech Recognition

no code implementations7 May 2021 Ngoc-Quan Pham, Tuan-Nam Nguyen, Sebastian Stueker, Alexander Waibel

The key idea of the method is to assign fast weight matrices for each language by decomposing each weight matrix into a shared component and a language dependent component.

speech-recognition Speech Recognition

Relative Positional Encoding for Speech Recognition and Direct Translation

no code implementations20 May 2020 Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alexander Waibel

We also show that this model is able to better utilize synthetic data than the Transformer, and adapts better to variable sentence segmentation quality for speech translation.

Position Sentence +4

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition

no code implementations22 Mar 2020 Thai-Son Nguyen, Ngoc-Quan Pham, Sebastian Stueker, Alex Waibel

However, when it comes to performing run-on recognition on an input stream of audio data while producing recognition results in real-time and with low word-based latency, these models face several challenges.

speech-recognition Speech Recognition +1

Self-Attentional Models for Lattice Inputs

no code implementations ACL 2019 Matthias Sperber, Graham Neubig, Ngoc-Quan Pham, Alex Waibel

Lattices are an efficient and effective method to encode ambiguity of upstream systems in natural language processing tasks, for example to compactly capture multiple speech recognition hypotheses, or to represent multiple linguistic analyses.

Computational Efficiency speech-recognition +2

Very Deep Self-Attention Networks for End-to-End Speech Recognition

no code implementations30 Apr 2019 Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Sebastian Stüker, Alexander Waibel

Recently, end-to-end sequence-to-sequence models for speech recognition have gained significant interest in the research community.

speech-recognition Speech Recognition

Low-Latency Neural Speech Translation

no code implementations1 Aug 2018 Jan Niehues, Ngoc-Quan Pham, Thanh-Le Ha, Matthias Sperber, Alex Waibel

After adaptation, we are able to reduce the number of corrections displayed during incremental output construction by 45%, without a decrease in translation quality.

Machine Translation Multi-Task Learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.