Search Results for author: Tsz Kin Lam

Found 13 papers, 4 papers with code

From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM

1 code implementation13 Mar 2025 Kshitij Ambilduke, Ben Peters, Sonal Sannigrahi, Anil Keshwani, Tsz Kin Lam, Bruno Martins, Marcely Zanon Boito, André F. T. Martins

Large language models (LLMs) have shown remarkable performance and generalization capabilities across multiple languages and tasks, making them very attractive targets for multi-modality integration (e. g., images or speech).

Translation

Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison

no code implementations4 Jan 2025 Tsz Kin Lam, Marco Gaido, Sara Papi, Luisa Bentivogli, Barry Haddow

Following the remarkable success of Large Language Models (LLMs) in NLP tasks, there is increasing interest in extending their capabilities to speech -- the most common form of communication.

Decoder Knowledge Distillation +1

Pitfalls and Outlooks in Using COMET

1 code implementation27 Aug 2024 Vilém Zouhar, Pinzhen Chen, Tsz Kin Lam, Nikita Moghe, Barry Haddow

The COMET metric has blazed a trail in the machine translation community, given its strong correlation with human judgements of translation quality.

Machine Translation Translation

Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases

no code implementations1 Feb 2024 Giulio Zhou, Tsz Kin Lam, Alexandra Birch, Barry Haddow

While there has been a growing interest in developing direct speech translation systems to avoid propagating errors and losing non-verbal content, prior work in direct S2TT has struggled to conclusively establish the advantages of integrating the acoustic signal directly into the translation process.

speech-recognition Speech Recognition +3

On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR

1 code implementation3 Apr 2021 Tsz Kin Lam, Mayumi Ohta, Shigehiko Schamoni, Stefan Riezler

Our method, called Aligned Data Augmentation (ADA) for ASR, replaces transcribed tokens and the speech representations in an aligned manner to generate previously unseen training pairs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation

no code implementations WS 2019 Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

We propose an interactive-predictive neural machine translation framework for easier model personalization using reinforcement and imitation learning.

Form Imitation Learning +2

A Reinforcement Learning Approach to Interactive-Predictive Neural Machine Translation

1 code implementation3 May 2018 Tsz Kin Lam, Julia Kreutzer, Stefan Riezler

We present an approach to interactive-predictive neural machine translation that attempts to reduce human effort from three directions: Firstly, instead of requiring humans to select, correct, or delete segments, we employ the idea of learning from human reinforcements in form of judgments on the quality of partial translations.

Machine Translation reinforcement-learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.