Search Results for author: Christian Saam

Found 9 papers, 4 papers with code

An Empirical Study of Topic Transition in Dialogue

no code implementations • COLING (CODI, CRAC) 2022 • Mayank Soni, Brendan Spillane, Emer Gilmartin, Christian Saam, Benjamin R. Cowan, Vincent Wade

Transitioning between topics is a natural component of human-human dialog.

Open-Domain Dialog

Paper
Add Code

Learning to Count Words in Fluent Speech enables Online Speech Recognition

1 code implementation • 8 Jun 2020 • George Sterpu, Christian Saam, Naomi Harte

Sequence to Sequence models, in particular the Transformer, achieve state of the art results in Automatic Speech Recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Code

Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition

1 code implementation • 19 May 2020 • George Sterpu, Christian Saam, Naomi Harte

The audio-visual speech fusion strategy AV Align has shown significant performance improvements in audio-visual speech recognition (AVSR) on the challenging LRS2 dataset.

Audio-Visual Speech Recognition speech-recognition +1

Paper
Code

How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition

1 code implementation • 17 Apr 2020 • George Sterpu, Christian Saam, Naomi Harte

A recently proposed multimodal fusion strategy, AV Align, based on state-of-the-art sequence to sequence neural networks, attempts to model this relationship by explicitly aligning the acoustic and visual representations of speech.

Audio-Visual Speech Recognition speech-recognition +1

Paper
Code

Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition

3 code implementations • 5 Sep 2018 • George Sterpu, Christian Saam, Naomi Harte

Automatic speech recognition can potentially benefit from the lip motion patterns, complementing acoustic speech to improve the overall recognition performance, particularly in noise.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1