no code implementations • COLING (CODI, CRAC) 2022 • Mayank Soni, Brendan Spillane, Emer Gilmartin, Christian Saam, Benjamin R. Cowan, Vincent Wade
Transitioning between topics is a natural component of human-human dialog.
1 code implementation • 8 Jun 2020 • George Sterpu, Christian Saam, Naomi Harte
Sequence to Sequence models, in particular the Transformer, achieve state of the art results in Automatic Speech Recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 19 May 2020 • George Sterpu, Christian Saam, Naomi Harte
The audio-visual speech fusion strategy AV Align has shown significant performance improvements in audio-visual speech recognition (AVSR) on the challenging LRS2 dataset.
1 code implementation • 17 Apr 2020 • George Sterpu, Christian Saam, Naomi Harte
A recently proposed multimodal fusion strategy, AV Align, based on state-of-the-art sequence to sequence neural networks, attempts to model this relationship by explicitly aligning the acoustic and visual representations of speech.
3 code implementations • 5 Sep 2018 • George Sterpu, Christian Saam, Naomi Harte
Automatic speech recognition can potentially benefit from the lip motion patterns, complementing acoustic speech to improve the overall recognition performance, particularly in noise.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • WS 2018 • Emer Gilmartin, Christian Saam, Carl Vogel, Nick Campbell, Vincent Wade
Casual conversation has become a focus for artificial dialogue applications.
no code implementations • 29 May 2018 • George Sterpu, Christian Saam, Naomi Harte
Finding visual features and suitable models for lipreading tasks that are more complex than a well-constrained vocabulary has proven challenging.