Search Results for author: George Sterpu

Found 7 papers, 5 papers with code

Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys

no code implementations • 18 Nov 2023 • Gabriel Cosache, Francisco Salgado, Cosmin Rotariu, George Sterpu, Rishabh Jain, Peter Corcoran

An overview is given of the DAVID Smart-Toy platform, one of the first Edge AI platform designs to incorporate advanced low-power data processing by neural inference models co-located with the relevant image or audio sensors.

AV Taris: Online Audio-Visual Speech Recognition

1 code implementation • 14 Dec 2020 • George Sterpu, Naomi Harte

In recent years, Automatic Speech Recognition (ASR) technology has approached human-level performance on conversational speech under relatively clean listening conditions.

Action Detection, Activity Detection, +5

Learning to Count Words in Fluent Speech enables Online Speech Recognition

1 code implementation • 8 Jun 2020 • George Sterpu, Christian Saam, Naomi Harte

Sequence-to-sequence models, in particular the Transformer, achieve state-of-the-art results in Automatic Speech Recognition.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +1

Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition

1 code implementation • 19 May 2020 • George Sterpu, Christian Saam, Naomi Harte

The audio-visual speech fusion strategy AV Align has shown significant performance improvements in audio-visual speech recognition (AVSR) on the challenging LRS2 dataset.

Audio-Visual Speech Recognition, speech-recognition, +1

How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition

1 code implementation • 17 Apr 2020 • George Sterpu, Christian Saam, Naomi Harte

A recently proposed multimodal fusion strategy, AV Align, based on state-of-the-art sequence to sequence neural networks, attempts to model this relationship by explicitly aligning the acoustic and visual representations of speech.

Audio-Visual Speech Recognition, speech-recognition, +1

Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition

3 code implementations • 5 Sep 2018 • George Sterpu, Christian Saam, Naomi Harte

Automatic speech recognition can potentially benefit from the lip motion patterns, complementing acoustic speech to improve the overall recognition performance, particularly in noise.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +1

Can DNNs Learn to Lipread Full Sentences?

no code implementations • 29 May 2018 • George Sterpu, Christian Saam, Naomi Harte

Finding visual features and suitable models for lipreading tasks that are more complex than a well-constrained vocabulary has proven challenging.

General Classification, Language Modelling, +1
