Search Results for author: Abhinav Thanda

Found 4 papers, 0 papers with code

Audio-Visual Decision Fusion for WFST-based and seq2seq Models

no code implementations • 29 Jan 2020 • Rohith Aralikatti, Sharad Roy, Abhinav Thanda, Dilip Kumar Margam, Pujitha Appan Kandala, Tanay Sharma, Shankar M Venkatesan

In this work, we propose novel methods to fuse information from audio and visual modalities at inference time.

Speech Recognition


LipReading with 3D-2D-CNN BLSTM-HMM and word-CTC models

no code implementations • 25 Jun 2019 • Dilip Kumar Margam, Rohith Aralikatti, Tanay Sharma, Abhinav Thanda, Pujitha A K, Sharad Roy, Shankar M Venkatesan

We also verify the method on a second dataset of 81 speakers which we collected.


Multi-task Learning Of Deep Neural Networks For Audio Visual Automatic Speech Recognition

no code implementations • 10 Jan 2017 • Abhinav Thanda, Shankar M Venkatesan

Multi-task learning (MTL) involves the simultaneous training of two or more related tasks over shared representations.

Automatic Speech Recognition (ASR) +2


Audio Visual Speech Recognition using Deep Recurrent Neural Networks

no code implementations • 9 Nov 2016 • Abhinav Thanda, Shankar M Venkatesan

The frame labels obtained from the acoustic model are then used to perform a non-linear dimensionality reduction of the visual features using a deep bottleneck network.

Audio-Visual Speech Recognition, Automatic Speech Recognition +4

