3D Convolutional Neural Networks for Cross Audio-Visual Matching Recognition

18 Jun 2017Amirsina TorfiSeyed Mehdi IranmaneshNasser M. NasrabadiJeremy Dawson

Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing information... (read more)

PDF Abstract

Evaluation results from the paper


  Submit results from this paper to get state-of-the-art GitHub badges and help community compare results to other papers.