no code implementations • 8 Mar 2024 • Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Nicolas Kourtellis
As Automatic Speech Recognition (ASR) models become ever more pervasive, it is important to ensure that they make reliable predictions under corruptions present in the physical and digital world.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • EMNLP (nlpbt) 2020 • Muhammad A. Shah, Shikib Mehri, Tejas Srinivasan
While neural models have been shown to exhibit strong performance on single-turn visual question answering (VQA) tasks, extending VQA to a multi-turn, conversational setting remains a challenge.
no code implementations • 28 May 2020 • Muhammad A. Shah, Raphael Olivier, Bhiksha Raj
Deploying deep learning models, comprising of non-linear combination of millions, even billions, of parameters is challenging given the memory, power and compute constraints of the real world.