Search Results for author: Sri Garimella

Found 5 papers, 0 papers with code

Unified Modeling of Multi-Domain Multi-Device ASR Systems

no code implementations • 13 May 2022 • Soumyajit Mitra, Swayambhu Nath Ray, Bharat Padi, Arunasish Sen, Raghavendra Bilgi, Harish Arsikere, Shalini Ghosh, Ajay Srinivasamurthy, Sri Garimella

Modern Automatic Speech Recognition (ASR) systems often use a portfolio of domain-specific models in order to get high accuracy for distinct user utterance types across different devices.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Improving RNN-T ASR Performance with Date-Time and Location Awareness

no code implementations • 11 Jun 2021 • Swayambhu Nath Ray, Soumyajit Mitra, Raghavendra Bilgi, Sri Garimella

In this paper, we explore the benefits of incorporating context into a Recurrent Neural Network (RNN-T) based Automatic Speech Recognition (ASR) model to improve the speech recognition for virtual assistants.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Knowledge Distillation and Data Selection for Semi-Supervised Learning in CTC Acoustic Models

no code implementations • 10 Aug 2020 • Prakhar Swarup, Debmalya Chakrabarty, Ashtosh Sapru, Hitesh Tulsiani, Harish Arsikere, Sri Garimella

Semi-supervised learning (SSL) is an active area of research which aims to utilize unlabelled data in order to improve the accuracy of speech recognition systems.

Knowledge Distillation speech-recognition +1

Paper
Add Code

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification

no code implementations • 8 Jul 2020 • Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann

Experiments show that for English-Spanish, the bilingual joint ASR-LID architecture matches monolingual ASR and acoustic-only LID accuracies.

Language Identification

Paper
Add Code

Language Model Bootstrapping Using Neural Machine Translation For Conversational Speech Recognition

no code implementations • 2 Dec 2019 • Surabhi Punjabi, Harish Arsikere, Sri Garimella

Machine translation (MT) offers a systematic way of incorporating collections from mature, resource-rich conversational systems that may be available for a different language.

Data Augmentation Domain Adaptation +8

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.