Search Results for author: Monica Sunkara

Found 10 papers, 0 papers with code

Masked Audio Text Encoders are Effective Multi-Modal Rescorers

no code implementations 11 May 2023 Jinglun Cai, Monica Sunkara, Xilai Li, Anshu Bhatia, Xiao Pan, Sravan Bodapati

Masked Language Models (MLMs) have proven to be effective for second-pass rescoring in Automatic Speech Recognition (ASR) systems.

Automatic Speech Recognition (ASR) +4

Adaptation Approaches for Nearest Neighbor Language Models

no code implementations 15 Nov 2022 Rishabh Bhardwaj, George Polovets, Monica Sunkara

Semi-parametric Nearest Neighbor Language Models ($k$NN-LMs) have produced impressive gains over purely parametric LMs, by leveraging large-scale neighborhood retrieval over external memory datastores.

Retrieval

Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting

no code implementations 18 Oct 2022 Saket Dingliwal, Monica Sunkara, Sravan Bodapati, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff

End-to-end speech recognition models trained using joint Connectionist Temporal Classification (CTC)-Attention loss have gained popularity recently.

Speech Recognition

Remember the context! ASR slot error correction through memorization

no code implementations 10 Sep 2021 Dhanush Bekal, Ashish Shenoy, Monica Sunkara, Sravan Bodapati, Katrin Kirchhoff

Accurate recognition of slot values, such as domain-specific words or named entities, by automatic speech recognition (ASR) systems forms the core of goal-oriented dialogue systems.

Automatic Speech Recognition (ASR) +3

Adapting Long Context NLM for ASR Rescoring in Conversational Agents

no code implementations 21 Apr 2021 Ashish Shenoy, Sravan Bodapati, Monica Sunkara, Srikanth Ronanki, Katrin Kirchhoff

Neural Language Models (NLM), when trained and evaluated with context spanning multiple utterances, have been shown to consistently outperform both conventional n-gram language models and NLMs that use limited context.

Intent Classification +2

Neural Inverse Text Normalization

no code implementations 12 Feb 2021 Monica Sunkara, Chaitanya Shivade, Sravan Bodapati, Katrin Kirchhoff

We propose an efficient and robust neural solution for ITN leveraging transformer based seq2seq models and FST-based text normalization techniques for data preparation.

Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech

no code implementations 3 Aug 2020 Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff

Experiments conducted on the Fisher corpus show that our proposed approach achieves ~6-9% and ~3-4% absolute improvement (F1 score) over the baseline BLSTM model on reference transcripts and ASR outputs respectively.

Data Augmentation
