no code implementations • 17 Oct 2023 • Vishal Sunder, Beulah Karrolla, Eric Fosler-Lussier
To train this pointer network, we generate ground truth training signals by using forced alignment between the read speech and the text being read on the training set.
no code implementations • 11 Apr 2022 • Vishal Sunder, Samuel Thomas, Hong-Kwang J. Kuo, Jatin Ganhotra, Brian Kingsbury, Eric Fosler-Lussier
In the absence of gold transcripts to fine-tune an ASR model, our model outperforms this baseline by a significant margin of 10% absolute F1 score.
no code implementations • 11 Apr 2022 • Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas, Hong-Kwang J. Kuo, Brian Kingsbury
Recent advances in End-to-End (E2E) Spoken Language Understanding (SLU) have been primarily due to effective pretraining of speech representations.
no code implementations • 11 Apr 2022 • Vishal Sunder, Prashant Serai, Eric Fosler-Lussier
As it is difficult to collect spoken data from users without a functioning SLU system, our method does not rely on spoken data for training, rather we use an ASR error predictor to "speechify" the text data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 23 Mar 2021 • Prashant Serai, Vishal Sunder, Eric Fosler-Lussier
Automatic Speech Recognition (ASR) is an imperfect process that results in certain mismatches in ASR output text when compared to plain written text or transcriptions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
1 code implementation • 28 Oct 2020 • Vishal Sunder, Eric Fosler-Lussier
Utterance classification performance in low-resource dialogue systems is constrained by an inevitably high degree of data imbalance in class labels.
no code implementations • 6 Jun 2019 • Vishal Sunder, Ashwin Srinivasan, Lovekesh Vig, Gautam Shroff, Rohit Rahul
Our interest in this paper is in meeting a rapidly growing industrial demand for information extraction from images of documents such as invoices, bills, receipts etc.
no code implementations • 19 Sep 2018 • Vishal Sunder, Lovekesh Vig, Arnab Chatterjee, Gautam Shroff
We further train a meta agent with a mixture of behaviors by learning an ensemble of different models using reinforcement learning.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 1 Apr 2018 • Karamjit Singh, Vishal Sunder
We present an approach where two different models (Deep and Shallow) are trained separately on the data and a weighted average of the outputs is taken as the final result.