Search Results for author: Vishal Sunder

Found 9 papers, 1 papers with code

End-to-End real time tracking of children's reading with pointer network

no code implementations • 17 Oct 2023 • Vishal Sunder, Beulah Karrolla, Eric Fosler-Lussier

To train this pointer network, we generate ground truth training signals by using forced alignment between the read speech and the text being read on the training set.

Paper
Add Code

Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding

no code implementations • 11 Apr 2022 • Vishal Sunder, Samuel Thomas, Hong-Kwang J. Kuo, Jatin Ganhotra, Brian Kingsbury, Eric Fosler-Lussier

In the absence of gold transcripts to fine-tune an ASR model, our model outperforms this baseline by a significant margin of 10% absolute F1 score.

Action Recognition Spoken Language Understanding

Paper
Add Code

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems

no code implementations • 11 Apr 2022 • Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas, Hong-Kwang J. Kuo, Brian Kingsbury

Recent advances in End-to-End (E2E) Spoken Language Understanding (SLU) have been primarily due to effective pretraining of speech representations.

Intent Recognition Spoken Language Understanding

Paper
Add Code

Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data

no code implementations • 11 Apr 2022 • Vishal Sunder, Prashant Serai, Eric Fosler-Lussier

As it is difficult to collect spoken data from users without a functioning SLU system, our method does not rely on spoken data for training, rather we use an ASR error predictor to "speechify" the text data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Hallucination of speech recognition errors with sequence to sequence learning

no code implementations • 23 Mar 2021 • Prashant Serai, Vishal Sunder, Eric Fosler-Lussier

Automatic Speech Recognition (ASR) is an imperfect process that results in certain mismatches in ASR output text when compared to plain written text or transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Handling Class Imbalance in Low-Resource Dialogue Systems by Combining Few-Shot Classification and Interpolation

1 code implementation • 28 Oct 2020 • Vishal Sunder, Eric Fosler-Lussier

Utterance classification performance in low-resource dialogue systems is constrained by an inevitably high degree of data imbalance in class labels.

Classification Dialogue Act Classification +1

Paper
Code

One-shot Information Extraction from Document Images using Neuro-Deductive Program Synthesis

no code implementations • 6 Jun 2019 • Vishal Sunder, Ashwin Srinivasan, Lovekesh Vig, Gautam Shroff, Rohit Rahul

Our interest in this paper is in meeting a rapidly growing industrial demand for information extraction from images of documents such as invoices, bills, receipts etc.

Program Synthesis

Paper
Add Code

Prosocial or Selfish? Agents with different behaviors for Contract Negotiation using Reinforcement Learning

no code implementations • 19 Sep 2018 • Vishal Sunder, Lovekesh Vig, Arnab Chatterjee, Gautam Shroff

We further train a meta agent with a mixture of behaviors by learning an ensemble of different models using reinforcement learning.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

CIKM AnalytiCup 2017 Lazada Product Title Quality Challenge An Ensemble of Deep and Shallow Learning to predict the Quality of Product Titles

no code implementations • 1 Apr 2018 • Karamjit Singh, Vishal Sunder

We present an approach where two different models (Deep and Shallow) are trained separately on the data and a weighted average of the outputs is taken as the final result.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.