Search Results for author: Ruchir Travadi

Found 5 papers, 2 papers with code

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

no code implementations • 16 Oct 2023 • Zhihong Lei, Ernest Pusateri, Shiyi Han, Leo Liu, MingBin Xu, Tim Ng, Ruchir Travadi, Youyuan Zhang, Mirko Hannemann, Man-Hung Siu, Zhen Huang

Recent advances in deep learning and automatic speech recognition have improved the accuracy of end-to-end speech recognition systems, but recognition of personal content such as contact names remains a challenge.

Automatic Speech Recognition speech-recognition +1

Paper
Add Code

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

no code implementations • 2 Nov 2022 • Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang

This work studies the use of attention masking in transformer transducer based speech recognition for building a single configurable model for different deployment scenarios.

speech-recognition Speech Recognition

Paper
Add Code

Online Automatic Speech Recognition with Listen, Attend and Spell Model

no code implementations • 12 Aug 2020 • Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal

The Listen, Attend and Spell (LAS) model and other attention-based automatic speech recognition (ASR) models have known limitations when operated in a fully online mode.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features

1 code implementation • 16 Jul 2019 • Kunal Dhawan, Colin Vaz, Ruchir Travadi, Shrikanth Narayanan

We propose an algorithm to extract noise-robust acoustic features from noisy speech.

Paper
Code

Multimodal Representation Learning using Deep Multiset Canonical Correlation

1 code implementation • 3 Apr 2019 • Krishna Somandepalli, Naveen Kumar, Ruchir Travadi, Shrikanth Narayanan

We propose Deep Multiset Canonical Correlation Analysis (dMCCA) as an extension to representation learning using CCA when the underlying signal is observed across multiple (more than two) modalities.

Representation Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.