1 code implementation • 29 Aug 2024 • Ashish Mittal, Darshan Prabhu, Sunita Sarawagi, Preethi Jyothi
A challenge of our proposed coupling is handling the mismatch between the tokenizers of the LLM and ASR systems.
no code implementations • 21 Nov 2023 • Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei zhang, George Saon, Brian Kingsbury
Soft random sampling (SRS) is a simple yet effective approach for efficient training of large-scale deep neural networks when dealing with massive data.
no code implementations • 11 Jul 2023 • Vinit S. Unni, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi
RNN-Transducers (RNN-Ts) have gained widespread acceptance as an end-to-end model for speech to text conversion because of their high accuracy and streaming capabilities.
no code implementations • 30 Oct 2022 • Ashish Mittal, Durga Sivasubramanian, Rishabh Iyer, Preethi Jyothi, Ganesh Ramakrishnan
Training state-of-the-art ASR systems such as RNN-T often has a high associated financial and environmental cost.
no code implementations • 21 Feb 2022 • Vinit Unni, Shreya Khare, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi, Samarth Bharadwaj
RNN-Transducer (RNN-T) models have become synonymous with streaming end-to-end ASR systems.
1 code implementation • 29 Jun 2021 • Ashish Mittal, Samarth Bharadwaj, Shreya Khare, Saneem Chemmengath, Karthik Sankaranarayanan, Brian Kingsbury
Spoken intent detection has become a popular approach to interface with various smart devices with ease.
1 code implementation • 1 Apr 2021 • Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan, Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajpuria, Chiranjeevi Yarra, Ashish Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan, Tejaswi Seeram, Basil Abraham
For this purpose, we provide a total of ~600 hours of transcribed speech data, comprising train and test sets, in these languages including two code-switched language pairs, Hindi-English and Bengali-English.
1 code implementation • ACL 2019 • Priyanka Agrawal, Parag Jain, Ayushi Dalmia, Abhishek Bansal, Ashish Mittal, Karthik Sankaranarayanan
Semantic parsing over multiple knowledge bases enables a parser to exploit structural similarities of programs across the multiple domains.