no code implementations • 16 Sep 2024 • Hitesh Tulsiani, David M. Chan, Shalini Ghosh, Garima Lalwani, Prabhat Pandey, Ankish Bansal, Sri Garimella, Ariya Rastrow, Björn Hoffmeister
Dialog systems, such as voice assistants, are expected to engage with users in complex, evolving conversations.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
1 code implementation • 4 Jan 2024 • David M. Chan, Shalini Ghosh, Hitesh Tulsiani, Ariya Rastrow, Björn Hoffmeister
We demonstrate that our CLC family of approaches can improve the performance of ASR models on OD3, a new public large-scale semi-synthetic meta-dataset of audio task-oriented dialogues, by up to 19. 2%.
no code implementations • 10 Aug 2020 • Prakhar Swarup, Debmalya Chakrabarty, Ashtosh Sapru, Hitesh Tulsiani, Harish Arsikere, Sri Garimella
Semi-supervised learning (SSL) is an active area of research which aims to utilize unlabelled data in order to improve the accuracy of speech recognition systems.
no code implementations • MediaEval 2015 Workshop 2015 • Hitesh Tulsiani, Preeti Rao
This paper describes the system developed at I. I. T.
Ranked #33 on
Keyword Spotting
on QUESST