1 code implementation • 23 Feb 2024 • Vishwanath Pratap Singh, Md Sahidullah, Tomi Kinnunen
One promising approach is to align vocal-tract parameters between adults and children through children-specific data augmentation, referred here to as ChildAugment.
no code implementations • 13 Jun 2023 • Vishwanath Pratap Singh, Md Sahidullah, Tomi Kinnunen
The first dataset, used for addressing short-term ageing (up to 10 years time difference between enrollment and test) under uncontrolled conditions, is VoxCeleb.
no code implementations • 13 Mar 2022 • Vishwanath Pratap Singh, Hardik Sailor, Supratik Bhattacharya, Abhishek Pandey
Then, this modified adult spectrum is used as augmented data to improve end-to-end ASR systems for children's speech recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 2 Dec 2021 • Vishwanath Pratap Singh, Shakti P. Rath, Abhishek Pandey
This paper presents a novel deep learning architecture for acoustic model in the context of Automatic Speech Recognition (ASR), termed as MixNet.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 2 Dec 2021 • Vishwanath Pratap Singh, Shakti P. Rath, Abhishek Pandey
In this paper we have proposed higher order minkowski loss (4th Order and 6th Order) during inference time, without any changes during training time.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 15 Jun 2021 • Vishwanath Pratap Singh, Shashi Kumar, Ravi Shekhar Jha, Abhishek Pandey
The COVID-19 pandemic has resulted in more than 125 million infections and more than 2. 7 million casualties.