1 code implementation • 2 Jun 2023 • Jinhan Wang, Vijay Ravi, Abeer Alwan
We find that a greater adversarial weight for the initial layers leads to performance improvement.
no code implementations • 27 Jun 2022 • Jinhan Wang, Vijay Ravi, Jonathan Flint, Abeer Alwan
To learn instance-spread-out embeddings, we explore methods for sampling instances for a training batch (distinct speaker-based and random sampling).
no code implementations • 20 Jun 2022 • Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan
With adversarial training, depression classification improves for every feature when compared to the baseline.
no code implementations • 3 Apr 2022 • Alexander Johnson, Kevin Everson, Vijay Ravi, Anissa Gladney, Mari Ostendorf, Abeer Alwan
In this paper, we explore automatic prediction of dialect density of the African American English (AAE) dialect, where dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect.
no code implementations • 11 Feb 2022 • Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan
The improvements for the CONVERGE (Mandarin) dataset when using the x-vector embeddings with CNN as the backend and MFCCs as input features were 9. 32% (validation) and 12. 99% (test).
no code implementations • 30 Nov 2020 • Vijay Ravi, Yile Gu, Ankur Gandhe, Ariya Rastrow, Linda Liu, Denis Filimonov, Scott Novotney, Ivan Bulyko
We show that this simple method can improve performance on rare words by 3. 7% WER relative without degradation on general test set, and the improvement from USF is additive to any additional language model based rescoring.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 8 Aug 2020 • Amber Afshan, Jinxi Guo, Soo Jin Park, Vijay Ravi, Alan McCree, Abeer Alwan
For instance, when enrolled with conversation utterances, the EER increased to 3. 03%, 2. 96% and 22. 12% when tested on read, narrative, and pet-directed speech, respectively.
no code implementations • 8 Aug 2020 • Vijay Ravi, Ruchao Fan, Amber Afshan, Huanhua Lu, Abeer Alwan
A fusion of the x-vector/PLDA baseline and the SID/PLDA scores prior to PID fusion further improved performance by 15% indicating complementarity of the proposed approach to the x-vector system.