Search Results for author: Vijay Ravi

Found 8 papers, 1 papers with code

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals

no code implementations27 Jun 2022 Jinhan Wang, Vijay Ravi, Jonathan Flint, Abeer Alwan

To learn instance-spread-out embeddings, we explore methods for sampling instances for a training batch (distinct speaker-based and random sampling).

Data Augmentation Depression Detection +1

Automatic Dialect Density Estimation for African American English

no code implementations3 Apr 2022 Alexander Johnson, Kevin Everson, Vijay Ravi, Anissa Gladney, Mari Ostendorf, Abeer Alwan

In this paper, we explore automatic prediction of dialect density of the African American English (AAE) dialect, where dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect.

Density Estimation Language Modelling

FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

no code implementations11 Feb 2022 Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

The improvements for the CONVERGE (Mandarin) dataset when using the x-vector embeddings with CNN as the backend and MFCCs as input features were 9. 32% (validation) and 12. 99% (test).

Data Augmentation Depression Detection

Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion

no code implementations30 Nov 2020 Vijay Ravi, Yile Gu, Ankur Gandhe, Ariya Rastrow, Linda Liu, Denis Filimonov, Scott Novotney, Ivan Bulyko

We show that this simple method can improve performance on rare words by 3. 7% WER relative without degradation on general test set, and the improvement from USF is additive to any additional language model based rescoring.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

no code implementations8 Aug 2020 Amber Afshan, Jinxi Guo, Soo Jin Park, Vijay Ravi, Alan McCree, Abeer Alwan

For instance, when enrolled with conversation utterances, the EER increased to 3. 03%, 2. 96% and 22. 12% when tested on read, narrative, and pet-directed speech, respectively.

Data Augmentation Speaker Verification

Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification

no code implementations8 Aug 2020 Vijay Ravi, Ruchao Fan, Amber Afshan, Huanhua Lu, Abeer Alwan

A fusion of the x-vector/PLDA baseline and the SID/PLDA scores prior to PID fusion further improved performance by 15% indicating complementarity of the proposed approach to the x-vector system.

Text-Dependent Speaker Verification

Cannot find the paper you are looking for? You can Submit a new open access paper.