Search Results for author: Vijay Ravi

Found 8 papers, 1 papers with code

Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals

1 code implementation • 2 Jun 2023 • Jinhan Wang, Vijay Ravi, Abeer Alwan

We find that a greater adversarial weight for the initial layers leads to performance improvement.

Ranked #1 on Depression Detection on Distress Analysis Interview Corpus/Wizard-of-Oz set (DAIC-WOZ)

Depression Detection Disentanglement +2

Paper
Code

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals

no code implementations • 27 Jun 2022 • Jinhan Wang, Vijay Ravi, Jonathan Flint, Abeer Alwan

To learn instance-spread-out embeddings, we explore methods for sampling instances for a training batch (distinct speaker-based and random sampling).

Data Augmentation Depression Detection +1

Paper
Add Code

A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement

no code implementations • 20 Jun 2022 • Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

With adversarial training, depression classification improves for every feature when compared to the baseline.

Depression Detection Disentanglement

Paper
Add Code

Automatic Dialect Density Estimation for African American English

no code implementations • 3 Apr 2022 • Alexander Johnson, Kevin Everson, Vijay Ravi, Anissa Gladney, Mari Ostendorf, Abeer Alwan

In this paper, we explore automatic prediction of dialect density of the African American English (AAE) dialect, where dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect.

Density Estimation Language Modelling

Paper
Add Code

FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

no code implementations • 11 Feb 2022 • Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

The improvements for the CONVERGE (Mandarin) dataset when using the x-vector embeddings with CNN as the backend and MFCCs as input features were 9. 32% (validation) and 12. 99% (test).

Data Augmentation Depression Detection

Paper
Add Code

Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion

no code implementations • 30 Nov 2020 • Vijay Ravi, Yile Gu, Ankur Gandhe, Ariya Rastrow, Linda Liu, Denis Filimonov, Scott Novotney, Ivan Bulyko

We show that this simple method can improve performance on rare words by 3. 7% WER relative without degradation on general test set, and the improvement from USF is additive to any additional language model based rescoring.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

no code implementations • 8 Aug 2020 • Amber Afshan, Jinxi Guo, Soo Jin Park, Vijay Ravi, Alan McCree, Abeer Alwan

For instance, when enrolled with conversation utterances, the EER increased to 3. 03%, 2. 96% and 22. 12% when tested on read, narrative, and pet-directed speech, respectively.

Data Augmentation Speaker Verification

Paper
Add Code

Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification

no code implementations • 8 Aug 2020 • Vijay Ravi, Ruchao Fan, Amber Afshan, Huanhua Lu, Abeer Alwan

A fusion of the x-vector/PLDA baseline and the SID/PLDA scores prior to PID fusion further improved performance by 15% indicating complementarity of the proposed approach to the x-vector system.

Text-Dependent Speaker Verification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.