Search Results for author: Hira Dhamyal

Found 11 papers, 2 papers with code

Prompting Audios Using Acoustic Properties For Emotion Representation

no code implementations • 3 Oct 2023 • Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

In this work, we address the challenge of automatically generating these prompts and training a model to better learn emotion representations from audio and prompt pairs.

Contrastive Learning Retrieval +1

Paper
Add Code

LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

no code implementations • 2 Oct 2023 • Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphael Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh

We hypothesize that for attacks to be transferrable, it is sufficient if the proxy can approximate the target model in the neighborhood of the harmful query.

Language Modelling Large Language Model

Paper
Add Code

Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech

1 code implementation • 1 Oct 2023 • Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh

In this paper, we propose an evaluation technique involving the training of an ASR model on synthetic speech and assessing its performance on real speech.

speech-recognition Speech Recognition +1

Paper
Code

Describing emotions with acoustic property prompts for speech emotion recognition

no code implementations • 14 Nov 2022 • Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

We investigate how the model can learn to associate the audio with the descriptions, resulting in performance improvement of Speech Emotion Recognition and Speech Audio Retrieval.

Retrieval Speech Emotion Recognition

Paper
Add Code

Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition

no code implementations • 29 Oct 2022 • Roshan Sharma, Hira Dhamyal, Bhiksha Raj, Rita Singh

Accordingly, models that have been proposed for emotion detection use one or the other of these label types.

Multi-Task Learning Speech Emotion Recognition

Paper
Add Code

Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

no code implementations • 25 Jun 2022 • Roshan Sharma, Tyler Vuong, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj

This work presents a multitask approach to the simultaneous estimation of age, country of origin, and emotion given vocal burst audio for the 2022 ICML Expressive Vocalizations Challenge ExVo-MultiTask track.

Paper
Add Code

On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice

no code implementations • 11 Apr 2022 • Ankit Shah, Hira Dhamyal, Yang Gao, Daniel Arancibia, Mario Arancibia, Bhiksha Raj, Rita Singh

Lately, there has been a global effort by multiple research groups to detect COVID-19 from voice.

Paper
Add Code

An Overview of Techniques for Biomarker Discovery in Voice Signal

no code implementations • 10 Oct 2021 • Rita Singh, Ankit Shah, Hira Dhamyal

This paper reflects on the effect of several categories of medical conditions on human voice, focusing on those that may be hypothesized to have effects on voice, but for which the changes themselves may be subtle enough to have eluded observation in standard analytical examinations of the voice signal.

Paper
Add Code

Masked Proxy Loss For Text-Independent Speaker Verification

1 code implementation • 9 Nov 2020 • Jiachen Lian, Aiswarya Vinod Kumar, Hira Dhamyal, Bhiksha Raj, Rita Singh

We further propose Multinomial Masked Proxy (MMP) loss to leverage the hardness of speaker pairs.

Metric Learning Speaker Recognition +2

Paper
Code

The phonetic bases of vocal expressed emotion: natural versus acted

no code implementations • 13 Nov 2019 • Hira Dhamyal, Shahan Ali Memon, Bhiksha Raj, Rita Singh

Our tests show significant differences in the manner and choice of phonemes in acted and natural speech, concluding moderate to low validity and value in using acted speech databases for emotion classification tasks.

Emotion Classification General Classification +1

Paper
Add Code

Detecting gender differences in perception of emotion in crowdsourced data

no code implementations • 24 Oct 2019 • Shahan Ali Memon, Hira Dhamyal, Oren Wright, Daniel Justice, Vijaykumar Palat, William Boler, Bhiksha Raj, Rita Singh

While we limit ourselves to a single modality (i. e. speech), our framework is applicable to studies of emotion perception from all such loosely annotated data in general.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.