Search Results for author: Arun Balaji Buduru

Found 35 papers, 5 papers with code

FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems

1 code implementation Findings (NAACL) 2022 Divya Sharma, Arun Balaji Buduru

Empirical results suggest that frame-attentive embedding can cost-effectively reduce linguistic bias and enhance the usability of baselines.

Domain Adaptation Fairness +1

Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution

no code implementations23 Dec 2024 Orchid Chetia Phukan, Drishti Singh, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma

This higher performance can be due to its speaker recognition pre-training that enables it for capturing unique prosodic characteristics of the sources in a better way.

Audio Deepfake Detection Face Swapping +3

Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks

no code implementations16 Oct 2024 Orchid Chetia Phukan, Devyani Koshal, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma

However, most prior efforts have centered on building individual models for each task separately, despite the inherent similarities among these tasks.

Age Estimation Multi-Task Learning +3

Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals

no code implementations16 Oct 2024 Orchid Chetia Phukan, Swarup Ranjan Behera, Girish, Mohd Mujtaba Akhtar, Arun Balaji Buduru, Rajesh Sharma

Despite being trained exclusively on speech data, speech foundation models (SFMs) like Whisper have shown impressive performance in non-speech tasks such as audio classification.

Audio Classification Time Series

Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition

no code implementations21 Sep 2024 Orchid Chetia Phukan, Mohd Mujtaba Akhtar, Girish, Swarup Ranjan Behera, Sishir Kalita, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna

Through MATA coupled with the combination of MFMs: LanguageBind and ImageBind, we report the topmost performance with accuracies of 76. 47%, 77. 40%, 75. 12% and F1-scores of 70. 35%, 76. 19%, 74. 63% for ASVP-ESD, JNV, and VIVAE datasets against individual FMs and baseline fusion techniques and report SOTA on the benchmark datasets.

Audio Deepfake Detection Emotion Recognition +3

Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models

no code implementations21 Sep 2024 Orchid Chetia Phukan, Sarthak Jain, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna

In this study, for the first time, we extensively investigate whether music foundation models (MFMs) or speech foundation models (SFMs) work better for singing voice deepfake detection (SVDD), which has recently attracted attention in the research community.

DeepFake Detection Face Swapping +3

Towards identifying Source credibility on Information Leakage in Digital Gadget Market

no code implementations7 Sep 2024 Neha Kumaru, Garvit Gupta, Shreyas Mongia, Shubham Singh, Ponnurangam Kumaraguru, Arun Balaji Buduru

With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed.

NER

VoxMed: One-Step Respiratory Disease Classifier using Digital Stethoscope Sounds

1 code implementation10 Jul 2024 Paridhi Mundra, Manik Sharma, Yashwardhan Chaudhuri, Orchid Chetia Phukan, Arun Balaji Buduru

As respiratory illnesses become more common, it is crucial to quickly and accurately detect them to improve patient care.

Diagnostic

ASGIR: Audio Spectrogram Transformer Guided Classification And Information Retrieval For Birds

1 code implementation10 Jul 2024 Yashwardhan Chaudhuri, Paridhi Mundra, Arnesh Batra, Orchid Chetia Phukan, Arun Balaji Buduru

Recognition and interpretation of bird vocalizations are pivotal in ornithological research and ecological conservation efforts due to their significance in understanding avian behaviour, performing habitat assessment and judging ecological health.

Information Retrieval Retrieval

FGA: Fourier-Guided Attention Network for Crowd Count Estimation

no code implementations8 Jul 2024 Yashwardhan Chaudhuri, Ankit Kumar, Arun Balaji Buduru, Adel Alshamrani

Crowd counting is gaining societal relevance, particularly in domains of Urban Planning, Crowd Management, and Public Safety.

Crowd Counting

PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

no code implementations10 Jun 2024 Devyani Koshal, Orchid Chetia Phukan, Sarthak Jain, Arun Balaji Buduru, Rajesh Sharma

Emotion Recognition (ER), Gender Recognition (GR), and Age Estimation (AE) constitute paralinguistic tasks that rely not on the spoken content but primarily on speech characteristics such as pitch and tone.

Age Estimation Emotion Recognition +2

ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

no code implementations10 Jun 2024 Orchid Chetia Phukan, Sarthak Jain, Shubham Singh, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma

To address this, in this paper, we introduce ComFeAT, an application that employs a CNN model trained on a combination of features extracted from PTMs, a. k. a.

Depression Detection

CoLLAB: A Collaborative Approach for Multilingual Abuse Detection

no code implementations5 Jun 2024 Orchid Chetia Phukan, Yashasvi Chaurasia, Arun Balaji Buduru, Rajesh Sharma

In this study, we investigate representations from paralingual Pre-Trained model (PTM) for Audio Abuse Detection (AAD), which has not been explored for AAD.

Abuse Detection

BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

no code implementations9 May 2024 Satyadwyoom Kumar, Saurabh Gupta, Arun Balaji Buduru

Most of these adversarial attack strategies assume that the adversary has access to the training data, the model parameters, and the input during deployment, hence, focus on perturbing the pixel level information present in the input image.

Adversarial Attack Deep Learning +1

SONIC: Synergizing VisiON Foundation Models for Stress RecogNItion from ECG signals

no code implementations31 Mar 2024 Orchid Chetia Phukan, Ankita Das, Arun Balaji Buduru, Rajesh Sharma

Stress recognition through physiological signals such as Electrocardiogram (ECG) signals has garnered significant attention.

Heterogeneity over Homogeneity: Investigating Multilingual Speech Pre-Trained Models for Detecting Audio Deepfake

1 code implementation31 Mar 2024 Orchid Chetia Phukan, Gautam Siddharth Kashyap, Arun Balaji Buduru, Rajesh Sharma

To validate our hypothesis, we extract representations from state-of-the-art (SOTA) PTMs including monolingual, multilingual as well as PTMs trained for speaker and emotion recognition, and evaluated them on ASVSpoof 2019 (ASV), In-the-Wild (ITW), and DECRO benchmark databases.

Audio Deepfake Detection Emotion Recognition +1

A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd Counting

no code implementations11 Jan 2024 Yashwardhan Chaudhuri, Ankit Kumar, Orchid Chetia Phukan, Arun Balaji Buduru

However, most of the previous methods rely on a heavy backbone and a complex downstream architecture that restricts the deployment.

Computational Efficiency Crowd Counting

A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition

no code implementations22 Apr 2023 Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma

In this work, we exploit this research gap and perform a comparative analysis of embeddings extracted from eight speech and audio PTMs (wav2vec 2. 0, data2vec, wavLM, UniSpeech-SAT, wav2clip, YAMNet, x-vector, ECAPA).

Speaker Recognition Speech Emotion Recognition

imdpGAN: Generating Private and Specific Data with Generative Adversarial Networks

no code implementations29 Sep 2020 Saurabh Gupta, Arun Balaji Buduru, Ponnurangam Kumaraguru

With experiments on MNIST dataset, we show that imdpGAN preserves the privacy of the individual data point, and learns latent codes to control the specificity of the generated samples.

Binary Classification Generative Adversarial Network +1

Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments

no code implementations29 Sep 2020 Saurabh Gupta, Siddhant Bhambri, Karan Dhingra, Arun Balaji Buduru, Ponnurangam Kumaraguru

We experiment on real-world smart home data, and show that the multi-objective approaches: i) establish trade-off between the two objectives, ii) achieve better combined user satisfaction and power consumption than single-objective approaches.

Management Multi-Objective Reinforcement Learning

Fine-grained Language Identification with Multilingual CapsNet Model

no code implementations12 Jul 2020 Mudit Verma, Arun Balaji Buduru

Hence, there is an increasing need for real-time and fine-grained content analysis services, including language identification, content transcription, and analysis.

Language Identification model

Making Smart Homes Smarter: Optimizing Energy Consumption with Human in the Loop

no code implementations6 Dec 2019 Mudit Verma, Siddhant Bhambri, Saurabh Gupta, Arun Balaji Buduru

Rapid advancements in the Internet of Things (IoT) have facilitated more efficient deployment of smart environment solutions for specific user requirement.

Clustering Reinforcement Learning +1

A Survey of Black-Box Adversarial Attacks on Computer Vision Models

no code implementations3 Dec 2019 Siddhant Bhambri, Sumanyu Muku, Avinash Tulasi, Arun Balaji Buduru

Machine learning has seen tremendous advances in the past few years, which has lead to deep learning models being deployed in varied applications of day-to-day life.

Is change the only constant? Profile change perspective on #LokSabhaElections2019

no code implementations22 Sep 2019 Kumari Neha, Shashank Srikanth, Sonali Singhal, Shwetanshu Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru

Users on Twitter are identified with the help of their profile attributes that consists of username, display name, profile image, to name a few.

Attribute

Hashtags are (not) judgemental: The untold story of Lok Sabha elections 2019

no code implementations16 Sep 2019 Saurabh Gupta, Asmit Kumar Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru

In the political context, hashtags on Twitter are used by users to campaign for their parties, spread news, or to get followers and get a general idea by following a discussion built around a hashtag.

Semantic Similarity Semantic Textual Similarity

An approach to predictively securing critical cloud infrastructures through probabilistic modeling

no code implementations29 Oct 2018 Satvik Jain, Arun Balaji Buduru, Anshuman Chhabra

Cloud infrastructures are being increasingly utilized in critical infrastructures such as banking/finance, transportation and utility management.

Management

Cannot find the paper you are looking for? You can Submit a new open access paper.