1 code implementation • Findings (NAACL) 2022 • Divya Sharma, Arun Balaji Buduru
Empirical results suggest that frame-attentive embedding can cost-effectively reduce linguistic bias and enhance the usability of baselines.
no code implementations • NLP4DH (ICON) 2021 • Avinash Tulasi, Asanobu Kitamoto, Ponnurangam Kumaraguru, Arun Balaji Buduru
In this work, we aim to show the topics under discussion, evolution of discussions, change in user sentiment during the pandemic.
no code implementations • 23 Dec 2024 • Orchid Chetia Phukan, Drishti Singh, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma
This higher performance can be due to its speaker recognition pre-training that enables it for capturing unique prosodic characteristics of the sources in a better way.
no code implementations • 16 Oct 2024 • Orchid Chetia Phukan, Devyani Koshal, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma
However, most prior efforts have centered on building individual models for each task separately, despite the inherent similarities among these tasks.
no code implementations • 16 Oct 2024 • Orchid Chetia Phukan, Swarup Ranjan Behera, Girish, Mohd Mujtaba Akhtar, Arun Balaji Buduru, Rajesh Sharma
Despite being trained exclusively on speech data, speech foundation models (SFMs) like Whisper have shown impressive performance in non-speech tasks such as audio classification.
no code implementations • 16 Oct 2024 • Sarthak Jain, Orchid Chetia Phukan, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma
In this work, we introduce SeQuiFi, a novel approach for mitigating catastrophic forgetting (CF) in speech emotion recognition (SER).
no code implementations • 24 Sep 2024 • Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Nitin Choudhury, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna
We show that such random selection preserves more performance than the SOTA dimensionality reduction techniques while reducing model parameters and inference time by almost over half.
no code implementations • 22 Sep 2024 • Orchid Chetia Phukan, Swarup Ranjan Behera, Shubham Singh, Muskaan Singh, Vandana Rajan, Arun Balaji Buduru, Rajesh Sharma, S. R. Mahadeva Prasanna
In this work, we demonstrate that the amalgamation of NSFs results in complementary behavior, leading to enhanced depression detection performance.
no code implementations • 21 Sep 2024 • Orchid Chetia Phukan, Mohd Mujtaba Akhtar, Girish, Swarup Ranjan Behera, Sishir Kalita, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna
Through MATA coupled with the combination of MFMs: LanguageBind and ImageBind, we report the topmost performance with accuracies of 76. 47%, 77. 40%, 75. 12% and F1-scores of 70. 35%, 76. 19%, 74. 63% for ASVP-ESD, JNV, and VIVAE datasets against individual FMs and baseline fusion techniques and report SOTA on the benchmark datasets.
no code implementations • 21 Sep 2024 • Orchid Chetia Phukan, Sarthak Jain, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna
In this study, for the first time, we extensively investigate whether music foundation models (MFMs) or speech foundation models (SFMs) work better for singing voice deepfake detection (SVDD), which has recently attracted attention in the research community.
no code implementations • 7 Sep 2024 • Neha Kumaru, Garvit Gupta, Shreyas Mongia, Shubham Singh, Ponnurangam Kumaraguru, Arun Balaji Buduru
With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed.
1 code implementation • 10 Jul 2024 • Paridhi Mundra, Manik Sharma, Yashwardhan Chaudhuri, Orchid Chetia Phukan, Arun Balaji Buduru
As respiratory illnesses become more common, it is crucial to quickly and accurately detect them to improve patient care.
1 code implementation • 10 Jul 2024 • Yashwardhan Chaudhuri, Paridhi Mundra, Arnesh Batra, Orchid Chetia Phukan, Arun Balaji Buduru
Recognition and interpretation of bird vocalizations are pivotal in ornithological research and ecological conservation efforts due to their significance in understanding avian behaviour, performing habitat assessment and judging ecological health.
no code implementations • 8 Jul 2024 • Yashwardhan Chaudhuri, Ankit Kumar, Arun Balaji Buduru, Adel Alshamrani
Crowd counting is gaining societal relevance, particularly in domains of Urban Planning, Crowd Management, and Public Safety.
no code implementations • 15 Jun 2024 • Sarthak Sharma, Orchid Chetia Phukan, Drishti Singh, Arun Balaji Buduru, Rajesh Sharma
In this work, we present, AVR application for audio-visual humor detection.
1 code implementation • 13 Jun 2024 • Orchid Chetia Phukan, Priyabrata Mallick, Swarup Ranjan Behera, Aalekhya Satya Narayani, Arun Balaji Buduru, Rajesh Sharma
In this paper, we work towards extending Audio-Visual Question Answering (AVQA) to multilingual settings.
Audio-visual Question Answering
Audio-Visual Question Answering (AVQA)
+3
no code implementations • 10 Jun 2024 • Sarthak Jain, Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma
In this paper, we focus on audio violence detection (AVD).
no code implementations • 10 Jun 2024 • Devyani Koshal, Orchid Chetia Phukan, Sarthak Jain, Arun Balaji Buduru, Rajesh Sharma
Emotion Recognition (ER), Gender Recognition (GR), and Age Estimation (AE) constitute paralinguistic tasks that rely not on the spoken content but primarily on speech characteristics such as pitch and tone.
no code implementations • 10 Jun 2024 • Orchid Chetia Phukan, Sarthak Jain, Shubham Singh, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma
To address this, in this paper, we introduce ComFeAT, an application that employs a CNN model trained on a combination of features extracted from PTMs, a. k. a.
no code implementations • 5 Jun 2024 • Orchid Chetia Phukan, Yashasvi Chaurasia, Arun Balaji Buduru, Rajesh Sharma
In this study, we investigate representations from paralingual Pre-Trained model (PTM) for Audio Abuse Detection (AAD), which has not been explored for AAD.
no code implementations • 9 May 2024 • Satyadwyoom Kumar, Saurabh Gupta, Arun Balaji Buduru
Most of these adversarial attack strategies assume that the adversary has access to the training data, the model parameters, and the input during deployment, hence, focus on perturbing the pixel level information present in the input image.
no code implementations • 31 Mar 2024 • Orchid Chetia Phukan, Ankita Das, Arun Balaji Buduru, Rajesh Sharma
Stress recognition through physiological signals such as Electrocardiogram (ECG) signals has garnered significant attention.
1 code implementation • 31 Mar 2024 • Orchid Chetia Phukan, Gautam Siddharth Kashyap, Arun Balaji Buduru, Rajesh Sharma
To validate our hypothesis, we extract representations from state-of-the-art (SOTA) PTMs including monolingual, multilingual as well as PTMs trained for speaker and emotion recognition, and evaluated them on ASVSpoof 2019 (ASV), In-the-Wild (ITW), and DECRO benchmark databases.
no code implementations • 2 Feb 2024 • Orchid Chetia Phukan, Gautam Siddharth Kashyap, Arun Balaji Buduru, Rajesh Sharma
However, such paralinguistic PTM representations haven't been evaluated for SER in linguistic environments other than English.
no code implementations • 11 Jan 2024 • Yashwardhan Chaudhuri, Ankit Kumar, Orchid Chetia Phukan, Arun Balaji Buduru
However, most of the previous methods rely on a heavy backbone and a complex downstream architecture that restricts the deployment.
no code implementations • 29 May 2023 • Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma
Speech emotion recognition (SER) is a field that has drawn a lot of attention due to its applications in diverse fields.
no code implementations • 22 Apr 2023 • Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma
In this work, we exploit this research gap and perform a comparative analysis of embeddings extracted from eight speech and audio PTMs (wav2vec 2. 0, data2vec, wavLM, UniSpeech-SAT, wav2clip, YAMNet, x-vector, ECAPA).
no code implementations • 29 Sep 2020 • Saurabh Gupta, Arun Balaji Buduru, Ponnurangam Kumaraguru
With experiments on MNIST dataset, we show that imdpGAN preserves the privacy of the individual data point, and learns latent codes to control the specificity of the generated samples.
no code implementations • 29 Sep 2020 • Saurabh Gupta, Siddhant Bhambri, Karan Dhingra, Arun Balaji Buduru, Ponnurangam Kumaraguru
We experiment on real-world smart home data, and show that the multi-objective approaches: i) establish trade-off between the two objectives, ii) achieve better combined user satisfaction and power consumption than single-objective approaches.
no code implementations • 12 Jul 2020 • Mudit Verma, Arun Balaji Buduru
Hence, there is an increasing need for real-time and fine-grained content analysis services, including language identification, content transcription, and analysis.
no code implementations • 6 Dec 2019 • Mudit Verma, Siddhant Bhambri, Saurabh Gupta, Arun Balaji Buduru
Rapid advancements in the Internet of Things (IoT) have facilitated more efficient deployment of smart environment solutions for specific user requirement.
no code implementations • 3 Dec 2019 • Siddhant Bhambri, Sumanyu Muku, Avinash Tulasi, Arun Balaji Buduru
Machine learning has seen tremendous advances in the past few years, which has lead to deep learning models being deployed in varied applications of day-to-day life.
no code implementations • 22 Sep 2019 • Kumari Neha, Shashank Srikanth, Sonali Singhal, Shwetanshu Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru
Users on Twitter are identified with the help of their profile attributes that consists of username, display name, profile image, to name a few.
no code implementations • 16 Sep 2019 • Saurabh Gupta, Asmit Kumar Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru
In the political context, hashtags on Twitter are used by users to campaign for their parties, spread news, or to get followers and get a general idea by following a discussion built around a hashtag.
no code implementations • 29 Oct 2018 • Satvik Jain, Arun Balaji Buduru, Anshuman Chhabra
Cloud infrastructures are being increasingly utilized in critical infrastructures such as banking/finance, transportation and utility management.