Search Results for author: Carlos Busso

Found 11 papers, 0 papers with code

emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition

no code implementations • 21 Mar 2024 • Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Bjorn W. Schuller, Carlos Busso

This study presents emoDARTS, a DARTS-optimised joint CNN and Sequential Neural Network (SeqNN: LSTM, RNN) architecture that enhances SER performance.

Ranked #1 on Speech Emotion Recognition on MSP-IMPROV

Neural Architecture Search Speech Emotion Recognition

Paper
Add Code

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

no code implementations • 19 Jan 2024 • Ismail Rasim Ulgen, Zongyang Du, Carlos Busso, Berrak Sisman

In order to leverage this information, we introduce a novel contrastive pretraining approach applied to emotion-unlabeled data for speech emotion recognition.

Contrastive Learning Speech Emotion Recognition

Paper
Add Code

Versatile Audio-Visual Learning for Handling Single and Multi Modalities in Emotion Regression and Classification Tasks

no code implementations • 12 May 2023 • Lucas Goncalves, Seong-Gyun Leem, Wei-Cheng Lin, Berrak Sisman, Carlos Busso

This study proposes a \emph{versatile audio-visual learning} (VAVL) framework for handling unimodal and multimodal systems for emotion regression and emotion classification tasks.

Ranked #1 on Video Emotion Recognition on CREMA-D

Arousal Estimation Attribute +7

Paper
Add Code

Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion

no code implementations • 25 Oct 2022 • Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li

To achieve this, we propose a novel EVC framework, Mixed-EVC, which only leverages discrete emotion training labels.

Attribute Voice Conversion

Paper
Add Code

Driving Anomaly Detection Using Conditional Generative Adversarial Network

no code implementations • 15 Mar 2022 • Yuning Qiu, Teruhisa Misu, Carlos Busso

The experimental results reveal that recordings annotated with events that are likely to be anomalous, such as avoiding on-road pedestrians and traffic rule violations, have higher anomaly scores than recordings without any event annotation.

Anomaly Detection Generative Adversarial Network

Paper
Add Code

Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech

no code implementations • 19 Jan 2022 • Kusha Sridhar, Carlos Busso

A practical approach to improve valence prediction from speech is to adapt the models to the target speakers in the test set.

Speech Emotion Recognition Transfer Learning

Paper
Add Code

Estimation of Driver's Gaze Region from Head Position and Orientation using Probabilistic Confidence Regions

no code implementations • 23 Dec 2020 • Sumit Jha, Carlos Busso

Specific traits in human behavior can be automatically predicted, which can help the vehicle make decisions, increasing safety.

GPR Position +1

Paper
Add Code

The Ambiguous World of Emotion Representation

no code implementations • 1 Sep 2019 • Vidhyasaharan Sethu, Emily Mower Provost, Julien Epps, Carlos Busso, NIcholas Cummins, Shrikanth Narayanan

A key reason for this is the lack of a common mathematical framework to describe all the relevant elements of emotion representations.

Face Recognition Speaker Verification +2

Paper
Add Code

End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models

no code implementations • 12 Sep 2018 • Fei Tao, Carlos Busso

Recent advances in audiovisual speech processing using deep learning have opened opportunities to capture in a principled way the temporal relationships between acoustic and visual features.

Action Detection Activity Detection +3

Paper
Add Code

Speech-Driven Expressive Talking Lips with Conditional Sequential Generative Adversarial Networks

no code implementations • 1 Jun 2018 • Najmeh Sadoughi, Carlos Busso

Subjective evaluations show significantly better results for this model compared with the CSG model when the target emotion is happiness.

Human-Computer Interaction

Paper
Add Code

MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception

no code implementations • IEEE Transactions on Affective Computing 2016 • Carlos Busso, Srinivas Parthasarathy, Alec Burmania, Mohammed AbdelWahab, Najmeh Sadoughi, Emily Mower Provost

The paper also provides the performance for speech and facial emotion classifiers.

Sentence

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.