Search Results for author: Carlos Busso

Found 9 papers, 0 papers with code

Versatile Audio-Visual Learning for Handling Single and Multi Modalities in Emotion Regression and Classification Tasks

no code implementations12 May 2023 Lucas Goncalves, Seong-Gyun Leem, Wei-Cheng Lin, Berrak Sisman, Carlos Busso

This study proposes a \emph{versatile audio-visual learning} (VAVL) framework for handling unimodal and multimodal systems for emotion regression and emotion classification tasks.

Arousal Estimation audio-visual learning +6

Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion

no code implementations25 Oct 2022 Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li

To achieve this, we propose a novel EVC framework, Mixed-EVC, which only leverages discrete emotion training labels.

Voice Conversion

Driving Anomaly Detection Using Conditional Generative Adversarial Network

no code implementations15 Mar 2022 Yuning Qiu, Teruhisa Misu, Carlos Busso

The experimental results reveal that recordings annotated with events that are likely to be anomalous, such as avoiding on-road pedestrians and traffic rule violations, have higher anomaly scores than recordings without any event annotation.

Anomaly Detection

Estimation of Driver's Gaze Region from Head Position and Orientation using Probabilistic Confidence Regions

no code implementations23 Dec 2020 Sumit Jha, Carlos Busso

Specific traits in human behavior can be automatically predicted, which can help the vehicle make decisions, increasing safety.

GPR regression

The Ambiguous World of Emotion Representation

no code implementations1 Sep 2019 Vidhyasaharan Sethu, Emily Mower Provost, Julien Epps, Carlos Busso, NIcholas Cummins, Shrikanth Narayanan

A key reason for this is the lack of a common mathematical framework to describe all the relevant elements of emotion representations.

Face Recognition Speaker Verification +2

End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models

no code implementations12 Sep 2018 Fei Tao, Carlos Busso

Recent advances in audiovisual speech processing using deep learning have opened opportunities to capture in a principled way the temporal relationships between acoustic and visual features.

Action Detection Activity Detection +3

Speech-Driven Expressive Talking Lips with Conditional Sequential Generative Adversarial Networks

no code implementations1 Jun 2018 Najmeh Sadoughi, Carlos Busso

Subjective evaluations show significantly better results for this model compared with the CSG model when the target emotion is happiness.

Human-Computer Interaction

Cannot find the paper you are looking for? You can Submit a new open access paper.