Emotion Recognition
465 papers with code • 7 benchmarks • 45 datasets
Emotion Recognition is an important area of research for enabling effective human-computer interaction. Human emotions can be detected from speech signals, facial expressions, body language, and electroencephalography (EEG). Source: Using Deep Autoencoders for Facial Expression Recognition
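As a concrete starting point for the speech modality, a pretrained classifier can be queried in a few lines. The sketch below uses the Hugging Face `transformers` audio-classification pipeline; the checkpoint name is an assumption (any speech emotion recognition checkpoint would do), not an endorsement of a particular model.

```python
# Minimal sketch: speech emotion recognition with a pretrained checkpoint.
# The model id below is an assumption; substitute any SER checkpoint.
from transformers import pipeline

classifier = pipeline(
    "audio-classification",
    model="superb/wav2vec2-base-superb-er",  # assumed SER checkpoint
)

# Expects a mono waveform file; returns ranked emotion labels with scores.
predictions = classifier("speech_sample.wav")
for p in predictions:
    print(f"{p['label']}: {p['score']:.3f}")
```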
Latest papers
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition
To address these issues, we propose an active learning (AL)-based fine-tuning framework for SER, called AFTER, that leverages task adaptation pre-training (TAPT) and AL methods to enhance performance and efficiency.
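The AL component can be illustrated with a generic uncertainty-sampling loop. The sketch below is not the authors' AFTER implementation, just standard entropy-based selection; `model` and `unlabeled_pool` are assumed stand-ins for a fine-tuned SER classifier and a batch of unlabeled utterance features.

```python
# Sketch of entropy-based active learning selection (not the AFTER code).
import torch
import torch.nn.functional as F

def select_for_labeling(model, unlabeled_pool, budget=32):
    """Return indices of the `budget` most uncertain pool items."""
    model.eval()
    with torch.no_grad():
        logits = model(unlabeled_pool)  # (N, num_emotions)
        probs = F.softmax(logits, dim=-1)
        entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=-1)
    # Highest predictive entropy = most informative samples to annotate.
    return torch.topk(entropy, k=budget).indices
```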
A Systematic Evaluation of Adversarial Attacks against Speech Emotion Recognition Models
In summary, this work contributes to the understanding of the robustness of CNN-LSTM models, particularly in SER scenarios, and of the impact of adversarial examples (AEs).
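A representative white-box attack of the kind such evaluations use is the fast gradient sign method (FGSM), sketched below for an assumed CNN-LSTM classifier operating on spectrogram inputs; this is illustrative, not the paper's exact attack configuration.

```python
# Sketch of an FGSM adversarial example against a speech emotion classifier.
# `model` is an assumed CNN-LSTM taking spectrogram tensors.
import torch
import torch.nn.functional as F

def fgsm_attack(model, spectrogram, label, epsilon=0.01):
    x = spectrogram.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), label)
    loss.backward()
    # Perturb the input in the direction that increases the loss.
    return (x + epsilon * x.grad.sign()).detach()
```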
MultiMAE-DER: Multimodal Masked Autoencoder for Dynamic Emotion Recognition
In comparison to state-of-the-art multimodal supervised learning models for dynamic emotion recognition, MultiMAE-DER improves the weighted average recall (WAR) by 4.41% on the RAVDESS dataset and by 2.06% on CREMA-D.
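Weighted average recall (WAR) is recall weighted by class support, as opposed to unweighted average recall (UAR, i.e. macro recall); both are one-liners with scikit-learn, shown on toy labels below.

```python
# WAR vs. UAR on toy predictions: average="weighted" weights each class's
# recall by its support (WAR), while "macro" averages uniformly (UAR).
from sklearn.metrics import recall_score

y_true = [0, 0, 0, 1, 1, 2]
y_pred = [0, 0, 1, 1, 1, 0]

war = recall_score(y_true, y_pred, average="weighted")
uar = recall_score(y_true, y_pred, average="macro")
print(f"WAR={war:.3f}  UAR={uar:.3f}")
```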
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
In addition to expanding the dataset size, we introduce a new track around open-vocabulary emotion recognition.
EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning
Visual Instruction Tuning represents a novel learning paradigm involving the fine-tuning of pre-trained language models using task-specific instructions.
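Concretely, such instruction data reduces to (instruction, input, response) records. The hypothetical example below follows the common Alpaca/LLaVA-style schema, which is an assumption for illustration rather than EmoVIT's exact format.

```python
# Hypothetical instruction-tuning record for emotion-centric visual data;
# field names follow a common convention, not necessarily EmoVIT's schema.
example = {
    "image": "path/to/photo.jpg",
    "instruction": "Describe the dominant emotion conveyed by this image "
                   "and justify your answer from visual cues.",
    "response": "The image conveys joy: the subject is smiling broadly ...",
}
```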
CAGE: Circumplex Affect Guided Expression Inference
Using a small-scale MaxViT-based model architecture, we evaluate the impact of discrete expression-category labels when training with continuous valence and arousal labels.
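On the circumplex model, a continuous (valence, arousal) prediction can be mapped back to a discrete region. The sketch below uses a crude quadrant split with illustrative labels; it is a simplification, not CAGE's taxonomy.

```python
# Rough mapping from continuous valence/arousal (both in [-1, 1]) to a
# discrete circumplex quadrant; the labels are illustrative simplifications.
def circumplex_quadrant(valence: float, arousal: float) -> str:
    if valence >= 0:
        return "happy/excited" if arousal >= 0 else "calm/content"
    return "angry/afraid" if arousal >= 0 else "sad/bored"

print(circumplex_quadrant(0.7, 0.5))   # -> happy/excited
print(circumplex_quadrant(-0.6, 0.8))  # -> angry/afraid
```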
Cooperative Sentiment Agents for Multimodal Sentiment Analysis
In this paper, we propose a new Multimodal Representation Learning (MRL) method for Multimodal Sentiment Analysis (MSA), which facilitates the adaptive interaction between modalities through Cooperative Sentiment Agents, named Co-SA.
MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wild
Within the field of multimodal DFER, recent methods have focused on exploiting advances in self-supervised learning (SSL) for pre-training of strong multimodal encoders.
Resolve Domain Conflicts for Generalizable Remote Physiological Measurement
Remote photoplethysmography (rPPG) technology has become increasingly popular due to its non-invasive monitoring of various physiological indicators, making it widely applicable in multimedia interaction, healthcare, and emotion analysis.
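The classical rPPG baseline behind such systems averages the green channel over a face region and reads heart rate off the dominant spectral peak. The numpy sketch below assumes an RGB-ordered `(T, H, W, 3)` array of face-ROI frames; real pipelines add detrending, filtering, and face tracking.

```python
# Sketch of the classic green-channel rPPG baseline.
import numpy as np

def estimate_heart_rate(roi_frames: np.ndarray, fps: float = 30.0) -> float:
    signal = roi_frames[..., 1].mean(axis=(1, 2))   # mean green per frame
    signal = signal - signal.mean()                  # remove DC component
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fps)
    band = (freqs >= 0.7) & (freqs <= 4.0)           # ~42-240 bpm
    peak = freqs[band][np.argmax(spectrum[band])]
    return peak * 60.0                               # beats per minute
```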
What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions
There is increasing interest in the use of the LEArnable Front-end (LEAF) in a variety of speech processing systems.
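For reference, PCEN applies a per-channel automatic gain control followed by root compression: PCEN(t, f) = (E(t, f) / (eps + M(t, f))^alpha + delta)^r - delta^r, where M is a first-order IIR smoothing of the energies E along time. The numpy sketch below uses typical default parameters, not LEAF's learned ones.

```python
# Sketch of Per-Channel Energy Normalisation (PCEN); parameter values are
# common defaults, and in LEAF these would be learned per channel.
import numpy as np

def pcen(E, s=0.025, alpha=0.98, delta=2.0, r=0.5, eps=1e-6):
    """E: (freq_bins, time_frames) non-negative filterbank energies."""
    M = np.empty_like(E)
    M[:, 0] = E[:, 0]
    for t in range(1, E.shape[1]):
        # First-order IIR smoother of the energy along time.
        M[:, t] = (1 - s) * M[:, t - 1] + s * E[:, t]
    return (E / (eps + M) ** alpha + delta) ** r - delta ** r
```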