Speaker Normalization for Self-supervised Speech Emotion Recognition

2 Feb 2022 · Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory

Large speech emotion recognition datasets are hard to obtain, and small datasets may contain biases. Deep-net-based classifiers, in turn, are prone to exploit those biases and find shortcuts such as speaker characteristics. These shortcuts usually harm a model's ability to generalize. To address this challenge, we propose a gradient-based adversarial learning framework that learns a speech emotion recognition task while normalizing speaker characteristics out of the feature representation. We demonstrate the efficacy of our method in both speaker-independent and speaker-dependent settings and obtain new state-of-the-art results on the challenging IEMOCAP dataset.
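The core idea of gradient-based adversarial speaker normalization can be sketched as follows: a speaker classifier is attached to the shared feature encoder, and its gradient is reversed (and scaled by a weight λ) before reaching the encoder, so the encoder learns features that are useful for emotion recognition but uninformative about speaker identity. The toy functions below are a minimal illustration of that gradient flow, not the authors' implementation; the names `reverse_gradient` and `combined_feature_gradient` and the scalar `lam` are hypothetical.

```python
# Toy sketch of gradient reversal for speaker normalization.
# Hypothetical illustration; not the paper's actual code.

def reverse_gradient(grad, lam):
    """Backward pass of a gradient-reversal layer: negate and
    scale the speaker-classifier gradient by lam before it
    reaches the shared encoder."""
    return [-lam * g for g in grad]

def combined_feature_gradient(grad_emotion, grad_speaker, lam):
    """Gradient the shared encoder receives: the emotion-task
    gradient plus the reversed speaker gradient. Descending this
    direction improves emotion prediction while removing
    speaker information from the features."""
    reversed_spk = reverse_gradient(grad_speaker, lam)
    return [ge + gr for ge, gr in zip(grad_emotion, reversed_spk)]

# Example: per-feature gradients from the two heads.
grad_emotion = [1.0, 2.0]
grad_speaker = [0.5, -1.0]
update = combined_feature_gradient(grad_emotion, grad_speaker, lam=1.0)
```

In a real framework (e.g. PyTorch) the same effect is typically implemented as an identity forward pass whose backward pass multiplies incoming gradients by -λ, placed between the encoder and the speaker classifier.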


Datasets

IEMOCAP

Results from the Paper


Task                       | Dataset | Model              | Metric | Value | Global Rank
Speech Emotion Recognition | IEMOCAP | TAP (Low Resource) | AUC    | 0.649 | # 1
Speech Emotion Recognition | IEMOCAP | TAP (5-fold)       | WA     | 0.742 | # 9
Speech Emotion Recognition | IEMOCAP | TAP                | WA     | 0.81  | # 3
