TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Emotion Recognition in Conversation	IEMOCAP	TRMSM-Att	Weighted-F1	65.94	# 34
Emotion Recognition in Conversation	MELD	TRMSM-Att	Weighted-F1	62.36	# 40

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-hierarchical-transformer-with-speaker/emotion-recognition-in-conversation-on)](https://paperswithcode.com/sota/emotion-recognition-in-conversation-on?p=a-hierarchical-transformer-with-speaker)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-hierarchical-transformer-with-speaker/emotion-recognition-in-conversation-on-meld)](https://paperswithcode.com/sota/emotion-recognition-in-conversation-on-meld?p=a-hierarchical-transformer-with-speaker)`

A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversation

29 Dec 2020 · Jiangnan Li, Zheng Lin, Peng Fu, Qingyi Si, Weiping Wang ·

Emotion Recognition in Conversation (ERC) is a more challenging task than conventional text emotion recognition. It can be regarded as a personalized and interactive emotion recognition task, which is supposed to consider not only the semantic information of text but also the influences from speakers. The current method models speakers' interactions by building a relation between every two speakers. However, this fine-grained but complicated modeling is computationally expensive, hard to extend, and can only consider local context. To address this problem, we simplify the complicated modeling to a binary version: Intra-Speaker and Inter-Speaker dependencies, without identifying every unique speaker for the targeted speaker. To better achieve the simplified interaction modeling of speakers in Transformer, which shows excellent ability to settle long-distance dependency, we design three types of masks and respectively utilize them in three independent Transformer blocks. The designed masks respectively model the conventional context modeling, Intra-Speaker dependency, and Inter-Speaker dependency. Furthermore, different speaker-aware information extracted by Transformer blocks diversely contributes to the prediction, and therefore we utilize the attention mechanism to automatically weight them. Experiments on two ERC datasets indicate that our model is efficacious to achieve better performance.

PDF Abstract

Code

Add Remove Mark official

leqsnan/skaig-erc official

Tasks

Add Remove

Emotion Recognition

Emotion Recognition in Conversation

Datasets

IEMOCAP

MELD

Results from the Paper

Edit

Ranked #34 on Emotion Recognition in Conversation on IEMOCAP

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Emotion Recognition in Conversation	IEMOCAP	TRMSM-Att	Weighted-F1	65.94	# 34		Compare
Emotion Recognition in Conversation	MELD	TRMSM-Att	Weighted-F1	62.36	# 40		Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove