TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Emotion Recognition in Conversation	EmoWoz	COSMIC	Weighted F1	85.94	# 2
Emotion Recognition in Conversation	EmoWoz	COSMIC	Macro F1 (w/o Neutral)	56.34	# 1
Emotion Recognition in Conversation	EmoWoz	COSMIC	Weighted F1 (w/o Neutral)	77.09	# 2
Emotion Recognition in Conversation	EmoWoz	COSMIC	Macro F1	61.12	# 2
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-GloVe	Weighted F1	80.76	# 5
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-GloVe	Macro F1 (w/o Neutral)	40.14	# 5
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-GloVe	Weighted F1 (w/o Neutral)	74.56	# 4
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-GloVe	Macro F1	46.33	# 6
Emotion Recognition in Conversation	EmoWoz	BERT	Weighted F1	84.83	# 3
Emotion Recognition in Conversation	EmoWoz	BERT	Macro F1 (w/o Neutral)	50.14	# 4
Emotion Recognition in Conversation	EmoWoz	BERT	Weighted F1 (w/o Neutral)	73.55	# 5
Emotion Recognition in Conversation	EmoWoz	BERT	Macro F1	55.80	# 5
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-BERT	Weighted F1	83.41	# 4
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-BERT	Macro F1 (w/o Neutral)	52.15	# 3
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-BERT	Weighted F1 (w/o Neutral)	75.50	# 3
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-BERT	Macro F1	57.10	# 4
Emotion Recognition in Conversation	EmoWoz	ContextBERT	Weighted F1	88.33	# 1
Emotion Recognition in Conversation	EmoWoz	ContextBERT	Macro F1 (w/o Neutral)	54.30	# 2
Emotion Recognition in Conversation	EmoWoz	ContextBERT	Weighted F1 (w/o Neutral)	79.67	# 1
Emotion Recognition in Conversation	EmoWoz	ContextBERT	Macro F1	59.79	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/emowoz-a-large-scale-corpus-and-labelling/emotion-recognition-in-conversation-on-emowoz)](https://paperswithcode.com/sota/emotion-recognition-in-conversation-on-emowoz?p=emowoz-a-large-scale-corpus-and-labelling)`

EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems

LREC 2022 · Shutong Feng, Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Michael Heck, Carel van Niekerk, Milica Gašić ·

The ability to recognise emotions lends a conversational artificial intelligence a human touch. While emotions in chit-chat dialogues have received substantial attention, emotions in task-oriented dialogues remain largely unaddressed. This is despite emotions and dialogue success having equally important roles in a natural system. Existing emotion-annotated task-oriented corpora are limited in size, label richness, and public availability, creating a bottleneck for downstream tasks. To lay a foundation for studies on emotions in task-oriented dialogues, we introduce EmoWOZ, a large-scale manually emotion-annotated corpus of task-oriented dialogues. EmoWOZ is based on MultiWOZ, a multi-domain task-oriented dialogue dataset. It contains more than 11K dialogues with more than 83K emotion annotations of user utterances. In addition to Wizard-of-Oz dialogues from MultiWOZ, we collect human-machine dialogues within the same set of domains to sufficiently cover the space of various emotions that can happen during the lifetime of a data-driven dialogue system. To the best of our knowledge, this is the first large-scale open-source corpus of its kind. We propose a novel emotion labelling scheme, which is tailored to task-oriented dialogues. We report a set of experimental results to show the usability of this corpus for emotion recognition and state tracking in task-oriented dialogues.

PDF Abstract LREC 2022 PDF LREC 2022 Abstract

Code

Add Remove Mark official

dsml/emowoz-public official

Tasks

Add Remove

Emotion Recognition

Emotion Recognition in Conversation

Task-Oriented Dialogue Systems

Datasets

Introduced in the Paper:

EmoWOZ

Used in the Paper:

MultiWOZ

MELD

EmoryNLP

Results from the Paper

Edit

Ranked #1 on Emotion Recognition in Conversation on EmoWoz

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Emotion Recognition in Conversation	EmoWoz	COSMIC	Weighted F1	85.94	# 2	Compare
			Macro F1 (w/o Neutral)	56.34	# 1	Compare
			Weighted F1 (w/o Neutral)	77.09	# 2	Compare
			Macro F1	61.12	# 2	Compare
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-GloVe	Weighted F1	80.76	# 5	Compare
			Macro F1 (w/o Neutral)	40.14	# 5	Compare
			Weighted F1 (w/o Neutral)	74.56	# 4	Compare
			Macro F1	46.33	# 6	Compare
Emotion Recognition in Conversation	EmoWoz	BERT	Weighted F1	84.83	# 3	Compare
			Macro F1 (w/o Neutral)	50.14	# 4	Compare
			Weighted F1 (w/o Neutral)	73.55	# 5	Compare
			Macro F1	55.80	# 5	Compare
Emotion Recognition in Conversation	EmoWoz	DialogueRNN-BERT	Weighted F1	83.41	# 4	Compare
			Macro F1 (w/o Neutral)	52.15	# 3	Compare
			Weighted F1 (w/o Neutral)	75.50	# 3	Compare
			Macro F1	57.10	# 4	Compare
Emotion Recognition in Conversation	EmoWoz	ContextBERT	Weighted F1	88.33	# 1	Compare
			Macro F1 (w/o Neutral)	54.30	# 2	Compare
			Weighted F1 (w/o Neutral)	79.67	# 1	Compare
			Macro F1	59.79	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove