TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Response Generation	MMConv	SimpleTOD	Inform	14.6	# 2
Response Generation	MMConv	SimpleTOD	Success	9.2	# 2
Response Generation	MMConv	SimpleTOD	BLEU	20.3	# 2
Response Generation	MMConv	SimpleTOD	Comb.	32.2	# 2
End-To-End Dialogue Modelling	MULTIWOZ 2.0	SimpleTOD	MultiWOZ (Success)	70.1	# 5
End-To-End Dialogue Modelling	MULTIWOZ 2.0	SimpleTOD	MultiWOZ (Inform)	84.4	# 5
End-To-End Dialogue Modelling	MULTIWOZ 2.0	SimpleTOD	BLEU	15.0	# 6
End-To-End Dialogue Modelling	MULTIWOZ 2.1	SimpleTOD	MultiWOZ (Success)	70.5	# 3
End-To-End Dialogue Modelling	MULTIWOZ 2.1	SimpleTOD	MultiWOZ (Inform)	85.0	# 3
End-To-End Dialogue Modelling	MULTIWOZ 2.1	SimpleTOD	BLEU	15.2	# 4
Multi-domain Dialogue State Tracking	MULTIWOZ 2.1	SimpleTOD	Joint Acc	55.76	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-simple-language-model-for-task-oriented/response-generation-on-mmconv)](https://paperswithcode.com/sota/response-generation-on-mmconv?p=a-simple-language-model-for-task-oriented)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-simple-language-model-for-task-oriented/end-to-end-dialogue-modelling-on-multiwoz-2-1)](https://paperswithcode.com/sota/end-to-end-dialogue-modelling-on-multiwoz-2-1?p=a-simple-language-model-for-task-oriented)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-simple-language-model-for-task-oriented/end-to-end-dialogue-modelling-on-multiwoz-2-0)](https://paperswithcode.com/sota/end-to-end-dialogue-modelling-on-multiwoz-2-0?p=a-simple-language-model-for-task-oriented)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-simple-language-model-for-task-oriented/multi-domain-dialogue-state-tracking-on-1)](https://paperswithcode.com/sota/multi-domain-dialogue-state-tracking-on-1?p=a-simple-language-model-for-task-oriented)`

A Simple Language Model for Task-Oriented Dialogue

NeurIPS 2020 · Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher ·

Task-oriented dialogue is often decomposed into three tasks: understanding user input, deciding actions, and generating a response. While such decomposition might suggest a dedicated model for each sub-task, we find a simple, unified approach leads to state-of-the-art performance on the MultiWOZ dataset. SimpleTOD is a simple approach to task-oriented dialogue that uses a single, causal language model trained on all sub-tasks recast as a single sequence prediction problem. This allows SimpleTOD to fully leverage transfer learning from pre-trained, open domain, causal language models such as GPT-2. SimpleTOD improves over the prior state-of-the-art in joint goal accuracy for dialogue state tracking, and our analysis reveals robustness to noisy annotations in this setting. SimpleTOD also improves the main metrics used to evaluate action decisions and response generation in an end-to-end setting: inform rate by 8.1 points, success rate by 9.7 points, and combined score by 7.2 points.

PDF Abstract NeurIPS 2020 PDF NeurIPS 2020 Abstract

Code

Add Remove Mark official

salesforce/simpletod official

232

Tasks

Add Remove

Dialogue State Tracking

End-To-End Dialogue Modelling

Language Modelling

Multi-domain Dialogue State Tracking

Response Generation

Transfer Learning

Datasets

MultiWOZ

MMConv

Results from the Paper

Edit

Ranked #2 on Response Generation on MMConv

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Response Generation	MMConv	SimpleTOD	Inform	14.6	# 2	Compare
			Success	9.2	# 2	Compare
			BLEU	20.3	# 2	Compare
			Comb.	32.2	# 2	Compare
End-To-End Dialogue Modelling	MULTIWOZ 2.0	SimpleTOD	MultiWOZ (Success)	70.1	# 5	Compare
			MultiWOZ (Inform)	84.4	# 5	Compare
			BLEU	15.0	# 6	Compare
End-To-End Dialogue Modelling	MULTIWOZ 2.1	SimpleTOD	MultiWOZ (Success)	70.5	# 3	Compare
			MultiWOZ (Inform)	85.0	# 3	Compare
			BLEU	15.2	# 4	Compare
Multi-domain Dialogue State Tracking	MULTIWOZ 2.1	SimpleTOD	Joint Acc	55.76	# 9	Compare

Methods

Add Remove

Adam • Attention Dropout • BPE • Cosine Annealing • Dense Connections • Discriminative Fine-Tuning • Dropout • GELU • GPT-2 • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Weight Decay

Edit Social Preview

A Simple Language Model for Task-Oriented Dialogue

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove