Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

In this work, we present a hybrid learning method for training task-oriented dialogue systems through online user interactions. Popular methods for learning task-oriented dialogue apply reinforcement learning with user feedback on top of supervised pre-trained models. The efficiency of such methods may suffer from a mismatch in dialogue state distribution between the offline training and online interactive learning stages. To address this challenge, we propose a hybrid imitation and reinforcement learning method, with which a dialogue agent can learn effectively from its interactions with users by learning from both human teaching and feedback. We design a neural-network-based task-oriented dialogue agent that can be optimized end-to-end with the proposed learning method. Experimental results show that our end-to-end dialogue agent can learn effectively from the mistakes it makes via imitation learning from user teaching. Applying reinforcement learning with user feedback after the imitation learning stage further improves the agent's capability in successfully completing a task.
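The two-stage scheme described above can be illustrated with a minimal sketch. Everything below is an assumption for illustration only, not the authors' code: a toy softmax policy over a handful of dialogue states, a simulated "user teacher" that supplies the correct action on states the agent itself visits (imitation learning, which addresses the state-distribution mismatch), and scalar user feedback used as a reward for a REINFORCE-style update.

```python
# Hypothetical sketch of the hybrid imitation + reinforcement learning loop.
# The toy environment, the `teacher` oracle, and all hyperparameters are
# assumptions made for illustration, not the paper's actual agent.
import numpy as np

rng = np.random.default_rng(0)
N_STATES, N_ACTIONS = 5, 3
theta = np.zeros((N_STATES, N_ACTIONS))           # softmax policy parameters
teacher = rng.integers(N_ACTIONS, size=N_STATES)  # simulated user's preferred action

def policy(s):
    """Softmax action distribution for state s."""
    p = np.exp(theta[s] - theta[s].max())
    return p / p.sum()

def imitation_step(s, lr=0.5):
    """User teaching: cross-entropy update toward the teacher's action on a
    state the agent itself visited (the on-policy states that an offline
    supervised model would otherwise never see)."""
    p = policy(s)
    grad = -p
    grad[teacher[s]] += 1.0                       # d log pi(a*|s) / d theta
    theta[s] += lr * grad

def reinforce_step(s, lr=0.5):
    """User feedback: sample an action, receive +1/-1 feedback, and apply
    the REINFORCE update theta += lr * r * grad log pi(a|s)."""
    p = policy(s)
    a = rng.choice(N_ACTIONS, p=p)
    r = 1.0 if a == teacher[s] else -1.0
    grad = -p
    grad[a] += 1.0
    theta[s] += lr * r * grad

# Stage 1: imitation learning from user teaching on visited states.
for _ in range(200):
    imitation_step(rng.integers(N_STATES))
# Stage 2: reinforcement learning from scalar user feedback.
for _ in range(200):
    reinforce_step(rng.integers(N_STATES))

accuracy = np.mean([policy(s).argmax() == teacher[s] for s in range(N_STATES)])
print(f"agent matches the user-preferred action on {accuracy:.0%} of states")
```

The ordering matters in the sketch for the same reason it does in the paper: the imitation stage corrects the agent on the state distribution its own policy induces, so the subsequent reinforcement stage starts from a policy that already behaves sensibly and only needs feedback to refine it.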

NAACL 2018
Task: Dialogue State Tracking
Dataset: Second Dialogue State Tracking Challenge
Model: Liu et al.

Metric   Value   Global Rank
Request  -       # 4
Area     90      # 2
Food     84      # 2
Price    92      # 2
Joint    72      # 6
