Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus
In this paper, we analyze neural network-based dialogue systems trained in an end-to-end manner using an updated version of the recent Ubuntu Dialogue Corpus, a dataset containing almost 1 million multi-turn dialogues, with a total of over 7 million utterances and 100 million words. This dataset is interesting because of its size, long context lengths, and technical nature; thus, it can be used to train large models directly from data with minimal feature engineering. We provide baselines in two different environments: one where models are trained to select the correct next response from a list of candidate responses, and one where models are trained to maximize the log-likelihood of a generated utterance conditioned on the context of the conversation. These are both evaluated on a recall task that we call next utterance classification (NUC), and using vector-based metrics that capture the topicality of the responses. We observe that current end-to-end models are unable to completely solve these tasks; thus, we provide a qualitative error analysis to determine the primary causes of error for end-to-end models evaluated on NUC, and examine sample utterances from the generative models. As a result of this analysis, we suggest some promising directions for future research on the Ubuntu Dialogue Corpus, which can also be applied to end-to-end dialogue systems in general.
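The NUC evaluation described above is typically scored as Recall@k: the model ranks a list of candidate responses (the true next utterance plus distractors), and the prediction counts as correct if the true response lands in the top k. A minimal sketch of that metric, assuming candidate scores have already been produced by some model (the scores below are illustrative placeholders, not results from the paper):

```python
# Sketch of Recall@k for next utterance classification (NUC).
# Given per-candidate scores and the index of the true response,
# check whether the true response ranks among the top k candidates.

def recall_at_k(scores, true_index, k):
    """Return 1 if the true response is among the k highest-scoring candidates."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return 1 if true_index in ranked[:k] else 0

# Hypothetical example: 10 candidates, true response (index 0) scored second-highest.
scores = [0.8, 0.9, 0.1, 0.3, 0.2, 0.05, 0.4, 0.15, 0.25, 0.35]
print(recall_at_k(scores, true_index=0, k=1))  # 0: not the single top candidate
print(recall_at_k(scores, true_index=0, k=2))  # 1: within the top 2
```

Averaging this indicator over a test set of (context, candidate list) pairs gives the Recall@k figures commonly reported for this corpus.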
Results from the Paper
Task | Dataset | Model | Metric Name | Metric Value | Global Rank
---|---|---|---|---|---
Conversation Disentanglement | Linux IRC (Ch2 Elsner) | Heuristic | 1-1 | 45.1 | # 3
Conversation Disentanglement | Linux IRC (Ch2 Elsner) | Heuristic | Local | 73.8 | # 3
Conversation Disentanglement | Linux IRC (Ch2 Elsner) | Heuristic | Shen F-1 | 51.8 | # 3
Conversation Disentanglement | Linux IRC (Ch2 Kummerfeld) | Heuristic | 1-1 | 43.4 | # 3
Conversation Disentanglement | Linux IRC (Ch2 Kummerfeld) | Heuristic | Local | 67.9 | # 2
Conversation Disentanglement | Linux IRC (Ch2 Kummerfeld) | Heuristic | Shen F-1 | 50.7 | # 2