TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Conversational Response Selection	Douban	UMS_BERT+	MAP	0.625	# 6
Conversational Response Selection	Douban	UMS_BERT+	MRR	0.664	# 6
Conversational Response Selection	Douban	UMS_BERT+	P@1	0.499	# 6
Conversational Response Selection	Douban	UMS_BERT+	R10@1	0.318	# 6
Conversational Response Selection	Douban	UMS_BERT+	R10@2	0.482	# 8
Conversational Response Selection	Douban	UMS_BERT+	R10@5	0.858	# 5
Conversational Response Selection	E-commerce	UMS_BERT+	R10@1	0.762	# 5
Conversational Response Selection	E-commerce	UMS_BERT+	R10@2	0.905	# 5
Conversational Response Selection	E-commerce	UMS_BERT+	R10@5	0.986	# 6
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	UMS_BERT+	R10@1	0.875	# 8
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	UMS_BERT+	R10@2	0.942	# 8
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	UMS_BERT+	R10@5	0.988	# 8
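
The metric names in the table are not defined on this page. The following is a minimal sketch of how such ranking metrics are commonly computed. It assumes that R10@k means recall at k over 10 candidate responses per context, and that each candidate carries a model score and a binary relevance label. The function names and the toy scores are illustrative assumptions, not taken from the paper or its repository.

```python
from typing import List, Sequence


def rank_labels(scores: Sequence[float], labels: Sequence[int]) -> List[int]:
    """Return the relevance labels reordered by descending model score."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    return [labels[i] for i in order]


def recall_at_k(ranked: Sequence[int], k: int) -> float:
    """Fraction of all relevant candidates found in the top k (R_n@k)."""
    total = sum(ranked)
    return sum(ranked[:k]) / total if total else 0.0


def precision_at_1(ranked: Sequence[int]) -> float:
    """1.0 if the top-ranked candidate is relevant, else 0.0 (P@1)."""
    return float(ranked[0])


def reciprocal_rank(ranked: Sequence[int]) -> float:
    """1 / position of the first relevant candidate; averaged over contexts gives MRR."""
    for position, label in enumerate(ranked, start=1):
        if label:
            return 1.0 / position
    return 0.0


def average_precision(ranked: Sequence[int]) -> float:
    """Mean of precision values at each relevant position; averaged over contexts gives MAP."""
    hits, total = 0, 0.0
    for position, label in enumerate(ranked, start=1):
        if label:
            hits += 1
            total += hits / position
    return total / hits if hits else 0.0


# Toy example (hypothetical scores): 10 candidates, two of them relevant.
scores = [0.2, 0.9, 0.4, 0.8, 0.1, 0.3, 0.05, 0.6, 0.7, 0.5]
labels = [0, 0, 0, 1, 0, 0, 0, 0, 0, 1]
ranked = rank_labels(scores, labels)  # relevant candidates land at ranks 2 and 5
r10_at_2 = recall_at_k(ranked, 2)     # one of the two relevant candidates is in the top 2
```

Each function scores a single context; a dataset-level figure such as those in the table would be the mean over all test contexts.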

Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection

10 Sep 2020 · Taesun Whang, Dongyub Lee, Dongsuk Oh, Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

In this paper, we study the task of selecting the optimal response given a user and system utterance history in retrieval-based multi-turn dialog systems. Recently, pre-trained language models (e.g., BERT, RoBERTa, and ELECTRA) have shown significant improvements in various natural language processing tasks. This and similar response selection tasks can also be solved with such language models by formulating them as dialog-response binary classification tasks. Although existing works using this approach have obtained state-of-the-art results, we observe that language models trained in this manner tend to make predictions based on the relatedness of history and candidates, ignoring the sequential nature of multi-turn dialog systems. This suggests that the response selection task alone is insufficient for learning temporal dependencies between utterances. To address this problem, we propose utterance manipulation strategies (UMS). Specifically, UMS consist of several strategies (i.e., insertion, deletion, and search), which help the response selection model maintain dialog coherence. Further, UMS are self-supervised methods that do not require additional annotation and thus can be easily incorporated into existing approaches. Extensive evaluation across multiple languages and models shows that UMS are highly effective in teaching dialog consistency, leading to models that advance the state of the art by significant margins on multiple public benchmark datasets.
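
The abstract names the binary classification formulation and the three manipulation strategies but does not define them. The sketch below shows one plausible reading of how such training samples could be built from raw dialogs without extra annotation. In this reading, insertion removes an utterance and asks where it belongs, deletion injects a foreign utterance and asks which one it is, and search shuffles the history and asks which utterance directly precedes the last one. All function names, return layouts, and target encodings here are hypothetical and are not the authors' implementation; the official repository is the reference for that.

```python
import random
from typing import List, Tuple


def make_classification_pairs(
    history: List[str], positive: str, negatives: List[str]
) -> List[Tuple[List[str], str, int]]:
    """Dialog-response binary classification: label 1 for the true response, 0 otherwise."""
    pairs = [(history, positive, 1)]
    pairs += [(history, negative, 0) for negative in negatives]
    return pairs


def make_insertion_sample(
    dialog: List[str], rng: random.Random
) -> Tuple[List[str], str, int]:
    """Remove one utterance; the target is the slot where it must be re-inserted."""
    idx = rng.randrange(len(dialog))
    remaining = dialog[:idx] + dialog[idx + 1:]
    return remaining, dialog[idx], idx


def make_deletion_sample(
    dialog: List[str], distractor: str, rng: random.Random
) -> Tuple[List[str], int]:
    """Inject an utterance from another dialog; the target is its position."""
    idx = rng.randrange(len(dialog) + 1)
    corrupted = dialog[:idx] + [distractor] + dialog[idx:]
    return corrupted, idx


def make_search_sample(
    dialog: List[str], rng: random.Random
) -> Tuple[List[str], str, int]:
    """Shuffle all but the last utterance; the target is the shuffled
    position of the utterance that originally preceded the last one."""
    if len(dialog) < 2:
        raise ValueError("search needs at least two utterances")
    *history, last = dialog
    order = list(range(len(history)))
    rng.shuffle(order)
    shuffled = [history[i] for i in order]
    target = order.index(len(history) - 1)
    return shuffled, last, target


# Hypothetical toy dialog used only to exercise the helpers.
rng = random.Random(0)
dialog = ["hi", "hello, how can I help?", "my wifi is down", "which driver?", "iwlwifi"]
pairs = make_classification_pairs(dialog[:-1], dialog[-1], ["try rebooting", "thanks"])
ins_remaining, ins_removed, ins_slot = make_insertion_sample(dialog, rng)
del_corrupted, del_pos = make_deletion_sample(dialog, "I like pizza", rng)
srch_shuffled, srch_last, srch_target = make_search_sample(dialog, rng)
```

Because every target is derived from the original utterance order, the labels come for free from the dialogs themselves, which is what makes such objectives self-supervised.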

Code

taesunwhang/UMS-ResSel (official)

Tasks

Binary Classification

Conversational Response Selection

Retrieval

Datasets

Douban

UDC

E-commerce

Results from the Paper

Ranked #5 on Conversational Response Selection on E-commerce

Methods

Adam • Attention Dropout • BERT • Dense Connections • Dropout • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Residual Connection • RoBERTa • Scaled Dot-Product Attention • Softmax • Weight Decay • WordPiece
