TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Conversational Response Selection	Douban	Poly-encoder	MAP	0.608	# 9
Conversational Response Selection	Douban	Poly-encoder	MRR	0.650	# 9
Conversational Response Selection	Douban	Poly-encoder	P@1	0.475	# 9
Conversational Response Selection	Douban	Poly-encoder	R10@1	0.299	# 9
Conversational Response Selection	Douban	Poly-encoder	R10@2	0.494	# 7
Conversational Response Selection	Douban	Poly-encoder	R10@5	0.822	# 10
Conversational Response Selection	DSTC7 Ubuntu	Bi-encoder (v2)	1-of-100 Accuracy	70.9%	# 2
Conversational Response Selection	DSTC7 Ubuntu	Bi-encoder	1-of-100 Accuracy	66.3%	# 3
Conversational Response Selection	RRS Ranking Test	Poly-encoder	NDCG@3	0.679	# 1
Conversational Response Selection	RRS Ranking Test	Poly-encoder	NDCG@5	0.765	# 1
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	Poly-encoder	R10@1	0.882	# 7
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	Poly-encoder	R10@2	0.949	# 4
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	Poly-encoder	R10@5	0.990	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190501969/conversational-response-selection-on-rrs-1)](https://paperswithcode.com/sota/conversational-response-selection-on-rrs-1?p=190501969)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190501969/conversational-response-selection-on-dstc7)](https://paperswithcode.com/sota/conversational-response-selection-on-dstc7?p=190501969)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190501969/conversational-response-selection-on-ubuntu-1)](https://paperswithcode.com/sota/conversational-response-selection-on-ubuntu-1?p=190501969)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190501969/conversational-response-selection-on-douban-1)](https://paperswithcode.com/sota/conversational-response-selection-on-douban-1?p=190501969)`

Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring

22 Apr 2019 · Samuel Humeau, Kurt Shuster, Marie-Anne Lachaux, Jason Weston

The use of deep pre-trained bidirectional transformers has led to remarkable progress in a number of applications (Devlin et al., 2018). For tasks that make pairwise comparisons between sequences, matching a given input with a corresponding label, two approaches are common: Cross-encoders performing full self-attention over the pair and Bi-encoders encoding the pair separately. The former often performs better, but is too slow for practical use. In this work, we develop a new transformer architecture, the Poly-encoder, that learns global rather than token-level self-attention features. We perform a detailed comparison of all three approaches, including what pre-training and fine-tuning strategies work best. We show our models achieve state-of-the-art results on three existing tasks; that Poly-encoders are faster than Cross-encoders and more accurate than Bi-encoders; and that the best results are obtained by pre-training on large datasets similar to the downstream tasks.
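To make the contrast in the abstract concrete, here is a minimal NumPy sketch of Bi-encoder and Poly-encoder scoring. It is an illustration under stated assumptions, not the authors' implementation: random vectors stand in for transformer outputs, the function names (`attend`, `bi_encoder_score`, `poly_encoder_score`) and the array `codes` are hypothetical, and the mean reduction in the Bi-encoder is one possible choice. The Cross-encoder is omitted because it needs a full transformer pass over each concatenated (context, candidate) pair, which is the cost the other two approaches avoid.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def attend(queries, keys, values):
    """Dot-product attention: each query row becomes a weighted sum of value rows."""
    weights = softmax(queries @ keys.T, axis=-1)
    return weights @ values


def bi_encoder_score(ctx_tokens, cand_vec):
    """Reduce the context to one vector without looking at the candidate."""
    ctx_vec = ctx_tokens.mean(axis=0)  # assumption: mean over token outputs
    return float(ctx_vec @ cand_vec)


def poly_encoder_score(ctx_tokens, cand_vec, codes):
    """Score a candidate against m global context features.

    `codes` is an (m, d) array standing in for learned query vectors.
    """
    # Step 1: m codes attend over the context tokens -> m global features.
    # This step does not depend on the candidate, so it can be cached.
    global_feats = attend(codes, ctx_tokens, ctx_tokens)  # shape (m, d)
    # Step 2: the candidate attends over the m features -> one context vector.
    ctx_vec = attend(cand_vec[None, :], global_feats, global_feats)[0]
    # Step 3: final score is a dot product, as in the Bi-encoder.
    return float(ctx_vec @ cand_vec)


# Toy inputs: 12 context token vectors, 4 codes, 5 candidate vectors, d = 8.
rng = np.random.default_rng(0)
ctx_tokens = rng.normal(size=(12, 8))
codes = rng.normal(size=(4, 8))
candidates = rng.normal(size=(5, 8))

bi_scores = [bi_encoder_score(ctx_tokens, c) for c in candidates]
poly_scores = [poly_encoder_score(ctx_tokens, c, codes) for c in candidates]
best_candidate = int(np.argmax(poly_scores))
```

In this sketch the only candidate-dependent work in the Poly-encoder is attention over m vectors plus one dot product, so candidate embeddings can still be precomputed as with a Bi-encoder, while the context representation is allowed to vary with the candidate.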

Code

sfzhou5678/PolyEncoder

248

chijames/Poly-Encoder

161

csong27/collision-bert

llStringll/Poly-encoders

fangrouli/Document-embedding-genera…

See all 7 implementations

Tasks

Conversational Response Selection

Sentence

Datasets

ConvAI2

Douban

UDC

DSTC7 Task 1

RRS Ranking Test

Results from the Paper

Ranked #1 on Conversational Response Selection on RRS Ranking Test

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • ReLU • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
