TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Textual Similarity	MRPC	GenSen	Accuracy	78.6%	# 35
Semantic Textual Similarity	MRPC	GenSen	F1	84.4%	# 15
Natural Language Inference	MultiNLI	GenSen	Matched	71.4	# 47
Natural Language Inference	MultiNLI	GenSen	Mismatched	71.3	# 38
Paraphrase Identification	Quora Question Pairs	GenSen	Accuracy	87.01	# 18
Semantic Textual Similarity	SentEval	GenSen	MRPC	78.6/84.4	# 1
Semantic Textual Similarity	SentEval	GenSen	SICK-R	0.888	# 1
Semantic Textual Similarity	SentEval	GenSen	SICK-E	87.8	# 1
Semantic Textual Similarity	SentEval	GenSen	STS	78.9/78.6	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-general-purpose-distributed-sentence/semantic-textual-similarity-on-senteval)](https://paperswithcode.com/sota/semantic-textual-similarity-on-senteval?p=learning-general-purpose-distributed-sentence)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-general-purpose-distributed-sentence/paraphrase-identification-on-quora-question)](https://paperswithcode.com/sota/paraphrase-identification-on-quora-question?p=learning-general-purpose-distributed-sentence)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-general-purpose-distributed-sentence/semantic-textual-similarity-on-mrpc)](https://paperswithcode.com/sota/semantic-textual-similarity-on-mrpc?p=learning-general-purpose-distributed-sentence)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-general-purpose-distributed-sentence/natural-language-inference-on-multinli)](https://paperswithcode.com/sota/natural-language-inference-on-multinli?p=learning-general-purpose-distributed-sentence)`

Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

ICLR 2018 · Sandeep Subramanian, Adam Trischler, Yoshua Bengio, Christopher J. Pal ·

A lot of the recent success in natural language processing (NLP) has been driven by distributed vector representations of words trained on large amounts of text in an unsupervised manner. These representations are typically used as general purpose features for words across a range of NLP problems. However, extending this success to learning representations of sequences of words, such as sentences, remains an open problem. Recent work has explored unsupervised as well as supervised learning techniques with different training objectives to learn general purpose fixed-length sentence representations. In this work, we present a simple, effective multi-task learning framework for sentence representations that combines the inductive biases of diverse training objectives in a single model. We train this model on several data sources with multiple training objectives on over 100 million sentences. Extensive experiments demonstrate that sharing a single recurrent sentence encoder across weakly related tasks leads to consistent improvements over previous methods. We present substantial improvements in the context of transfer learning and low-resource settings using our learned general-purpose representations.

PDF Abstract ICLR 2018 PDF ICLR 2018 Abstract

Code

Add Remove Mark official

facebookresearch/SentEval official

2,049

Maluuba/gensen official

310

facebookresearch/InferSent

2,279

najafmurtaza/Developing-Machine-Lea…

Tasks

Add Remove

Multi-Task Learning

Natural Language Inference

Paraphrase Identification

Semantic Textual Similarity

Sentence

Transfer Learning

Datasets

GLUE

MultiNLI

SNLI

MRPC

BookCorpus

SentEval

Quora

Quora Question Pairs

Results from the Paper

Edit

Ranked #1 on Semantic Textual Similarity on SentEval

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Textual Similarity	MRPC	GenSen	Accuracy	78.6%	# 35	Compare
Semantic Textual Similarity	MRPC	GenSen	F1	84.4%	# 15	Compare
Natural Language Inference	MultiNLI	GenSen	Matched	71.4	# 47	Compare
Natural Language Inference	MultiNLI	GenSen	Mismatched	71.3	# 38	Compare
Paraphrase Identification	Quora Question Pairs	GenSen	Accuracy	87.01	# 18	Compare
Semantic Textual Similarity	SentEval	GenSen	MRPC	78.6/84.4	# 1	Compare
			SICK-R	0.888	# 1	Compare
			SICK-E	87.8	# 1	Compare
			STS	78.9/78.6	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove