TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Textual Similarity	SICK	IS-BERT-NLI	Spearman Correlation	0.6425	# 21
Semantic Textual Similarity	STS12	IS-BERT-NLI	Spearman Correlation	0.5677	# 21
Semantic Textual Similarity	STS13	IS-BERT-NLI	Spearman Correlation	0.6924	# 23
Semantic Textual Similarity	STS14	IS-BERT-NLI	Spearman Correlation	0.6121	# 21
Semantic Textual Similarity	STS15	IS-BERT-NLI	Spearman Correlation	0.7523	# 20
Semantic Textual Similarity	STS16	IS-BERT-NLI	Spearman Correlation	0.7016	# 20
Semantic Textual Similarity	STS Benchmark	IS-BERT-NLI	Spearman Correlation	0.6921	# 39

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sts15)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts15?p=an-unsupervised-sentence-embedding-method)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sts16)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts16?p=an-unsupervised-sentence-embedding-method)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sick)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sick?p=an-unsupervised-sentence-embedding-method)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sts12)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts12?p=an-unsupervised-sentence-embedding-method)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sts14)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts14?p=an-unsupervised-sentence-embedding-method)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sts13)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts13?p=an-unsupervised-sentence-embedding-method)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-unsupervised-sentence-embedding-method/semantic-textual-similarity-on-sts-benchmark)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts-benchmark?p=an-unsupervised-sentence-embedding-method)`

An Unsupervised Sentence Embedding Method by Mutual Information Maximization

EMNLP 2020 · Yan Zhang, Ruidan He, Zuozhu Liu, Kwan Hui Lim, Lidong Bing ·

BERT is inefficient for sentence-pair tasks such as clustering or semantic search as it needs to evaluate combinatorially many sentence pairs which is very time-consuming. Sentence BERT (SBERT) attempted to solve this challenge by learning semantically meaningful representations of single sentences, such that similarity comparison can be easily accessed. However, SBERT is trained on corpus with high-quality labeled sentence pairs, which limits its application to tasks where labeled data is extremely scarce. In this paper, we propose a lightweight extension on top of BERT and a novel self-supervised learning objective based on mutual information maximization strategies to derive meaningful sentence embeddings in an unsupervised manner. Unlike SBERT, our method is not restricted by the availability of labeled data, such that it can be applied on different domain-specific corpus. Experimental results show that the proposed method significantly outperforms other unsupervised sentence embedding baselines on common semantic textual similarity (STS) tasks and downstream supervised tasks. It also outperforms SBERT in a setting where in-domain labeled data is not available, and achieves performance competitive with supervised methods on various tasks.

PDF Abstract EMNLP 2020 PDF EMNLP 2020 Abstract

Code

Add Remove Mark official

yanzhangnlp/IS-BERT official

Tasks

Add Remove

Clustering

Self-Supervised Learning

Semantic Textual Similarity

Sentence

Sentence Embedding

Sentence-Embedding

Sentence Embeddings

STS

Datasets

SST

MultiNLI

SNLI

SICK

MPQA Opinion Corpus

SentEval STS Benchmark

Results from the Paper

Edit

Ranked #20 on Semantic Textual Similarity on STS16

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Textual Similarity	SICK	IS-BERT-NLI	Spearman Correlation	0.6425	# 21	Compare
Semantic Textual Similarity	STS12	IS-BERT-NLI	Spearman Correlation	0.5677	# 21	Compare
Semantic Textual Similarity	STS13	IS-BERT-NLI	Spearman Correlation	0.6924	# 23	Compare
Semantic Textual Similarity	STS14	IS-BERT-NLI	Spearman Correlation	0.6121	# 21	Compare
Semantic Textual Similarity	STS15	IS-BERT-NLI	Spearman Correlation	0.7523	# 20	Compare
Semantic Textual Similarity	STS16	IS-BERT-NLI	Spearman Correlation	0.7016	# 20	Compare
Semantic Textual Similarity	STS Benchmark	IS-BERT-NLI	Spearman Correlation	0.6921	# 39	Compare

Methods

Add Remove

Adam • Attention Dropout • BERT • Dense Connections • Dropout • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Residual Connection • SBERT • Scaled Dot-Product Attention • Softmax • Weight Decay • WordPiece

Edit Social Preview

An Unsupervised Sentence Embedding Method by Mutual Information Maximization

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove