| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Question Answering | BoolQ | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 86.0 | # 14 |
| Linguistic Acceptability | CoLA | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 86.4 | # 5 |
| Sentiment Analysis | CR | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 92.5 | # 3 |
| Sentiment Analysis | IMDb | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 96.1 | # 5 |
| Sentiment Analysis | MPQA | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 90.8 | # 1 |
| Sentiment Analysis | MR | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 92.5 | # 2 |
| Semantic Textual Similarity | MRPC | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91.0 | # 8 |
| Topic Classification | OS | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 95.1 | # 1 |
| Natural Language Inference | QNLI | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 94.5 | # 15 |
| Paraphrase Identification | Quora Question Pairs | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 89.2 | # 2 |
| Natural Language Inference | RTE | RoBERTa-large 355M + EFL + UCA | Accuracy | 87.2 | # 21 |
| Natural Language Inference | RTE | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 90.5 | # 15 |
| Natural Language Inference | SNLI | EFL (Entailment as Few-shot Learner) + RoBERTa-large | % Test Accuracy | 93.1 | # 1 |
| Natural Language Inference | SNLI | EFL (Entailment as Few-shot Learner) + RoBERTa-large | % Train Accuracy | ? | # 74 |
| Natural Language Inference | SNLI | EFL (Entailment as Few-shot Learner) + RoBERTa-large | Parameters | 355M | # 4 |
| Natural Language Inference | SNLI | RoBERTa-large 355M + Entailment as Few-shot Learner | % Test Accuracy | 93.1 | # 1 |
| Natural Language Inference | SNLI | RoBERTa-large 355M + Entailment as Few-shot Learner | Parameters | 355M | # 1 |
| Sentiment Analysis | SST-2 Binary classification | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 96.9 | # 8 |
| Semantic Textual Similarity | STS Benchmark | RoBERTa-large 355M + Entailment as Few-shot Learner | Pearson Correlation | 0.918 | # 11 |
| Subjectivity Analysis | SUBJ | RoBERTa-large 355M + Entailment as Few-shot Learner | Accuracy | 97.1 | # 3 |