TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Natural Language Inference	MultiNLI	Finetuned Transformer LM	Matched	82.1	# 38
Natural Language Inference	MultiNLI	Finetuned Transformer LM	Mismatched	81.4	# 28
Question Answering	RACE	Finetuned Transformer LM	RACE-m	62.9	# 4
Question Answering	RACE	Finetuned Transformer LM	RACE-h	57.4	# 3
Question Answering	RACE	Finetuned Transformer LM	RACE	59.0	# 4
Natural Language Inference	SciTail	Finetuned Transformer LM	Accuracy	88.3	# 3
Natural Language Inference	SNLI	Fine-Tuned LM-Pretrained Transformer	% Test Accuracy	89.9	# 13
Natural Language Inference	SNLI	Fine-Tuned LM-Pretrained Transformer	% Train Accuracy	96.6	# 5
Natural Language Inference	SNLI	Fine-Tuned LM-Pretrained Transformer	Parameters	85M	# 4
Question Answering	StoryCloze	Finetuned Transformer LM	Accuracy	86.5	# 8

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-language-understanding-by/natural-language-inference-on-scitail)](https://paperswithcode.com/sota/natural-language-inference-on-scitail?p=improving-language-understanding-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-language-understanding-by/question-answering-on-race)](https://paperswithcode.com/sota/question-answering-on-race?p=improving-language-understanding-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-language-understanding-by/question-answering-on-storycloze)](https://paperswithcode.com/sota/question-answering-on-storycloze?p=improving-language-understanding-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-language-understanding-by/natural-language-inference-on-snli)](https://paperswithcode.com/sota/natural-language-inference-on-snli?p=improving-language-understanding-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-language-understanding-by/natural-language-inference-on-multinli)](https://paperswithcode.com/sota/natural-language-inference-on-multinli?p=improving-language-understanding-by)`

Improving Language Understanding by Generative Pre-Training

Preprint 2018 · Alec Radford, Karthik Narasimhan, Tim Salimans, Ilya Sutskever

Natural language understanding comprises a wide range of diverse tasks such as textual entailment, question answering, semantic similarity assessment, and document classification. Although large unlabeled text corpora are abundant, labeled data for learning these specific tasks is scarce, making it challenging for discriminatively trained models to perform adequately. We demonstrate that large gains on these tasks can be realized by generative pre-training of a language model on a diverse corpus of unlabeled text, followed by discriminative fine-tuning on each specific task. In contrast to previous approaches, we make use of task-aware input transformations during fine-tuning to achieve effective transfer while requiring minimal changes to the model architecture. We demonstrate the effectiveness of our approach on a wide range of benchmarks for natural language understanding. Our general task-agnostic model outperforms discriminatively trained models that use architectures specifically crafted for each task, significantly improving upon the state of the art in 9 out of the 12 tasks studied. For instance, we achieve absolute improvements of 8.9% on commonsense reasoning (Stories Cloze Test), 5.7% on question answering (RACE), and 1.5% on textual entailment (MultiNLI).
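The "task-aware input transformations" mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each structured input (a premise/hypothesis pair, a sentence pair, or a context with candidate answers) is flattened into a single token sequence bracketed by start, delimiter, and extract markers before being fed to the pre-trained model. The marker strings and function names below are hypothetical placeholders chosen for illustration, not identifiers from the authors' code.

```python
from typing import List

# Hypothetical marker strings; a real implementation would use dedicated
# token ids added to the vocabulary rather than literal text.
START, DELIM, EXTRACT = "<s>", "<$>", "<e>"


def entailment_input(premise: str, hypothesis: str) -> str:
    """Flatten a premise/hypothesis pair into one ordered sequence."""
    return f"{START} {premise} {DELIM} {hypothesis} {EXTRACT}"


def similarity_inputs(text_a: str, text_b: str) -> List[str]:
    """Similarity has no inherent ordering, so build both orderings."""
    return [entailment_input(text_a, text_b), entailment_input(text_b, text_a)]


def multiple_choice_inputs(context: str, question: str, answers: List[str]) -> List[str]:
    """Build one sequence per candidate answer; each is scored independently."""
    return [
        f"{START} {context} {question} {DELIM} {answer} {EXTRACT}"
        for answer in answers
    ]


if __name__ == "__main__":
    print(entailment_input("A man is sleeping.", "A person is awake."))
    for seq in multiple_choice_inputs("Tom went home.", "Where did Tom go?", ["home", "school"]):
        print(seq)
```

Because every task is reduced to plain sequences, the same pre-trained model can be reused with only a small task-specific output layer on top, which is the "minimal changes to the model architecture" the abstract refers to.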

Code

huggingface/transformers (125,725 stars)

openai/finetune-transformer-lm (2,087 stars)

PaddlePaddle/FleetX (244 stars)

lvyufeng/bert4ms

milmor/GPT

See all 12 implementations

Tasks

Cloze Test

Document Classification

Language Modelling

Natural Language Inference

Natural Language Understanding

Question Answering

Semantic Similarity

Semantic Textual Similarity

Datasets

GLUE

SST

MultiNLI

SST-2

SNLI

QNLI

MRPC

RACE

StoryCloze

SciTail

Results from the Paper

Ranked #3 on Natural Language Inference on SciTail

Methods

Adam • Attention Dropout • BPE • Cosine Annealing • Dense Connections • Discriminative Fine-Tuning • Dropout • GELU • GPT • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Weight Decay
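Of the methods listed, "Linear Warmup With Cosine Annealing" is easy to show in a few lines. The sketch below is one common formulation, assuming the learning rate rises linearly from zero to a peak over a warmup period and then follows half a cosine wave down to zero. The function name, its parameters, and the values in the usage example are illustrative assumptions, not settings taken from this page.

```python
import math


def lr_at(step: int, total_steps: int, warmup_steps: int, max_lr: float) -> float:
    """Learning rate at a given step under linear warmup plus cosine annealing.

    All arguments are illustrative; pick values to suit the training run.
    """
    if step < warmup_steps:
        # Linear ramp from 0 up to max_lr over the warmup period.
        return max_lr * step / warmup_steps
    # Fraction of the post-warmup schedule completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(1.0, progress)
    # Half cosine wave: max_lr at progress 0, zero at progress 1.
    return 0.5 * max_lr * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Print the schedule at a few points of a hypothetical 100-step run.
    for s in (0, 5, 10, 55, 100):
        print(s, lr_at(s, total_steps=100, warmup_steps=10, max_lr=1.0))
```

A schedule like this is typically evaluated once per optimizer update and the result assigned to the optimizer's learning rate before the step is taken.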
