TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Linguistic Acceptability	CoLA	ERNIE	Accuracy	52.3%	# 35
Relation Extraction	FewRel	ERNIE	F1	88.32	# 2
Relation Extraction	FewRel	ERNIE	Precision	88.49	# 1
Relation Extraction	FewRel	ERNIE	Recall	88.44	# 1
Entity Linking	FIGER	ERNIE	Accuracy	57.19	# 1
Entity Linking	FIGER	ERNIE	Macro F1	76.51	# 1
Entity Linking	FIGER	ERNIE	Micro F1	73.39	# 1
Semantic Textual Similarity	MRPC	ERNIE	Accuracy	88.2%	# 21
Natural Language Inference	MultiNLI	ERNIE	Matched	84.0	# 32
Natural Language Inference	MultiNLI	ERNIE	Mismatched	83.2	# 23
Entity Typing	Open Entity	ERNIE	F1	75.56	# 3
Entity Typing	Open Entity	ERNIE	Precision	78.42	# 3
Entity Typing	Open Entity	ERNIE	Recall	72.9	# 3
Natural Language Inference	QNLI	ERNIE	Accuracy	91.3%	# 29
Paraphrase Identification	Quora Question Pairs	ERNIE	F1	71.2	# 15
Natural Language Inference	RTE	ERNIE	Accuracy	68.8%	# 59
Sentiment Analysis	SST-2 Binary classification	ERNIE	Accuracy	93.5	# 39
Semantic Textual Similarity	STS Benchmark	ERNIE	Pearson Correlation	0.832	# 26
Relation Classification	TACRED	ERNIE	F1	68.0	# 6
Relation Extraction	TACRED	ERNIE	F1	67.97	# 28
Relation Classification	TACRED	BERT	F1	66.0	# 4
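
The F1 rows above can be sanity-checked against the precision and recall rows listed for the same dataset. The following is a minimal sketch, assuming F1 is the harmonic mean of the reported precision and recall; the page does not state the averaging scheme, and the function name `f1_from_pr` is illustrative only.

```python
def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both on the same 0-100 scale)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from the table above.
reported = {
    "Open Entity": (78.42, 72.9, 75.56),
    "FewRel": (88.49, 88.44, 88.32),
}

for dataset, (p, r, f1) in reported.items():
    recomputed = f1_from_pr(p, r)
    print(f"{dataset}: recomputed {recomputed:.2f}, reported {f1:.2f}")
```

Under this assumption the Open Entity figure is reproduced (75.56), while the FewRel recomputation (about 88.46) differs from the reported 88.32. One possible explanation is that the FewRel metrics are averaged per class before being reported, but this page does not say so.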

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/entity-linking-on-figer)](https://paperswithcode.com/sota/entity-linking-on-figer?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/relation-extraction-on-fewrel)](https://paperswithcode.com/sota/relation-extraction-on-fewrel?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/entity-typing-on-open-entity)](https://paperswithcode.com/sota/entity-typing-on-open-entity?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/relation-classification-on-tacred-1)](https://paperswithcode.com/sota/relation-classification-on-tacred-1?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/paraphrase-identification-on-quora-question)](https://paperswithcode.com/sota/paraphrase-identification-on-quora-question?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/semantic-textual-similarity-on-mrpc)](https://paperswithcode.com/sota/semantic-textual-similarity-on-mrpc?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/semantic-textual-similarity-on-sts-benchmark)](https://paperswithcode.com/sota/semantic-textual-similarity-on-sts-benchmark?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/relation-extraction-on-tacred)](https://paperswithcode.com/sota/relation-extraction-on-tacred?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/natural-language-inference-on-qnli)](https://paperswithcode.com/sota/natural-language-inference-on-qnli?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/natural-language-inference-on-multinli)](https://paperswithcode.com/sota/natural-language-inference-on-multinli?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/linguistic-acceptability-on-cola)](https://paperswithcode.com/sota/linguistic-acceptability-on-cola?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/sentiment-analysis-on-sst-2-binary)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-2-binary?p=ernie-enhanced-language-representation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ernie-enhanced-language-representation-with/natural-language-inference-on-rte)](https://paperswithcode.com/sota/natural-language-inference-on-rte?p=ernie-enhanced-language-representation-with)`
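
The badge snippets above all follow one URL pattern, which can be sketched as a small generator. This is an illustration inferred from the rows above, not an official Papers with Code tool; the function name `pwc_badge` and its parameter names are hypothetical.

```python
def pwc_badge(paper_slug: str, task_slug: str) -> str:
    """Build badge Markdown following the pattern of the snippets above."""
    # Image URL: a shields.io endpoint badge fed by the paperswithcode.com badge route.
    badge_url = (
        "https://img.shields.io/endpoint.svg?url="
        f"https://paperswithcode.com/badge/{paper_slug}/{task_slug}"
    )
    # Link target: the leaderboard page for the task, with the paper as the `p` parameter.
    sota_url = f"https://paperswithcode.com/sota/{task_slug}?p={paper_slug}"
    return f"[![PWC]({badge_url})]({sota_url})"


# Slugs copied from the first badge row above.
PAPER = "ernie-enhanced-language-representation-with"
print(pwc_badge(PAPER, "entity-linking-on-figer"))
```

The printed line matches the first badge row; swapping in another task slug from the list (for example `relation-extraction-on-fewrel`) yields the corresponding row.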

ERNIE: Enhanced Language Representation with Informative Entities

ACL 2019 · Zhengyan Zhang, Xu Han, Zhiyuan Liu, Xin Jiang, Maosong Sun, Qun Liu

Neural language representation models such as BERT pre-trained on large-scale corpora can well capture rich semantic patterns from plain text, and be fine-tuned to consistently improve the performance of various NLP tasks. However, the existing pre-trained language models rarely consider incorporating knowledge graphs (KGs), which can provide rich structured knowledge facts for better language understanding. We argue that informative entities in KGs can enhance language representation with external knowledge. In this paper, we utilize both large-scale textual corpora and KGs to train an enhanced language representation model (ERNIE), which can take full advantage of lexical, syntactic, and knowledge information simultaneously. The experimental results have demonstrated that ERNIE achieves significant improvements on various knowledge-driven tasks, and meanwhile is comparable with the state-of-the-art model BERT on other common NLP tasks. The source code of this paper can be obtained from https://github.com/thunlp/ERNIE.

Code

thunlp/ERNIE official

1,402

Mind23-2/MindCode-136

Tasks

Entity Linking

Entity Typing

Knowledge Graphs

Linguistic Acceptability

Natural Language Inference

Paraphrase Identification

Relation Classification

Relation Extraction

Semantic Textual Similarity

Sentiment Analysis

Datasets

GLUE

SST

MultiNLI

SST-2

QNLI

MRPC

CoLA

TACRED

FewRel

FIGER

Quora

Quora Question Pairs

RTE

STS Benchmark

Open Entity

Results from the Paper

Ranked #1 on Entity Linking on FIGER

Methods

Adam • Attention Dropout • BERT • Dense Connections • Dropout • ERNIE • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Weight Decay • WordPiece
