TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT	BLEU	64.11	# 9
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT	METEOR	46.30	# 9
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT	ROUGE	74.57	# 9
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT w/o pretrain	BLEU	62.3	# 10
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT w/o pretrain	METEOR	44.33	# 10
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT w/o pretrain	ROUGE	73	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/kgpt-knowledge-grounded-pre-training-for-data/kg-to-text-generation-on-webnlg-2-0)](https://paperswithcode.com/sota/kg-to-text-generation-on-webnlg-2-0?p=kgpt-knowledge-grounded-pre-training-for-data)`

KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation

EMNLP 2020 · Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang ·

Data-to-text generation has recently attracted substantial interests due to its wide applications. Existing methods have shown impressive performance on an array of tasks. However, they rely on a significant amount of labeled data for each task, which is costly to acquire and thus limits their application to new tasks and domains. In this paper, we propose to leverage pre-training and transfer learning to address this issue. We propose a knowledge-grounded pre-training (KGPT), which consists of two parts, 1) a general knowledge-grounded generation model to generate knowledge-enriched text. 2) a pre-training paradigm on a massive knowledge-grounded text corpus crawled from the web. The pre-trained model can be fine-tuned on various data-to-text generation tasks to generate task-specific text. We adopt three settings, namely fully-supervised, zero-shot, few-shot to evaluate its effectiveness. Under the fully-supervised setting, our model can achieve remarkable gains over the known baselines. Under zero-shot setting, our model without seeing any examples achieves over 30 ROUGE-L on WebNLG while all other baselines fail. Under the few-shot setting, our model only needs about one-fifteenth as many labeled examples to achieve the same level of performance as baseline models. These experiments consistently prove the strong generalization ability of our proposed framework https://github.com/wenhuchen/KGPT.

PDF Abstract EMNLP 2020 PDF EMNLP 2020 Abstract

Code

Add Remove Mark official

wenhuchen/KGPT official

146

Tasks

Add Remove

Data-to-Text Generation

General Knowledge

KG-to-Text Generation

Text Generation

Transfer Learning

Datasets

WebNLG

WikiBio

Results from the Paper

Edit

Ranked #9 on KG-to-Text Generation on WebNLG 2.0 (Unconstrained)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT	BLEU	64.11	# 9	Compare
			METEOR	46.30	# 9	Compare
			ROUGE	74.57	# 9	Compare
KG-to-Text Generation	WebNLG 2.0 (Unconstrained)	KGPT w/o pretrain	BLEU	62.3	# 10	Compare
			METEOR	44.33	# 10	Compare
			ROUGE	73	# 10	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove