HTLM: Hyper-Text Pre-Training and Prompting of Language Models

We introduce HTLM, a hyper-text language model trained on a large-scale web crawl. Modeling hyper-text has a number of advantages: (1) it is easily gathered at scale, (2) it provides rich document-level and end-task-adjacent supervision (e.g. class and id attributes often encode document category information), and (3) it allows for new structured prompting that follows the established semantics of HTML (e.g. to do zero-shot summarization by infilling title tags for a webpage that contains the input text). We show that pretraining with a BART-style denoising loss directly on simplified HTML provides highly effective transfer for a wide range of end tasks and supervision levels. HTLM matches or exceeds the performance of comparably sized text-only LMs for zero-shot prompting and fine-tuning for classification benchmarks, while also setting new state-of-the-art performance levels for zero-shot summarization. We also find that hyper-text prompts provide more value to HTLM, in terms of data efficiency, than plain text prompts do for existing LMs, and that HTLM is highly effective at auto-prompting itself, by simply generating the most likely hyper-text formatting for any available training data. We will release all code and models to support future HTLM research.
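
To make the structured-prompting idea concrete, the sketch below shows zero-shot summarization by title infilling, written against the Hugging Face transformers API. Note that `facebook/bart-large` is only a stand-in checkpoint (the HTLM weights and their hub name are an assumption, not taken from this page); the prompt template follows the paper's description of infilling `<title>` tags.

```python
# Minimal sketch of HTLM-style structured prompting: zero-shot summarization
# by infilling the <title> tag of a page that wraps the input document.
# NOTE: "facebook/bart-large" is a stand-in; the actual HTLM checkpoint name
# is an assumption, not confirmed by this page.
from transformers import BartForConditionalGeneration, BartTokenizer

tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-large")

document = "Scientists confirmed the spacecraft entered orbit after a seven-month cruise."

# Wrap the text in simplified HTML and mask out the title; a BART-style
# denoiser generates the most likely infill, which serves as the summary.
prompt = (
    "<html><head><title><mask></title></head>"
    f"<body>{document}</body></html>"
)

inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=1024)
summary_ids = model.generate(inputs.input_ids, num_beams=4, max_length=64)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```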

Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Table-to-Text Generation | DART | HTLM (fine-tuning) | BLEU | 47.2 | #1 |
| | | | METEOR | 0.39 | #1 |
| | | | TER | 0.44 | #1 |
| | | | Mover | 0.51 | #1 |
| | | | BERT | 0.94 | #1 |
| | | | BLEURT | 0.4 | #1 |
| Table-to-Text Generation | DART | GPT-2-Large (fine-tuning) | BLEU | 47.0 | #2 |
| | | | METEOR | 0.39 | #1 |
| | | | TER | 0.46 | #2 |
| | | | Mover | 0.51 | #1 |
| | | | BERT | 0.94 | #1 |
| | | | BLEURT | 0.4 | #1 |
| Table-to-Text Generation | E2E | HTLM (fine-tuning) | BLEU | 70.3 | #1 |
| | | | NIST | 8.90 | #1 |
| | | | METEOR | 46.3 | #1 |
| | | | ROUGE-L | 70.8 | #1 |
| | | | CIDEr | 2.47 | #1 |
| Table-to-Text Generation | E2E | GPT-2-Large (fine-tuning) | BLEU | 68.5 | #2 |
| | | | NIST | 8.78 | #2 |
| | | | METEOR | 46.0 | #2 |
| | | | ROUGE-L | 69.9 | #2 |
| | | | CIDEr | 2.45 | #2 |
| Data-to-Text Generation | WebNLG | HTLM (fine-tuning) | BLEU | 65.4 | #4 |
| Table-to-Text Generation | WebNLG (All) | GPT-2-Large (fine-tuning) | BLEU | 55.5 | #2 |
| | | | METEOR | 0.42 | #1 |
| | | | TER | 0.42 | #1 |
| Table-to-Text Generation | WebNLG (All) | HTLM (fine-tuning) | BLEU | 55.6 | #1 |
| | | | METEOR | 0.42 | #1 |
| | | | TER | 0.4 | #2 |
| Data-to-Text Generation | WebNLG Full | HTLM (prefix 0.1%) | BLEU | 56.3 | #6 |
| Table-to-Text Generation | WebNLG (Seen) | HTLM (fine-tuning) | BLEU | 65.4 | #1 |
| | | | METEOR | 0.46 | #1 |
| | | | TER | 0.33 | #1 |
| Table-to-Text Generation | WebNLG (Seen) | GPT-2-Large (fine-tuning) | BLEU | 65.3 | #2 |
| | | | METEOR | 0.46 | #1 |
| | | | TER | 0.33 | #1 |
| Table-to-Text Generation | WebNLG (Unseen) | HTLM (fine-tuning) | BLEU | 48.4 | #1 |
| | | | METEOR | 0.39 | #1 |
| | | | TER | 0.51 | #2 |
| Table-to-Text Generation | WebNLG (Unseen) | GPT-2-Large (fine-tuning) | BLEU | 43.1 | #2 |
| | | | METEOR | 0.38 | #2 |
| | | | TER | 0.53 | #1 |
