TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Abstractive Text Summarization	CNN / Daily Mail	UniLMv2	ROUGE-1	43.16	# 24
Abstractive Text Summarization	CNN / Daily Mail	UniLMv2	ROUGE-2	20.42	# 22
Abstractive Text Summarization	CNN / Daily Mail	UniLMv2	ROUGE-L	40.14	# 25
Question Generation	SQuAD1.1	UniLMv2	BLEU-4	24.43	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unilmv2-pseudo-masked-language-models-for/question-generation-on-squad11)](https://paperswithcode.com/sota/question-generation-on-squad11?p=unilmv2-pseudo-masked-language-models-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unilmv2-pseudo-masked-language-models-for/abstractive-text-summarization-on-cnn-daily)](https://paperswithcode.com/sota/abstractive-text-summarization-on-cnn-daily?p=unilmv2-pseudo-masked-language-models-for)`

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

28 Feb 2020 · Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Songhao Piao, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM). Given an input text with masked tokens, we rely on conventional masks to learn inter-relations between corrupted tokens and context via autoencoding, and pseudo masks to learn intra-relations between masked spans via partially autoregressive modeling. With well-designed position embeddings and self-attention masks, the context encodings are reused to avoid redundant computation. Moreover, conventional masks used for autoencoding provide global masking information, so that all the position embeddings are accessible in partially autoregressive language modeling. In addition, the two tasks pre-train a unified language model as a bidirectional encoder and a sequence-to-sequence decoder, respectively. Our experiments show that the unified language models pre-trained using PMLM achieve new state-of-the-art results on a wide range of natural language understanding and generation tasks across several widely used benchmarks.
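As a rough illustration of the input layout the abstract describes, the sketch below builds a pseudo-masked sequence in Python. It is a simplified reading, not the authors' implementation: the function name `build_pseudo_masked_input`, the `[M]` and `[P]` token strings, and the choice to append each span's pseudo masks followed by its original tokens are all assumptions made for this example. The self-attention mask that controls which appended tokens may see each other is omitted. The point it shows is that pseudo masks and the appended original tokens reuse the position ids of the tokens they stand for, so the context tokens are encoded only once.

```python
from typing import List, Tuple

MASK = "[M]"    # conventional mask, used for the autoencoding objective (assumed name)
PSEUDO = "[P]"  # pseudo mask, used for the partially autoregressive objective (assumed name)


def build_pseudo_masked_input(
    tokens: List[str], spans: List[Tuple[int, int]]
) -> Tuple[List[str], List[int]]:
    """Return (input_tokens, position_ids) for a hypothetical pseudo-masked layout.

    `spans` are non-overlapping half-open (start, end) index ranges, listed in
    the order in which the spans are to be predicted.
    """
    masked = set()
    for start, end in spans:
        masked.update(range(start, end))

    # Corrupted context: every masked token is replaced in place by [M].
    out_tokens = [MASK if i in masked else tok for i, tok in enumerate(tokens)]
    positions = list(range(len(tokens)))

    for start, end in spans:
        # Pseudo masks share the position ids of the tokens they stand for.
        out_tokens.extend([PSEUDO] * (end - start))
        positions.extend(range(start, end))
        # The original tokens are appended with the same position ids so that
        # later spans can condition on them.
        out_tokens.extend(tokens[start:end])
        positions.extend(range(start, end))

    return out_tokens, positions


example_tokens, example_positions = build_pseudo_masked_input(
    ["x1", "x2", "x3", "x4", "x5", "x6"], [(3, 5), (1, 2)]
)
print(example_tokens)
print(example_positions)
```

In this example the span covering `x4 x5` is predicted first and `x2` second; the order of `spans` stands in for the factorization order, which a real training setup would sample.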

Code

microsoft/unilm (official), 18,315 stars

microsoft/dialoglm, 132 stars

facebookresearch/data2vec_vision

Tasks

Abstractive Text Summarization

Language Modelling

Natural Language Understanding

Position

Question Generation

Datasets

GLUE

SST

SQuAD

SST-2

QNLI

MRPC

CoLA

CNN/Daily Mail

BookCorpus

Results from the Paper

Ranked #4 on Question Generation on SQuAD1.1 (using extra training data)
