TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text Summarization	arXiv Summarization Dataset	PRIMER	ROUGE-1	47.6	# 1
Text Summarization	arXiv Summarization Dataset	PRIMER	ROUGE-2	20.8	# 1
Text Summarization	arXiv Summarization Dataset	PRIMER	ROUGE-L	42.6	# 1
Multi-Document Summarization	Multi-News	PRIMER	ROUGE-2	21.1	# 1
Multi-Document Summarization	Multi-News	PRIMER	ROUGE-1	49.9	# 1
Multi-Document Summarization	Multi-News	PRIMER	ROUGE-L	25.9	# 1
Multi-Document Summarization	WCEP	PRIMER	ROUGE-1	46.1	# 1
Multi-Document Summarization	WCEP	PRIMER	ROUGE-2	25.2	# 1
Multi-Document Summarization	WCEP	PRIMER	ROUGE-L	37.9	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/primer-pyramid-based-masked-sentence-pre/text-summarization-on-arxiv-summarization)](https://paperswithcode.com/sota/text-summarization-on-arxiv-summarization?p=primer-pyramid-based-masked-sentence-pre)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/primer-pyramid-based-masked-sentence-pre/multi-document-summarization-on-multi-news)](https://paperswithcode.com/sota/multi-document-summarization-on-multi-news?p=primer-pyramid-based-masked-sentence-pre)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/primer-pyramid-based-masked-sentence-pre/multi-document-summarization-on-wcep)](https://paperswithcode.com/sota/multi-document-summarization-on-wcep?p=primer-pyramid-based-masked-sentence-pre)`

PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

ACL 2022 · Wen Xiao, Iz Beltagy, Giuseppe Carenini, Arman Cohan ·

We introduce PRIMERA, a pre-trained model for multi-document representation with a focus on summarization that reduces the need for dataset-specific architectures and large amounts of fine-tuning labeled data. PRIMERA uses our newly proposed pre-training objective designed to teach the model to connect and aggregate information across documents. It also uses efficient encoder-decoder transformers to simplify the processing of concatenated input documents. With extensive experiments on 6 multi-document summarization datasets from 3 different domains on zero-shot, few-shot and full-supervised settings, PRIMERA outperforms current state-of-the-art dataset-specific and pre-trained models on most of these settings with large margins. The code and pre-trained models can be found at \url{https://github.com/allenai/PRIMER}.

PDF Abstract ACL 2022 PDF ACL 2022 Abstract

Code

Add Remove Mark official

allenai/primer official

148

allenai/open-mds

↳ Quickstart in

Colab

Tasks

Add Remove

Abstractive Text Summarization

Document Summarization

Multi-Document Summarization

Sentence

Text Summarization

Datasets

Multi-News

WikiSum Arxiv HEP-TH citation graph

WCEP NewSHead arXiv Summarization Dataset

Results from the Paper

Edit

Ranked #1 on Multi-Document Summarization on Multi-News

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text Summarization	arXiv Summarization Dataset	PRIMER	ROUGE-1	47.6	# 1	Compare
			ROUGE-2	20.8	# 1	Compare
			ROUGE-L	42.6	# 1	Compare
Multi-Document Summarization	Multi-News	PRIMER	ROUGE-2	21.1	# 1	Compare
			ROUGE-1	49.9	# 1	Compare
			ROUGE-L	25.9	# 1	Compare
Multi-Document Summarization	WCEP	PRIMER	ROUGE-1	46.1	# 1	Compare
			ROUGE-2	25.2	# 1	Compare
			ROUGE-L	37.9	# 1	Compare

Methods

Add Remove

AdamW • Attention Dropout • Dense Connections • Dilated Sliding Window Attention • Dropout • GELU • Global and Sliding Window Attention • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Longformer • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Sliding Window Attention • Softmax • Weight Decay • WordPiece

Edit Social Preview

PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove