Multi-Document Summarization
110 papers with code • 5 benchmarks • 15 datasets
Multi-Document Summarization is the task of representing a set of documents with a short piece of text that captures the relevant information and filters out the redundant. Two prominent approaches are extractive and abstractive summarization: extractive systems select salient snippets, sentences, or passages directly from the documents, while abstractive systems concisely paraphrase the documents' content.
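The extractive approach can be sketched with a classic centroid-style baseline: score every sentence from every document by its similarity to a TF-IDF centroid of the whole collection, then keep the top-scoring sentences. This is an illustrative sketch only (the function names and the toy IDF computed over the sentence set are ours, not from any particular paper):

```python
import math
import re
from collections import Counter

def tf_idf_vectors(sentences):
    """Build a sparse TF-IDF vector per sentence (toy IDF over the sentence set)."""
    tokenized = [re.findall(r"[a-z']+", s.lower()) for s in sentences]
    df = Counter(w for toks in tokenized for w in set(toks))
    n = len(sentences)
    vecs = []
    for toks in tokenized:
        tf = Counter(toks)
        vecs.append({w: tf[w] * math.log(n / df[w]) for w in tf})
    return vecs

def cosine(u, v):
    """Cosine similarity between two sparse vectors (dicts of word -> weight)."""
    dot = sum(u[w] * v.get(w, 0.0) for w in u)
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def extractive_summary(documents, k=2):
    """Keep the k sentences most similar to the collection centroid."""
    sentences = [s.strip()
                 for d in documents
                 for s in re.split(r"(?<=[.!?])\s+", d) if s.strip()]
    vecs = tf_idf_vectors(sentences)
    centroid = Counter()
    for v in vecs:
        centroid.update(v)  # sum the per-sentence vectors
    ranked = sorted(range(len(sentences)),
                    key=lambda i: cosine(vecs[i], centroid), reverse=True)
    chosen = sorted(ranked[:k])  # restore original document order
    return " ".join(sentences[i] for i in chosen)
```

Real extractive systems add redundancy control (e.g. penalizing sentences similar to ones already selected), which this sketch omits for brevity.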
Source: Multi-Document Summarization using Distributed Bag-of-Words Model
Most implemented papers
Bottom-Up Abstractive Summarization
We use this selector as a bottom-up attention step to constrain the model to likely phrases.
Generating Wikipedia by Summarizing Long Sequences
We show that generating English Wikipedia articles can be approached as a multi-document summarization of source documents.
Scoring Sentence Singletons and Pairs for Abstractive Summarization
There is a crucial gap between sentence selection and fusion: summarization should support both compressing single sentences and fusing pairs.
PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
We introduce PRIMERA, a pre-trained model for multi-document representation with a focus on summarization that reduces the need for dataset-specific architectures and large amounts of fine-tuning labeled data.
LongT5: Efficient Text-To-Text Transformer for Long Sequences
Recent work has shown that either (1) increasing the input length or (2) increasing model size can improve the performance of Transformer-based neural models.
Centroid-based Text Summarization through Compositionality of Word Embeddings
Textual similarity is a crucial aspect of many extractive text summarization methods.
Leveraging Graph to Improve Abstractive Multi-Document Summarization
Graphs that capture relations between textual units have great benefits for detecting salient information from multiple documents and generating overall coherent summaries.
Pre-training via Paraphrasing
The objective noisily captures aspects of paraphrase, translation, multi-document summarization, and information retrieval, allowing for strong zero-shot performance on several tasks.
Generating (Factual?) Narrative Summaries of RCTs: Experiments with Neural Multi-Document Summarization
We enlist medical professionals to evaluate generated summaries, and we find that modern summarization systems yield consistently fluent and relevant synopses, but that they are not always factual.
Global-aware Beam Search for Neural Abstractive Summarization
A global scoring mechanism is then developed to regulate beam search to generate summaries in a near-global optimal fashion.
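For context, the baseline that such global scoring mechanisms modify is standard beam search, which keeps only the highest-scoring partial sequences at each decoding step and scores them locally by cumulative log-probability. A generic sketch under our own names (this is not the paper's global scoring method, just the vanilla procedure it builds on):

```python
def beam_search(start, score_fn, expand_fn, beam_size=3, max_len=5):
    """Generic beam search.

    Keeps the beam_size highest-scoring partial sequences at each step.
    score_fn(prefix, token) returns the log-probability of extending the
    prefix with token; expand_fn(prefix) lists candidate next tokens.
    Returns the final beams as (sequence, cumulative log-prob) pairs,
    sorted best-first.
    """
    beams = [([start], 0.0)]
    for _ in range(max_len):
        candidates = []
        for seq, logp in beams:
            for tok in expand_fn(seq):
                candidates.append((seq + [tok], logp + score_fn(seq, tok)))
        if not candidates:
            break
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_size]  # prune to the best beam_size prefixes
    return beams

# Toy demo: two-token vocabulary with fixed log-probabilities.
toy_scores = {"a": -0.1, "b": -0.5}
beams = beam_search("<s>",
                    lambda seq, tok: toy_scores[tok],
                    lambda seq: ["a", "b"],
                    beam_size=2, max_len=3)
```

Because the score is a sum of per-step log-probabilities, plain beam search is greedy with respect to the current prefix; global-aware variants rescore beams with additional signals so that the search better approximates a globally optimal summary.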