Multi-Document Summarization

93 papers with code • 5 benchmarks • 15 datasets

Multi-Document Summarization is a process of representing a set of documents with a short piece of text by capturing the relevant information and filtering out the redundant information. Two prominent approaches to Multi-Document Summarization are extractive and abstractive summarization. Extractive summarization systems aim to extract salient snippets, sentences or passages from documents, while abstractive summarization systems aim to concisely paraphrase the content of the documents.

Source: Multi-Document Summarization using Distributed Bag-of-Words Model

Latest papers with no code

LLM Based Multi-Document Summarization Exploiting Main-Event Biased Monotone Submodular Content Extraction

no code yet • 5 Oct 2023

Multi-document summarization is a challenging task due to its inherent subjective bias, highlighted by the low inter-annotator ROUGE-1 score of 0. 4 among DUC-2004 reference summaries.

Multi-document Summarization: A Comparative Evaluation

no code yet • 10 Sep 2023

This work serves as a reference for future MDS research and contributes to the development of accurate and robust models which can be utilized on demanding datasets with academically and/or scientifically complex data as well as generalized, relatively simple datasets.

Unsupervised Multi-document Summarization with Holistic Inference

no code yet • 8 Sep 2023

SRI balances the importance and diversity of a subset of sentences from the source documents and can be calculated in unsupervised and adaptive manners.

Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization

no code yet • 7 Jun 2023

In this paper, we consider the unsupervised abstractive MDS setting where there are only documents with no groundtruh summaries provided, and we propose Absformer, a new Transformer-based method for unsupervised abstractive summary generation.

LBMT team at VLSP2022-Abmusu: Hybrid method with text correlation and generative models for Vietnamese multi-document summarization

no code yet • 11 Apr 2023

Multi-document summarization is challenging because the summaries should not only describe the most important information from all documents but also provide a coherent interpretation of the documents.

Automatically Summarizing Evidence from Clinical Trials: A Prototype Highlighting Current Challenges

no code yet • 7 Mar 2023

We present TrialsSummarizer, a system that aims to automatically summarize evidence presented in the set of randomized controlled trials most relevant to a given query.

Mining both Commonality and Specificity from Multiple Documents for Multi-Document Summarization

no code yet • 5 Mar 2023

The multi-document summarization task requires the designed summarizer to generate a short text that covers the important information of original documents and satisfies content diversity.

Do Multi-Document Summarization Models Synthesize?

no code yet • 31 Jan 2023

In this paper we ask: To what extent do modern multi-document summarization models implicitly perform this type of synthesis?

Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval

no code yet • 20 Dec 2022

Via extensive automatic and human evaluation, we determine: (1) state-of-the-art summarizers suffer large reductions in performance when applied to open-domain MDS, (2) additional training in the open-domain setting can reduce this sensitivity to imperfect retrieval, and (3) summarizers are insensitive to the retrieval of duplicate documents and the order of retrieved documents, but highly sensitive to other errors, like the retrieval of irrelevant documents.

Combining State-of-the-Art Models with Maximal Marginal Relevance for Few-Shot and Zero-Shot Multi-Document Summarization

no code yet • 19 Nov 2022

In Natural Language Processing, multi-document summarization (MDS) poses many challenges to researchers above those posed by single-document summarization (SDS).