Pre-training Meets Clustering: A Hybrid Extractive Multi-document Summarization Model

International Conference on Hybrid Intelligent Systems 2023 · Akanksha Karotia, Seba Susan

With vast amounts of information now available on the Internet, manually extracting and digesting the relevant parts is difficult and time-consuming. An automated document summarization tool is therefore needed to extract the important information from a set of documents on similar or related subjects. Multi-document summarization retrieves important and relevant content from multiple documents while minimizing redundancy. In this study, we develop a multi-document text summarization system using an unsupervised extractive approach. The proposed model fuses two learning paradigms: the pre-trained T5 transformer model and the K-Means clustering algorithm. We perform the experiments on the benchmark news article corpus of the Document Understanding Conference (DUC 2004) and use the ROUGE evaluation metrics to measure performance. The results show that our proposed model substantially outperforms existing unsupervised state-of-the-art approaches.
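The abstract does not spell out how the T5 model and K-Means are combined. One common cluster-then-select pattern for unsupervised extractive summarization can be sketched as follows: embed each sentence, cluster the embeddings, and keep the sentence nearest each centroid. This is a minimal sketch under stated assumptions, not the authors' pipeline. To stay dependency-free it swaps in bag-of-words count vectors where the paper would use T5 representations, and the names `embed`, `kmeans`, and `summarize` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase word tokens; a deliberately simple tokenizer."""
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(sentence, vocab):
    """Bag-of-words count vector. ASSUMPTION: a stand-in for a T5 sentence embedding."""
    counts = Counter(tokenize(sentence))
    return [float(counts[w]) for w in vocab]


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kmeans(vectors, k, iterations=20):
    """Plain Lloyd's k-means, seeded deterministically with the first k vectors."""
    centroids = [list(v) for v in vectors[:k]]
    assignment = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: attach every vector to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: distance(v, centroids[c])) for v in vectors
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assignment


def summarize(sentences, k):
    """Return one representative sentence per cluster, in original order."""
    vocab = sorted({w for s in sentences for w in tokenize(s)})
    vectors = [embed(s, vocab) for s in sentences]
    centroids, assignment = kmeans(vectors, k)
    chosen = set()
    for c in range(k):
        members = [i for i, a in enumerate(assignment) if a == c]
        if members:
            chosen.add(min(members, key=lambda i: distance(vectors[i], centroids[c])))
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    docs = [
        "The storm flooded the coastal town.",
        "The election results were announced today.",
        "Flood waters from the storm damaged the town.",
        "Officials announced the election results.",
    ]
    for line in summarize(docs, k=2):
        print(line)
```

Picking one sentence per cluster is what limits redundancy here: sentences covering the same topic fall into the same cluster, and only one of them reaches the summary. Replacing `embed` with vectors from a pre-trained encoder would bring the sketch closer to the hybrid model described above.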

Code

Akankshakarotia/Pre-training-meets-… official

Tasks

Clustering

Document Summarization

document understanding

Extractive Text Summarization

Multi-Document Summarization

Retrieval

Text Summarization

Unsupervised Text Summarization

Datasets

DUC 2004

Results from the Paper

Ranked #1 on Extractive Text Summarization on DUC 2004

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Extractive Text Summarization	DUC 2004	Pre-training-meets-Clustering-A-Hybrid-Extractive-Multi-Document-Summarization-Model	Test ROUGE-1	34.013	# 1
Extractive Text Summarization	DUC 2004	Pre-training-meets-Clustering-A-Hybrid-Extractive-Multi-Document-Summarization-Model	Test ROUGE-2	8.266	# 1
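
For readers unfamiliar with the metrics in the table, ROUGE-N measures n-gram overlap between a candidate summary and a reference summary. The sketch below computes ROUGE-N recall for a single reference. It is an illustration under assumptions: the listing does not say which ROUGE variant (recall, precision, or F-score) or which preprocessing produced the reported values, and the official toolkit adds options such as stemming and multiple references that are omitted here. The function names `ngrams` and `rouge_n_recall` are hypothetical.

```python
import re
from collections import Counter


def ngrams(text, n):
    """Count the n-grams of a lowercased, word-tokenized text."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n):
    """Fraction of reference n-grams that also appear in the candidate (clipped counts)."""
    ref = ngrams(reference, n)
    cand = ngrams(candidate, n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    candidate = "the cat lay on the mat"
    print(rouge_n_recall(candidate, reference, 1))  # 5 of 6 reference unigrams matched
    print(rouge_n_recall(candidate, reference, 2))  # 3 of 5 reference bigrams matched
```

Benchmark tables usually report these scores scaled by 100, which is presumably the scale of the values above.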

Methods

Adafactor • Attention Dropout • BPE • Dense Connections • Dropout • GELU • GLU • Inverse Square Root Schedule • k-Means Clustering • Layer Normalization • Linear Layer • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • SentencePiece • Softmax • T5
