TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Email Thread Summarization	EmailSum (long)	Oracle	ROUGE-1	45.98	# 1
Email Thread Summarization	EmailSum (long)	Oracle	ROUGE-2	15.49	# 1
Email Thread Summarization	EmailSum (long)	Oracle	ROUGE-L	32.4	# 1
Email Thread Summarization	EmailSum (long)	Oracle	RLsum	42.14	# 1
Email Thread Summarization	EmailSum (long)	Oracle	BertS	26.31	# 3
Email Thread Summarization	EmailSum (long)	T5base	ROUGE-1	43.81	# 3
Email Thread Summarization	EmailSum (long)	T5base	ROUGE-2	14.08	# 2
Email Thread Summarization	EmailSum (long)	T5base	ROUGE-L	30.47	# 3
Email Thread Summarization	EmailSum (long)	T5base	RLsum	39.88	# 3
Email Thread Summarization	EmailSum (long)	T5base	BertS	32.09	# 2
Email Thread Summarization	EmailSum (long)	SemiSuptogether	ROUGE-1	44.08	# 2
Email Thread Summarization	EmailSum (long)	SemiSuptogether	ROUGE-2	14.06	# 3
Email Thread Summarization	EmailSum (long)	SemiSuptogether	ROUGE-L	31.17	# 2
Email Thread Summarization	EmailSum (long)	SemiSuptogether	RLsum	40.67	# 2
Email Thread Summarization	EmailSum (long)	SemiSuptogether	BertS	32.3	# 1
Email Thread Summarization	EmailSum (short)	SemiSuptogether	ROUGE-1	36.98	# 2
Email Thread Summarization	EmailSum (short)	SemiSuptogether	ROUGE-2	11.21	# 2
Email Thread Summarization	EmailSum (short)	SemiSuptogether	ROUGE-L	28.76	# 2
Email Thread Summarization	EmailSum (short)	SemiSuptogether	RLsum	33.7	# 2
Email Thread Summarization	EmailSum (short)	SemiSuptogether	BertS	33.91	# 1
Email Thread Summarization	EmailSum (short)	T5base	ROUGE-1	36.57	# 3
Email Thread Summarization	EmailSum (short)	T5base	ROUGE-2	10.56	# 3
Email Thread Summarization	EmailSum (short)	T5base	ROUGE-L	28.3	# 3
Email Thread Summarization	EmailSum (short)	T5base	RLsum	32.76	# 3
Email Thread Summarization	EmailSum (short)	T5base	BertS	33.9	# 2
Email Thread Summarization	EmailSum (short)	Oracle	ROUGE-1	39.04	# 1
Email Thread Summarization	EmailSum (short)	Oracle	ROUGE-2	12.47	# 1
Email Thread Summarization	EmailSum (short)	Oracle	ROUGE-L	30.17	# 1
Email Thread Summarization	EmailSum (short)	Oracle	RLsum	35.61	# 1
Email Thread Summarization	EmailSum (short)	Oracle	BertS	22.32	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/emailsum-abstractive-email-thread/email-thread-summarization-on-emailsum-long)](https://paperswithcode.com/sota/email-thread-summarization-on-emailsum-long?p=emailsum-abstractive-email-thread)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/emailsum-abstractive-email-thread/email-thread-summarization-on-emailsum-short)](https://paperswithcode.com/sota/email-thread-summarization-on-emailsum-short?p=emailsum-abstractive-email-thread)`

EmailSum: Abstractive Email Thread Summarization

ACL 2021 · Shiyue Zhang, Asli Celikyilmaz, Jianfeng Gao, Mohit Bansal ·

Recent years have brought about an interest in the challenging task of summarizing conversation threads (meetings, online discussions, etc.). Such summaries help analysis of the long text to quickly catch up with the decisions made and thus improve our work or communication efficiency. To spur research in thread summarization, we have developed an abstractive Email Thread Summarization (EmailSum) dataset, which contains human-annotated short (<30 words) and long (<100 words) summaries of 2549 email threads (each containing 3 to 10 emails) over a wide variety of topics. We perform a comprehensive empirical study to explore different summarization techniques (including extractive and abstractive methods, single-document and hierarchical models, as well as transfer and semisupervised learning) and conduct human evaluations on both short and long summary generation tasks. Our results reveal the key challenges of current abstractive summarization models in this task, such as understanding the sender's intent and identifying the roles of sender and receiver. Furthermore, we find that widely used automatic evaluation metrics (ROUGE, BERTScore) are weakly correlated with human judgments on this email thread summarization task. Hence, we emphasize the importance of human evaluation and the development of better metrics by the community. Our code and summary data have been made available at: https://github.com/ZhangShiyue/EmailSum

PDF Abstract ACL 2021 PDF ACL 2021 Abstract

Code

Add Remove Mark official

ZhangShiyue/EmailSum official

Tasks

Add Remove

Abstractive Text Summarization

Email Thread Summarization

Datasets

Introduced in the Paper:

EmailSum

Used in the Paper:

SAMSum CRD3 Avocado research email collection

Results from the Paper

Edit

Ranked #1 on Email Thread Summarization on EmailSum (short)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Email Thread Summarization	EmailSum (long)	Oracle	ROUGE-1	45.98	# 1	Compare
			ROUGE-2	15.49	# 1	Compare
			ROUGE-L	32.4	# 1	Compare
			RLsum	42.14	# 1	Compare
			BertS	26.31	# 3	Compare
Email Thread Summarization	EmailSum (long)	T5base	ROUGE-1	43.81	# 3	Compare
			ROUGE-2	14.08	# 2	Compare
			ROUGE-L	30.47	# 3	Compare
			RLsum	39.88	# 3	Compare
			BertS	32.09	# 2	Compare
Email Thread Summarization	EmailSum (long)	SemiSuptogether	ROUGE-1	44.08	# 2	Compare
			ROUGE-2	14.06	# 3	Compare
			ROUGE-L	31.17	# 2	Compare
			RLsum	40.67	# 2	Compare
			BertS	32.3	# 1	Compare
Email Thread Summarization	EmailSum (short)	SemiSuptogether	ROUGE-1	36.98	# 2	Compare
			ROUGE-2	11.21	# 2	Compare
			ROUGE-L	28.76	# 2	Compare
			RLsum	33.7	# 2	Compare
			BertS	33.91	# 1	Compare
Email Thread Summarization	EmailSum (short)	T5base	ROUGE-1	36.57	# 3	Compare
			ROUGE-2	10.56	# 3	Compare
			ROUGE-L	28.3	# 3	Compare
			RLsum	32.76	# 3	Compare
			BertS	33.9	# 2	Compare
Email Thread Summarization	EmailSum (short)	Oracle	ROUGE-1	39.04	# 1	Compare
			ROUGE-2	12.47	# 1	Compare
			ROUGE-L	30.17	# 1	Compare
			RLsum	35.61	# 1	Compare
			BertS	22.32	# 3	Compare

Methods

Add Remove

Adafactor • Attention Dropout • BPE • Dense Connections • Dropout • GELU • GLU • Inverse Square Root Schedule • Layer Normalization • Linear Layer • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • SentencePiece • Softmax • T5

Edit Social Preview

EmailSum: Abstractive Email Thread Summarization

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove