CDLM: Cross-Document Language Modeling

We introduce a new pretraining approach geared to multi-document language modeling, incorporating two key ideas into the masked language modeling self-supervised objective. First, instead of considering documents in isolation, we pretrain over sets of multiple related documents, encouraging the model to learn cross-document relationships. Second, we improve over recent long-range transformers by introducing dynamic global attention that has access to the entire input when predicting masked tokens. We release CDLM (Cross-Document Language Model), a new general language model for the multi-document setting that can be easily applied to downstream tasks. Our extensive analysis shows that both ideas are essential to the success of CDLM and work in synergy to set new state-of-the-art results on several multi-text tasks. Code and models are available at https://github.com/aviclu/CDLM.
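To make the dynamic global attention idea concrete, here is a minimal sketch using the HuggingFace Longformer masked-LM API: the masked position is given global attention, so its prediction can condition on the entire concatenated multi-document input rather than only a local window. The base Longformer checkpoint, the example documents, and the masked position are placeholders standing in for the released CDLM weights and pretraining data.

```python
# Minimal sketch: masked-token prediction with global attention at the
# masked position, over a concatenation of related documents.
# Checkpoint, documents, and masked position are illustrative placeholders.
import torch
from transformers import LongformerForMaskedLM, LongformerTokenizer

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerForMaskedLM.from_pretrained("allenai/longformer-base-4096")

# Concatenate a set of related documents into a single input sequence.
docs = ["First related document about some event.",
        "Second related document covering the same event."]
text = f" {tokenizer.sep_token} ".join(docs)
inputs = tokenizer(text, return_tensors="pt")
input_ids = inputs["input_ids"]

# Mask one token and grant that position *global* attention, so the model
# can attend across all documents when filling it in.
masked_pos = 3
input_ids[0, masked_pos] = tokenizer.mask_token_id
global_attention_mask = torch.zeros_like(input_ids)
global_attention_mask[0, masked_pos] = 1  # global attention at the masked token

with torch.no_grad():
    logits = model(input_ids,
                   attention_mask=inputs["attention_mask"],
                   global_attention_mask=global_attention_mask).logits

predicted_id = logits[0, masked_pos].argmax(-1)
print(tokenizer.decode(predicted_id))
```

In standard Longformer usage, global attention is a fixed property of a few special positions; the point of the sketch is that here it is assigned dynamically, to whichever positions are masked in the current example.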

Findings of EMNLP 2021

Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Citation Recommendation | AAN (test) | CDLM | F1 | 88.8 | #1 |
| Citation Recommendation | AAN (test) | Rand CDLM | F1 | 85.7 | #2 |
| Citation Recommendation | AAN (test) | Longformer | F1 | 85.4 | #3 |
| Entity Cross-Document Coreference Resolution | ECB+ (test) | CDLM | CoNLL F1 | 82.9 | #1 |
| Entity Cross-Document Coreference Resolution | ECB+ (test) | Longformer | CoNLL F1 | 80.4 | #3 |
| Event Cross-Document Coreference Resolution | ECB+ (test) | CDLM | CoNLL F1 | 85.6 | #2 |
| Event Cross-Document Coreference Resolution | ECB+ (test) | Yu et al. | CoNLL F1 | 84.4 | #4 |
| Cross-Document Language Modeling | MultiNews (test) | CDLM | Perplexity | 1.76 | #1 |
| Cross-Document Language Modeling | MultiNews (test) | Rand CDLM | Perplexity | 1.93 | #2 |
| Cross-Document Language Modeling | MultiNews (test) | Longformer | Perplexity | 2.34 | #3 |
| Cross-Document Language Modeling | MultiNews (val) | CDLM | Perplexity | 1.69 | #1 |
| Cross-Document Language Modeling | MultiNews (val) | Rand CDLM | Perplexity | 1.88 | #2 |
| Cross-Document Language Modeling | MultiNews (val) | Longformer | Perplexity | 2.03 | #3 |
| Citation Recommendation | OC | CDLM | F1 | 95.3 | #1 |
| Citation Recommendation | OC | Rand CDLM | F1 | 93.5 | #2 |
| Citation Recommendation | OC | Longformer | F1 | 93.4 | #3 |
| Citation Recommendation | PAN | CDLM | F1 | 82.9 | #1 |
| Citation Recommendation | PAN | Longformer | F1 | 80.4 | #2 |
| Citation Recommendation | PAN | Rand CDLM | F1 | 79.4 | #3 |
| Citation Recommendation | S2ORC | CDLM | F1 | 96.5 | #1 |
| Citation Recommendation | S2ORC | Longformer | F1 | 95.8 | #2 |
| Citation Recommendation | S2ORC | Rand CDLM | F1 | 94.6 | #3 |

Perplexity is lower-is-better; F1 and CoNLL F1 are higher-is-better.
