DocRED Dataset | Papers With Code

Name:*

Full name (optional):

Description (Markdown and $\LaTeX$ enabled):*

**DocRED** (Document-Level Relation Extraction Dataset) is a relation extraction dataset constructed from Wikipedia and Wikidata. Each document in the dataset is human-annotated with named entity mentions, coreference information, intra- and inter-sentence relations, and supporting evidence. DocRED requires reading multiple sentences in a document to extract entities and infer their relations by synthesizing all information of the document. Along with the human-annotated data, the dataset provides large-scale distantly supervised data.

DocRED contains 132,375 entities and 56,354 relational facts annotated on 5,053 Wikipedia documents. In addition to the human-annotated data, the dataset provides large-scale distantly supervised data over 101,873 documents.

Source: [DocRED: A Large-Scale Document-Level Relation Extraction Dataset](https://paperswithcode.com/paper/docred-a-large-scale-document-level-relation/)
Image Source: [DocRED: A Large-Scale Document-Level Relation Extraction Dataset](https://paperswithcode.com/paper/docred-a-large-scale-document-level-relation/)

Homepage URL (optional):

Paper where the dataset was introduced:

Introduction date:

Dataset license:

URL to full license terms:

Image

Currently

datasets/DocRED-0000002310-0743d975_JeWVto8.jpg Clear

Change

---

Task	Dataset Variant	Best Model
Relation Extraction	DocRED	DREEAM
Joint Entity and Relation Extraction	DocRED	REBEL+pretraining
Few-Shot Relation Classification	DocRED	DL-MNAV

DocRED

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

Re-DocRED

GDA

FREDo

CDR

Usage

License

Modalities

Languages

DocRED

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit