DWIE (Deutsche Welle corpus for Information Extraction)

Introduced by Zaporojets et al. in "DWIE: an entity-centric dataset for multi-task document-level information extraction"

The Deutsche Welle corpus for Information Extraction (DWIE) is a multi-task dataset that combines four main Information Extraction (IE) annotation sub-tasks: (i) Named Entity Recognition (NER), (ii) Coreference Resolution, (iii) Relation Extraction (RE), and (iv) Entity Linking. DWIE is conceived as an entity-centric dataset: it describes the interactions and properties of conceptual entities at the level of the complete document.

Source: https://arxiv.org/abs/2009.12626
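
To make the entity-centric idea in the description concrete, here is a minimal Python sketch of how one document could carry all four annotation layers at once. It is not DWIE's actual file format: the class names, field names, example sentence, labels ("person", "location", "visited") and link targets are all hypothetical, chosen only for illustration. The point it tries to show is that the type, the link, and the relations attach to the entity (a cluster of coreferent mentions) rather than to each individual mention.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Mention:
    start: int  # character offset where the mention begins
    end: int    # character offset just past the last character
    text: str   # surface form as it appears in the document


@dataclass
class Entity:
    entity_id: int
    entity_type: str                       # NER label, stored once per entity
    mentions: List[Mention] = field(default_factory=list)  # coreference cluster
    kb_link: Optional[str] = None          # entity linking target, if any


@dataclass
class Relation:
    head: int   # entity_id of the subject entity
    tail: int   # entity_id of the object entity
    label: str  # relation type (hypothetical label set)


@dataclass
class Document:
    text: str
    entities: List[Entity] = field(default_factory=list)
    relations: List[Relation] = field(default_factory=list)


def find_mention(text: str, surface: str, start: int = 0) -> Mention:
    """Locate `surface` in `text` at or after `start` and wrap it as a Mention."""
    begin = text.index(surface, start)
    return Mention(begin, begin + len(surface), surface)


def build_example() -> Document:
    """Build a toy document; content and labels are illustrative assumptions."""
    text = "Angela Merkel visited Paris. Merkel met officials there."
    first = find_mention(text, "Angela Merkel")
    second = find_mention(text, "Merkel", first.end)  # later coreferent mention
    merkel = Entity(0, "person", [first, second], kb_link="Angela_Merkel")
    paris = Entity(1, "location", [find_mention(text, "Paris")], kb_link="Paris")
    # The relation connects entities, so it holds for the whole document
    # regardless of which mention appears in which sentence.
    return Document(text, [merkel, paris], [Relation(0, 1, "visited")])


doc = build_example()
```

In this sketch both mentions of the first entity share a single type and a single link, which is one way to read "entity-centric" annotation at document level. A mention-centric layout would instead repeat the label on every span.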

License


  • GPL-3.0 License
