Datasets > Modality > Texts > PreCo

A large-scale English dataset for coreference resolution. The dataset is designed to embody the core challenges in coreference, such as entity representation, by alleviating the challenge of low overlap between training and test sets and enabling separated analysis of mention detection and mention clustering.

Source: PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution

Samples

Modalities

Languages

Tasks