GAP Coreference Dataset

Introduced by Webster et al. in Mind the GAP: A Balanced Corpus of Gendered Ambiguous Pronouns

GAP is a gender-balanced dataset of 8,908 coreference-labeled (ambiguous pronoun, antecedent name) pairs. The examples were sampled from Wikipedia, and the dataset was released by Google AI Language for evaluating coreference resolution in practical applications.

Source: GAP Coreference Dataset
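
To make the unit of annotation concrete, the following is a minimal Python sketch of a labeled (ambiguous pronoun, antecedent name) pair, together with a simple check of gender balance over pronouns. The `LabeledPair` class, its field names, the pronoun lists, and the example sentences are all illustrative assumptions; they are not the schema or contents of the released dataset.

```python
from collections import Counter
from dataclasses import dataclass


# Hypothetical record type: the field names are assumptions made for
# illustration, not the column names of the released GAP files.
@dataclass(frozen=True)
class LabeledPair:
    text: str         # passage containing the pronoun and the candidate name
    pronoun: str      # the ambiguous pronoun
    name: str         # candidate antecedent name
    coreferent: bool  # True if the pronoun refers to this name


# Assumed pronoun inventories, used only for this sketch.
FEMININE = {"she", "her", "hers"}
MASCULINE = {"he", "him", "his"}


def pronoun_gender(pronoun: str) -> str:
    """Map a pronoun to a coarse gender label (case-insensitive)."""
    p = pronoun.lower()
    if p in FEMININE:
        return "feminine"
    if p in MASCULINE:
        return "masculine"
    return "other"


def gender_counts(pairs):
    """Count labeled pairs by the gender of their pronoun."""
    return Counter(pronoun_gender(pair.pronoun) for pair in pairs)


# Invented sentences, not taken from the dataset.
pairs = [
    LabeledPair("Alice told Mary that she had won.", "she", "Alice", True),
    LabeledPair("Alice told Mary that she had won.", "she", "Mary", False),
    LabeledPair("Bob thanked Tom after he arrived.", "he", "Bob", False),
    LabeledPair("Bob thanked Tom after he arrived.", "he", "Tom", True),
]

counts = gender_counts(pairs)
is_balanced = counts["feminine"] == counts["masculine"]
```

In this toy sample `is_balanced` is true because the feminine and masculine pronoun counts match; the same check could be run over any collection of pairs loaded into this assumed record type.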

License


  • Unknown