IBM Debater Mention Detection Benchmark

Introduced by Mass et al. in "What did you Mention? A Large Scale Mention Detection Benchmark for Spoken and Written Text"

This dataset contains annotations of general and named entities on both clean written text and noisy speech data. It includes 1000 sentences from Wikipedia and 1000 sentences of speech data, the latter appearing in two forms: (1) manually transcribed, and (2) the output of an ASR engine. Each of the datasets includes a total of around 6500 mentions linked to their DBpedia pages.






