FEVER is a publicly available dataset for fact extraction and verification against textual sources.
133 PAPERS • 2 BENCHMARKS
The VitaminC dataset contains more than 450,000 claim-evidence pairs for fact verification and factually consistent generation. It is based on over 100,000 revisions to popular Wikipedia pages, plus additional "synthetic" revisions.
4 PAPERS • NO BENCHMARKS YET
BEIR (Benchmarking IR) is a heterogeneous benchmark containing different information retrieval (IR) tasks. Through BEIR, it is possible to systematically study the zero-shot generalization capabilities of multiple neural retrieval approaches.
3 PAPERS • 18 BENCHMARKS