WikiAnn is a dataset for cross-lingual name tagging and linking based on Wikipedia articles in 295 languages.
49 PAPERS • 7 BENCHMARKS
MasakhaNEWS is a benchmark dataset for news topic classification covering 16 languages widely spoken in Africa.
2 PAPERS • NO BENCHMARKS YET