WikiAnn is a dataset for cross-lingual name tagging and linking based on Wikipedia articles in 295 languages.
49 PAPERS • 7 BENCHMARKS
A parallel corpus of over 300 languages with around 100 thousand parallel sentences per language pair on average.
25 PAPERS • NO BENCHMARKS YET