Taiga is a corpus, where text sources and their meta-information are collected according to popular ML tasks.
4 PAPERS • NO BENCHMARKS YET