This dataset evaluates instruction following ability of large language models. There are 500+ prompts with instructions such as "write an article with more than 800 words", "wrap your response with double quotation marks", etc.
11 PAPERS • 1 BENCHMARK
Collection of news websites in low-resource languages.
1 PAPER • NO BENCHMARKS YET