…While all questions directly relate to the passage, the English dataset on its own proves difficult enough to challenge state-of-the-art language models.
20 PAPERS • NO BENCHMARKS YET