Robust Summarization Evaluation Benchmark is a large human evaluation dataset consisting of over 22k summary-level annotations over state-of-the-art systems on three datasets.
1 PAPER • NO BENCHMARKS YET
…News, Politics, Sports, Weather, Business, Technology, Science, Health, Family, Education, Entertainment and Arts).
27 PAPERS • 6 BENCHMARKS
…177.2 | 4.5 | 11,558 | Theoretical Economics,Applied Economics | | Literature | 2 | 18.8 | 158.2 | 8.3 | 10,501 | Chinese Literature,Journalism | | Art | 1 | 17.8 | 170.8 | 5.4 | 5,201 | Art | | History | 1 | 17.6 | 181.0 | 6.0 | 6,270 | History