Improving ROUGE for Timeline Summarization

EACL 2017 · Sebastian Martschat, Katja Markert

Current evaluation metrics for timeline summarization either ignore the temporal aspect of the task or require strict date matching. We introduce variants of ROUGE that allow alignment of daily summaries via temporal distance or semantic similarity. We argue for the suitability of these variants in a theoretical analysis and demonstrate it in a battery of task-specific tests.
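
To make the contrast between strict date matching and temporal alignment concrete, here is a minimal sketch. It is not the paper's definition of the metrics. It assumes a timeline is a mapping from a date to a list of tokens, uses unigram recall only, pairs each reference date with the nearest system date (many-to-one), and discounts overlap by 1/(1 + days apart). The function names and the discount are illustrative assumptions.

```python
from collections import Counter
from datetime import date


def overlap(sys_tokens, ref_tokens):
    """Clipped unigram overlap between two token lists."""
    sys_counts, ref_counts = Counter(sys_tokens), Counter(ref_tokens)
    return sum(min(c, ref_counts[t]) for t, c in sys_counts.items())


def strict_recall(system, reference):
    """Unigram recall comparing only daily summaries with identical dates."""
    total = sum(len(tokens) for tokens in reference.values())
    hits = sum(overlap(system.get(d, []), tokens)
               for d, tokens in reference.items())
    return hits / total if total else 0.0


def aligned_recall(system, reference):
    """Unigram recall with each reference date aligned to the nearest
    system date; overlap is discounted by 1 / (1 + days apart).
    The discount is an assumption made for this sketch."""
    total = sum(len(tokens) for tokens in reference.values())
    if not total or not system:
        return 0.0
    hits = 0.0
    for d, tokens in reference.items():
        nearest = min(system, key=lambda s: abs((s - d).days))
        weight = 1.0 / (1 + abs((nearest - d).days))
        hits += weight * overlap(system[nearest], tokens)
    return hits / total


# Hypothetical toy timelines: the system summary is one day late.
reference = {date(2011, 1, 25): ["protests", "begin", "in", "cairo"]}
system = {date(2011, 1, 26): ["protests", "begin", "in", "egypt"]}

print(strict_recall(system, reference))   # 0.0
print(aligned_recall(system, reference))  # 0.375
```

In the toy example, strict matching gives no credit because the dates differ, while the aligned variant gives partial credit: three of four reference tokens match, halved for the one-day offset.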
