This repository contains the SubSumE dataset for subjective document summarization. See the paper and the talk for details on dataset creation. Also check out our work SuDocu on example-based document summarization.
Download the dataset from here.
The dataset contains :
processed_state_sentences.csv
and are assigned a unique sentence id that
is used in summary json files. Each datapoint file in the directory user_summary_jsons
contains a json containing summaries of Wikipedia pages
of eight states with following keys:
processed_state_sentences.csv
) present in the summaryPaper | Code | Results | Date | Stars |
---|