Klexikon (Klexikon: A German Dataset for Joint Summarization and Simplification)

Introduced by Aumiller et al. in Klexikon: A German Dataset for Joint Summarization and Simplification

The dataset provides document-level alignments between German Wikipedia and the children's lexicon Klexikon. The Wikipedia source texts are written in more complex language than their Klexikon counterparts and are also significantly longer, which makes the dataset suitable for both summarization and simplification. Previous research has so far focused on only one of these two tasks and has not studied them comprehensively as a joint task.

License


  • CC-BY-SA
