Text Simplification
125 papers with code • 11 benchmarks • 20 datasets
Text Simplification is the task of reducing the complexity of the vocabulary and sentence structure of text while retaining its original meaning, with the goal of improving readability and understanding. Simplification has a variety of important societal applications, for example increasing accessibility for those with cognitive disabilities such as aphasia, dyslexia, and autism, or for non-native speakers and children with reading difficulties.
Most implemented papers
Felix: Flexible Text Editing Through Tagging and Insertion
Felix decomposes the text-editing task into two sub-tasks: tagging, which decides on the subset of input tokens to keep and their order in the output text, and insertion, which in-fills the output tokens that are not present in the input.
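The two sub-tasks can be illustrated with a toy sketch. This is not the Felix implementation: the function names `apply_tags` and `fill_masks`, the `[MASK]` placeholder, and the hand-written tags and fill tokens are all assumptions made for illustration, standing in for what trained tagging and insertion models would predict.

```python
from typing import List, Sequence

MASK = "[MASK]"  # hypothetical placeholder for a token the insertion step fills in


def apply_tags(tokens: Sequence[str], keep: Sequence[bool],
               order: Sequence[int], masks_after: Sequence[int]) -> List[str]:
    """Tagging step (toy version): select, reorder, and leave gaps.

    keep[i]        -- whether input token i survives in the output
    order          -- input indices in the order they should be emitted
    masks_after[i] -- number of MASK slots to open after input token i
    """
    template: List[str] = []
    for i in order:
        if keep[i]:
            template.append(tokens[i])
            template.extend([MASK] * masks_after[i])
    return template


def fill_masks(template: Sequence[str], fills: Sequence[str]) -> List[str]:
    """Insertion step (toy version): fill MASK slots left to right."""
    remaining = list(fills)
    output: List[str] = []
    for tok in template:
        output.append(remaining.pop(0) if tok == MASK else tok)
    return output


# Fuse two sentences: drop the first full stop and the repeated subject,
# then open one slot after "1912" for a connective.
tokens = "Turing was born in 1912 . Turing died in 1954 .".split()
keep = [True, True, True, True, True, False, False, True, True, True, True]
masks_after = [0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0]
order = list(range(len(tokens)))  # no reordering needed in this example

template = apply_tags(tokens, keep, order, masks_after)
result = fill_masks(template, ["and"])
print(" ".join(result))
```

In this sketch the tags and the fill token are supplied by hand; in a learned system each step would be predicted by its own model.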
Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation
This study introduces Ascle, a pioneering natural language processing (NLP) toolkit designed for medical text generation.
DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion
We author a set of rules for identifying a diverse set of discourse phenomena in raw text, and decomposing the text into two independent sentences.
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification
Sentence simplification is the task of rewriting texts so they are easier to understand.
Controllable Sentence Simplification
Text simplification aims at making a text easier to read and understand by simplifying grammar and structure while keeping the underlying information identical.
HTSS: A Novel Hybrid Text Summarisation and Simplification Architecture
Our results show that our proposed HTSS model outperforms neural text simplification (NTS) on SARI score and abstractive text summarisation (ATS) on the ROUGE score.
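For readers unfamiliar with overlap-based metrics, the following is a minimal sketch of a ROUGE-1 style F1 score, assuming lowercased whitespace tokenization and a single reference. It is a simplification for illustration, not the official ROUGE scorer and not the evaluation code used in the paper; the function name `rouge1_f1` is hypothetical. SARI, which also compares against the source sentence, is not covered here.

```python
from collections import Counter


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a candidate and one reference (simplified)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Clipped overlap: each word counts at most as often as it appears in both.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


score = rouge1_f1("the cat", "the cat sat on the mat")
print(round(score, 3))
```

Here both candidate words appear in the reference (precision 1), but they cover only two of its six tokens (recall 1/3), which gives an F1 of 0.5.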
Control Prefixes for Parameter-Efficient Text Generation
Prefix-tuning is a powerful lightweight technique for adapting a large pre-trained language model to a downstream application.
Klexikon: A German Dataset for Joint Summarization and Simplification
Traditionally, Text Simplification is treated as a monolingual translation task where sentences between source texts and their simplified counterparts are aligned for training.
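As a rough illustration of what aligning sentences for training can look like, the sketch below pairs each source sentence with its most similar simplified sentence using word-set (Jaccard) overlap. The similarity measure, the `threshold` value, and the function names `jaccard` and `align` are assumptions chosen for brevity; they are not the alignment method used for Klexikon or any particular dataset.

```python
from typing import List, Sequence, Tuple


def jaccard(a: str, b: str) -> float:
    """Word-set overlap between two sentences, in [0, 1]."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def align(complex_sents: Sequence[str], simple_sents: Sequence[str],
          threshold: float = 0.2) -> List[Tuple[str, str]]:
    """Greedily pair each complex sentence with its best simple match.

    Sentences whose best match scores below the threshold stay unaligned.
    """
    pairs: List[Tuple[str, str]] = []
    for c in complex_sents:
        best = max(simple_sents, key=lambda s: jaccard(c, s), default=None)
        if best is not None and jaccard(c, best) >= threshold:
            pairs.append((c, best))
    return pairs


complex_sents = ["The feline reposed upon the mat",
                 "Afterwards it departed the room"]
simple_sents = ["The cat sat on the mat",
                "Then it left the room"]
for src, tgt in align(complex_sents, simple_sents):
    print(src, "->", tgt)
```

The resulting (complex, simple) pairs play the role of source and target sentences in the monolingual translation setup described above.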
Lossless Acceleration for Seq2seq Generation with Aggressive Decoding
We study lossless acceleration for seq2seq generation with a novel decoding algorithm: Aggressive Decoding.
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Diffusion models have emerged as a powerful paradigm for generation, obtaining strong performance in various continuous domains.