Text generation is the task of producing text with the goal of being indistinguishable from human-written text.
(Image credit: Adversarial Ranking for Language Generation)
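For concreteness, here is a minimal sketch of what the task looks like in code, assuming the Hugging Face transformers library and the public gpt2 checkpoint (neither is prescribed by this page): a prompt is encoded, a pretrained language model samples a continuation, and the result is decoded back to text.

```python
# Minimal open-ended text generation sketch (library and checkpoint are
# assumptions for illustration, not requirements of the task itself).
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The task of text generation aims to"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

# Sampling (top-k / nucleus) usually reads more naturally than greedy
# decoding for open-ended generation.
output_ids = model.generate(
    input_ids,
    max_length=60,
    do_sample=True,
    top_k=50,
    top_p=0.95,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```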
Additionally, these models are typically trained via maximum likelihood and teacher forcing.
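A minimal PyTorch sketch of what maximum-likelihood training with teacher forcing means in practice: the model is always conditioned on the ground-truth prefix, and the loss is the cross-entropy of the next token. The model interface and names below are assumptions for illustration.

```python
import torch.nn.functional as F

def teacher_forcing_loss(model, tokens):
    """tokens: (batch, seq_len) ground-truth token ids.
    `model` is assumed to map (batch, seq_len) ids to (batch, seq_len, vocab) logits."""
    inputs, targets = tokens[:, :-1], tokens[:, 1:]   # condition on the gold prefix
    logits = model(inputs)
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),          # (batch * (seq_len-1), vocab)
        targets.reshape(-1),                          # (batch * (seq_len-1),)
    )

# One optimization step (optimizer and batch are assumed to exist):
# loss = teacher_forcing_loss(model, batch_tokens)
# loss.backward(); optimizer.step(); optimizer.zero_grad()
```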
Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks.
Ranked #6 on Question Answering on Natural Questions (short)
Large transformer-based language models (LMs) trained on huge text corpora have shown unparalleled generation capabilities.
We evaluate a number of noising approaches, finding the best performance by both randomly shuffling the order of the original sentences and using a novel in-filling scheme, where spans of text are replaced with a single mask token.
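A simplified sketch of the two noising operations described above, sentence shuffling and span in-filling, assuming already-tokenized input; the Poisson span lengths follow the BART-style setup this excerpt appears to describe, and all function names and ratios are illustrative.

```python
import random
import numpy as np

MASK = "<mask>"

def shuffle_sentences(sentences):
    """Randomly permute the order of the document's sentences."""
    sentences = list(sentences)
    random.shuffle(sentences)
    return sentences

def infill_spans(tokens, mask_ratio=0.3, span_lambda=3):
    """Replace random token spans with a single mask token until roughly
    `mask_ratio` of the original tokens have been removed."""
    tokens = list(tokens)
    budget = int(len(tokens) * mask_ratio)
    removed = 0
    while removed < budget and len(tokens) > 1:
        span_len = int(np.random.poisson(span_lambda))
        start = random.randrange(0, len(tokens))
        span_len = min(span_len, len(tokens) - start)
        # The whole span becomes one mask token; a 0-length span just inserts a mask.
        tokens[start:start + span_len] = [MASK]
        removed += span_len
    return tokens

# Example:
# noisy = infill_spans("the quick brown fox jumps over the lazy dog".split())
```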
Ranked #2 on Text Summarization on X-Sum
Transformer architectures have facilitated building higher-capacity models and pretraining has made it possible to effectively utilize this capacity for a wide variety of tasks.
Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on task-specific datasets.
Ranked #1 on Language Modelling on enwik8 (using extra training data)
An ideal environment for evaluating dialog systems, also known as the Turing test, needs to involve human interaction, which is usually not affordable for large-scale experiments.
We propose encoder-centric stepwise models for extractive summarization using structured transformers -- HiBERT and Extended Transformers.
fairseq is an open-source sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling, and other text generation tasks.
We observe that our method consistently outperforms beam search (BS) and previously proposed techniques for diverse decoding from neural sequence models.
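For intuition, below is a rough sketch of the kind of group-based diversity penalty used in diverse decoding, which this excerpt contrasts with plain beam search; the specific penalty form, the greedy per-beam selection, and the names are simplifying assumptions, not the paper's exact method.

```python
import torch

def diverse_group_scores(log_probs_per_group, diversity_strength=0.5):
    """log_probs_per_group: list of (beam, vocab) tensors, one per beam group.
    Returns the diversity-penalized scores each group uses to pick its tokens."""
    vocab = log_probs_per_group[0].size(-1)
    token_counts = torch.zeros(vocab)   # how often earlier groups picked each token
    penalized = []
    for log_probs in log_probs_per_group:
        # Penalize tokens already chosen by earlier groups at this time step.
        scores = log_probs - diversity_strength * token_counts
        penalized.append(scores)
        # Simplification: each beam greedily picks one token; those choices
        # feed the penalty applied to the remaining groups.
        chosen = scores.argmax(dim=-1)
        token_counts.scatter_add_(0, chosen, torch.ones(chosen.numel()))
    return penalized

# Example with 3 groups of 2 beams over a toy vocabulary of 5 tokens:
# groups = [torch.log_softmax(torch.randn(2, 5), dim=-1) for _ in range(3)]
# scores = diverse_group_scores(groups, diversity_strength=0.5)
```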