Video Generation
264 papers with code • 15 benchmarks • 14 datasets
(Various video generation tasks. GIF credit: MaGViT)
Libraries
Use these libraries to find Video Generation models and implementations.
Most implemented papers
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture
FlowGAN generates optical flow, which captures only the edges and motion of the videos to be generated.
Stochastic Video Generation with a Learned Prior
Sample generations are both varied and sharp, even many frames into the future, and compare favorably to those from existing approaches.
Point-to-Point Video Generation
We introduce point-to-point video generation that controls the generation process with two control points: the targeted start- and end-frames.
Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample
We consider the task of generating diverse and novel videos from a single video sample.
VideoGPT: Video Generation using VQ-VAE and Transformers
We present VideoGPT: a conceptually simple architecture for scaling likelihood based generative modeling to natural videos.
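VideoGPT's two-stage recipe can be sketched in miniature: a VQ-VAE first compresses video into a sequence of discrete codebook tokens, and a transformer then models those tokens autoregressively. The sketch below (illustrative only, not the paper's code; all names are hypothetical) reduces stage one to nearest-neighbor codebook lookup:

```python
import numpy as np

def quantize(latents, codebook):
    """Map each latent vector to the index of its nearest codebook entry."""
    # latents: (N, D), codebook: (K, D) -> token indices: (N,)
    d = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    return d.argmin(axis=1)

def dequantize(tokens, codebook):
    """Recover the quantized latents from token indices."""
    return codebook[tokens]

rng = np.random.default_rng(0)
codebook = rng.normal(size=(8, 4))   # K=8 codes of dimension 4
latents = rng.normal(size=(16, 4))   # 16 latent vectors from a "video"

tokens = quantize(latents, codebook) # discrete sequence a transformer would model
recon = dequantize(tokens, codebook) # quantized reconstruction of the latents
```

In the real model the encoder, decoder, and autoregressive prior are learned networks; the point here is only the discretize-then-model pipeline.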
Video Diffusion Models
Generating temporally coherent high fidelity video is an important milestone in generative modeling research.
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives
In light of this, we propose the Disentangled Objective Video Quality Evaluator (DOVER) to learn the quality of UGC videos based on the two perspectives.
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
To replicate the success of text-to-image (T2I) generation, recent works employ large-scale video datasets to train a text-to-video (T2V) generator.
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
We first pre-train an LDM on images only; then, we turn the image generator into a video generator by introducing a temporal dimension to the latent space diffusion model and fine-tuning on encoded image sequences, i.e., videos.
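The "inflation" idea above can be sketched with plain arrays: spatial layers of the pre-trained image model run on each frame independently, while an inserted temporal layer mixes information across the new time axis. This is an illustrative sketch under assumed shapes, not the paper's implementation; `spatial_layer` and `temporal_layer` are hypothetical stand-ins:

```python
import numpy as np

def spatial_layer(x):
    # Placeholder for a per-frame op of the image model on (B*T, C, H, W).
    return 0.5 * x

def temporal_layer(x, T):
    # Reshape (B*T, C, H, W) -> (B, T, C, H, W) and mix over the time axis,
    # standing in for a learned temporal attention/convolution block.
    BT, C, H, W = x.shape
    B = BT // T
    v = x.reshape(B, T, C, H, W)
    mixed = v + v.mean(axis=1, keepdims=True)  # residual temporal mixing
    return mixed.reshape(BT, C, H, W)

B, T, C, H, W = 2, 4, 3, 8, 8
video = np.random.default_rng(1).normal(size=(B * T, C, H, W))
out = temporal_layer(spatial_layer(video), T)
```

Keeping frames flattened into the batch dimension for the spatial layers is what lets the image weights be reused unchanged; only the temporal blocks are new parameters to fine-tune.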
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
With the availability of large-scale video datasets and the advances of diffusion models, text-driven video generation has achieved substantial progress.