Generative models that can predict sequences of future events can, in principle, learn to capture complex real-world phenomena, such as physical interactions.
We propose a first set of metrics to quantitatively evaluate both the accuracy and the perceptual quality of the temporal evolution.
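The snippet does not name the metrics themselves; as a minimal, hypothetical illustration of measuring accuracy along the temporal axis, the sketch below computes a per-frame PSNR curve in NumPy (the function name, shapes, and protocol are assumptions, not the paper's):

```python
import numpy as np

def per_frame_psnr(pred, target, max_val=1.0):
    """Frame-wise PSNR between two videos of shape (T, H, W, C).

    Returns one PSNR value per time step, so accuracy can be tracked
    as it degrades over the predicted horizon.
    """
    mse = np.mean((pred - target) ** 2, axis=(1, 2, 3))  # (T,)
    mse = np.maximum(mse, 1e-10)                         # avoid log(0)
    return 10.0 * np.log10(max_val ** 2 / mse)

# Example: a 16-frame 64x64 RGB prediction vs. ground truth.
rng = np.random.default_rng(0)
gt = rng.random((16, 64, 64, 3))
pred = np.clip(gt + 0.05 * rng.standard_normal(gt.shape), 0.0, 1.0)
print(per_frame_psnr(pred, gt))  # one PSNR value per time step
```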
The proposed framework generates a video by mapping a sequence of random vectors to a sequence of video frames.
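As a concrete reading of that sentence, here is a minimal sketch (layer sizes and structure are illustrative assumptions, not the paper's architecture): a recurrent network turns the random-vector sequence into temporally correlated codes, and a shared decoder renders each code as a frame.

```python
import torch
import torch.nn as nn

class VideoGenerator(nn.Module):
    """Map a sequence of latent vectors to a sequence of frames.

    A GRU gives the latent sequence temporal coherence; a shared
    transposed-conv decoder renders each latent into a 64x64 RGB frame.
    """
    def __init__(self, z_dim=64, hid=128):
        super().__init__()
        self.rnn = nn.GRU(z_dim, hid, batch_first=True)
        self.decode = nn.Sequential(
            nn.ConvTranspose2d(hid, 128, 4, 1, 0), nn.ReLU(),  # 4x4
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.ReLU(),   # 8x8
            nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.ReLU(),    # 16x16
            nn.ConvTranspose2d(32, 16, 4, 2, 1), nn.ReLU(),    # 32x32
            nn.ConvTranspose2d(16, 3, 4, 2, 1), nn.Tanh(),     # 64x64
        )

    def forward(self, z):                  # z: (B, T, z_dim)
        h, _ = self.rnn(z)                 # (B, T, hid)
        B, T, H = h.shape
        frames = self.decode(h.reshape(B * T, H, 1, 1))
        return frames.reshape(B, T, 3, 64, 64)

video = VideoGenerator()(torch.randn(2, 16, 64))  # (2, 16, 3, 64, 64)
```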
We introduce a data-driven approach for unsupervised video retargeting that translates content from one domain to another while preserving the style native to a domain, i.e., if the content of John Oliver's speech were transferred to Stephen Colbert, the generated speech should be in Stephen Colbert's style.
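This appears to be the Recycle-GAN abstract; if so, the central training signal is a "recycle" consistency loss that couples cross-domain translation with temporal prediction. A minimal sketch with stand-in modules follows (the names G_xy, G_yx, and P_y and all shapes are assumptions):

```python
import torch
import torch.nn as nn

# Stand-ins; the real translators and temporal predictor are learned.
G_xy = nn.Identity()  # domain X -> domain Y
G_yx = nn.Identity()  # domain Y -> domain X
P_y  = nn.Identity()  # predicts the next frame from past frames in Y

def recycle_loss(x_past, x_next):
    """Cross-domain temporal consistency: translate past frames to Y,
    predict the next Y frame, translate back to X, and compare with
    the true next X frame."""
    y_past = G_xy(x_past)
    x_next_hat = G_yx(P_y(y_past))
    return nn.functional.l1_loss(x_next_hat, x_next)

# With identity stand-ins this reduces to mean |x_past - x_next|:
print(recycle_loss(torch.zeros(1, 3, 8, 8), torch.ones(1, 3, 8, 8)))
```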
This paper presents a simple method for "do as I do" motion transfer: given a source video of a person dancing, we can transfer that performance to a novel (amateur) target after only a few minutes of the target subject performing standard moves.
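The described pipeline factors into per-frame pose extraction followed by pose-conditioned rendering of the target subject; below is a stand-in sketch of that interface (module names are hypothetical, and the real system uses a learned pose estimator and a translation network trained on the target footage):

```python
import torch
import torch.nn as nn

# Stand-ins: a pose estimator producing stick-figure pose maps, and a
# pose-to-image network trained on a few minutes of the target subject.
pose_estimator = nn.Identity()   # frame -> rendered pose map
pose_to_target = nn.Identity()   # pose map -> frame of the target person

def transfer_performance(source_video):
    """Per-frame 'do as I do' transfer: read the pose off the source
    dancer, then render the target subject in that pose.
    source_video: (T, 3, H, W)."""
    poses = pose_estimator(source_video)
    return pose_to_target(poses)
```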
We propose a new story-to-image-sequence generation model, StoryGAN, based on the sequential conditional GAN framework.
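"Sequential conditional GAN" here means each image is conditioned on the story so far rather than on its sentence alone; a minimal sketch of that conditioning follows (dimensions and module names are assumptions, not StoryGAN's actual design):

```python
import torch
import torch.nn as nn

story_encoder = nn.GRU(128, 128, batch_first=True)  # sentence embeddings -> running context
image_generator = nn.Identity()                     # stand-in conditional image generator

def generate_story(sentence_embs):  # (B, T, 128), one embedding per sentence
    """Sequential conditioning: each image is generated from the story
    context accumulated so far plus fresh noise, so characters and
    scenes can stay consistent across the sequence."""
    context, _ = story_encoder(sentence_embs)   # (B, T, 128)
    z = torch.randn_like(context)               # per-step latent noise
    return image_generator(torch.cat([context, z], dim=-1))
```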
This paper proposes a deep neural network that takes an audio signal A of a source person and a very short video V of a target person as input, and outputs a synthesized high-quality talking-face video with personalized head pose (drawing on the visual information in V), expression, and lip synchronization (considering both A and V).
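The sentence fully specifies the model's interface, which a sketch can make concrete; everything inside the class below (feature sizes, mel-spectrogram audio input, flat linear encoders) is an assumption standing in for the paper's learned modules:

```python
import torch
import torch.nn as nn

class TalkingFaceSketch(nn.Module):
    """Interface sketch: source audio features plus a short reference
    clip of the target go in; a talking-face frame sequence comes out."""
    def __init__(self, feat=128):
        super().__init__()
        self.audio_enc = nn.Linear(80, feat)            # per-frame mel features
        self.video_enc = nn.Linear(3 * 64 * 64, feat)   # identity/pose from V
        self.decoder = nn.Linear(2 * feat, 3 * 64 * 64)

    def forward(self, audio, ref_clip):
        # audio: (B, T, 80); ref_clip: (B, K, 3, 64, 64)
        a = self.audio_enc(audio)                        # (B, T, feat)
        v = self.video_enc(ref_clip.flatten(2)).mean(1)  # (B, feat)
        h = torch.cat([a, v.unsqueeze(1).expand_as(a)], -1)
        return self.decoder(h).reshape(*a.shape[:2], 3, 64, 64)

model = TalkingFaceSketch()
out = model(torch.randn(1, 25, 80), torch.randn(1, 4, 3, 64, 64))
print(out.shape)  # (1, 25, 3, 64, 64): one frame per audio step
```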
This paper proposes the novel task of video generation conditioned on a SINGLE semantic label map, which provides a good balance between flexibility and quality in the generation process.
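One natural decomposition of this task, suggested by the conditioning setup but not confirmed by the snippet (so treat it as an assumption), is image synthesis from the label map followed by temporal extrapolation; the stand-in sketch below shows that two-stage interface:

```python
import torch
import torch.nn as nn

# Stand-ins for the two assumed stages: image synthesis from the
# semantic label map, then frame-by-frame temporal extrapolation.
label_to_image = nn.Identity()   # semantic label map -> first frame
frame_predictor = nn.Identity()  # previous frame -> next frame

def video_from_label_map(label_map, num_frames=16):
    """Seed the video with one synthesized frame, then roll forward."""
    frames = [label_to_image(label_map)]
    for _ in range(num_frames - 1):
        frames.append(frame_predictor(frames[-1]))
    return torch.stack(frames, dim=1)    # (B, T, C, H, W)
```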