Video Generation

239 papers with code • 15 benchmarks • 14 datasets

( Various Video Generation Tasks. Gif credit: MaGViT )

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Generation

Dataset	Best Model	Compare
UCF-101	W.A.L.T-XL (class-conditional)	See all
BAIR Robot Pushing	MAGVIT	See all
Sky Time-lapse	StyleSV (256x256)	See all
UCF-101 16 frames, 64x64, Unconditional	Make-A-Video (ours) vs. CogVideo (Chinese)	See all
UCF-101 16 frames, Unconditional, Single GPU	TGAN-F	See all
LAION-400M	Imagen original (constant=6)	See all
Taichi	StyleSV (256x256)	See all
UCF-101 16 frames, 128x128, Unconditional	TGANv2 (2020)	See all
Kinetics-600 12 frames, 64x64	W.A.L.T-L	See all
TrailerFaces	PG-SWGAN-3D	See all
Kinetics-600 48 frames, 64x64	DVD-GAN	See all
Kinetics-600 12 frames, 128x128	DVD-GAN	See all
How2Sign	INR-V	See all
YouTube Driving	StyleSV	See all
MSR-VTT	VideoAssembler (Zero-Shot, 256x256, class-conditional)	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Video Generation models and implementations

faceonlive/ai-research

3 papers

124

stability-ai/generative-models

2 papers

22,088

nvlabs/long-video-gan

2 papers

301

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

nus-hpc-ai-lab/opendit • • 15 Mar 2024

Scaling large models with long sequences across applications like language generation, video generation and multimodal tasks requires efficient sequence parallelism.

981

15 Mar 2024

Paper
Code

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

mayuelala/followyourclick • 13 Mar 2024

Despite recent advances in image-to-video generation, better controllability and local animation are less explored.

700

13 Mar 2024

Paper
Code

DragAnything: Motion Control for Anything using Entity Representation

showlab/draganything • • 12 Mar 2024

We introduce DragAnything, which utilizes a entity representation to achieve motion control for any object in controllable video generation.

283

12 Mar 2024

Paper
Code

SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces

shim0114/ssm-meets-video-diffusion-models • • 12 Mar 2024

In the experiments, we first evaluate our SSM-based model with UCF101, a standard benchmark of video generation.

12 Mar 2024

Paper
Code

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

wangwenhao0716/vidprom • 10 Mar 2024

In this paper, we introduce VidProM, the first large-scale dataset comprising 1. 67 million unique text-to-video prompts from real users.

10 Mar 2024

Paper
Code

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

ybybzhang/videoelevator • • 8 Mar 2024

Different from conventional T2V sampling (i. e., temporal and spatial modeling), VideoElevator explicitly decomposes each sampling step into temporal motion refining and spatial quality elevating.

110

08 Mar 2024

Paper
Code

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

XuweiyiChen/UniCtrl • • 4 Mar 2024

Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content.

04 Mar 2024

Paper
Code

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

lichao-sun/sorareview • • 27 Feb 2024

Sora is a text-to-video generative AI model, released by OpenAI in February 2024.

445

27 Feb 2024

Paper
Code

VGMShield: Mitigating Misuse of Video Generative Models

py85252876/mmvgm • • 20 Feb 2024

Together with fake video detection and tracing, our multi-faceted set of solutions can effectively mitigate misuse of video generative models.

20 Feb 2024

Paper
Code

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

guolanqing/self-cascade • 16 Feb 2024

Diffusion models have proven to be highly effective in image and video generation; however, they still face composition challenges when generating images of varying sizes due to single-scale training data.

16 Feb 2024

Paper
Code

Video Generation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result