Video Generation
248 papers with code • 15 benchmarks • 14 datasets
(Various video generation tasks. GIF credit: MaGViT)
Libraries
Use these libraries to find Video Generation models and implementations.
Datasets
Latest papers
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems.
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
This module converts the generated sequence of images into videos with smooth transitions and consistent subjects that are significantly more stable than the modules based on latent spaces only, especially in the context of long video generation.
FlexiFilm: Long Video Generation with Flexible Conditions
Generating long and consistent videos has emerged as a significant yet challenging problem.
TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.
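The snippet above only names the "repeat-and-slide" strategy; a minimal toy sketch of the idea, with all names (`denoise_fn`, `repeat_and_slide`) hypothetical rather than taken from the TI2V-Zero code, might look like this: the conditioning window starts as repeated copies of the input image, and each newly synthesized frame slides into the window while the oldest entry drops out.

```python
from collections import deque

def repeat_and_slide(first_frame, num_frames, window, denoise_fn):
    """Toy sketch of a repeat-and-slide schedule (names hypothetical).

    The context window is initialized by repeating the conditioning
    image, so the frozen model initially attends only to that image;
    each denoised frame is then appended and the window slides forward.
    """
    ctx = deque([first_frame] * window, maxlen=window)
    frames = [first_frame]
    for _ in range(num_frames - 1):
        # Stand-in for one pass of the frozen diffusion model,
        # conditioned on the current sliding window of frames.
        nxt = denoise_fn(list(ctx))
        ctx.append(nxt)
        frames.append(nxt)
    return frames
```

With a dummy `denoise_fn` that returns the last context frame plus one, `repeat_and_slide(0, 4, 2, fn)` yields `[0, 1, 2, 3]`, illustrating how each frame is produced from the window that precedes it.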
Synthesizing Audio from Silent Video using Sequence to Sequence Modeling
Generating audio from a video's visual context has multiple practical applications in improving how we interact with audio-visual media - for example, enhancing CCTV footage analysis, restoring historical videos (e.g., silent movies), and improving video generation models.
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.
TAVGBench: Benchmarking Text to Audible-Video Generation
To support research in this field, we have developed a comprehensive Text to Audible-Video Generation Benchmark (TAVGBench), which contains over 1.7 million clips with a total duration of 11.8 thousand hours.
On the Content Bias in Fréchet Video Distance
We show that FVD with features extracted from the recent large-scale self-supervised video models is less biased toward image quality.
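FVD compares real and generated videos by fitting a Gaussian to each set of extracted features and measuring the Fréchet distance between the two Gaussians; the paper's point is that the choice of feature extractor determines what the metric is sensitive to. A minimal sketch of the distance itself (the feature-extraction step is omitted):

```python
import numpy as np
from scipy.linalg import sqrtm

def frechet_distance(feats_real, feats_gen):
    """Fréchet distance between Gaussians fitted to two feature sets.

    feats_real, feats_gen: arrays of shape (num_videos, feature_dim),
    e.g. per-video embeddings from I3D or a self-supervised video model.
    """
    mu1, mu2 = feats_real.mean(axis=0), feats_gen.mean(axis=0)
    sigma1 = np.cov(feats_real, rowvar=False)
    sigma2 = np.cov(feats_gen, rowvar=False)
    # Matrix square root of the covariance product; discard the small
    # imaginary residue that sqrtm can introduce numerically.
    covmean = sqrtm(sigma1 @ sigma2)
    if np.iscomplexobj(covmean):
        covmean = covmean.real
    return float(np.sum((mu1 - mu2) ** 2)
                 + np.trace(sigma1 + sigma2 - 2.0 * covmean))
```

Swapping the backbone that produces `feats_real`/`feats_gen` changes the metric's bias: the finding above is that features from large self-supervised video models weight temporal realism more and per-frame image quality less.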
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Recent advances in Text-to-Video (T2V) generation have achieved remarkable success in synthesizing high-quality general videos from textual descriptions.
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
Controllability plays a crucial role in video generation since it allows users to create desired content.