Video Generation

241 papers with code • 15 benchmarks • 14 datasets

(Various video generation tasks. GIF credit: MAGVIT)

Benchmarks

These leaderboards track progress in Video Generation.

Dataset	Best Model
UCF-101	W.A.L.T-XL (class-conditional)
BAIR Robot Pushing	MAGVIT
Sky Time-lapse	StyleSV (256x256)
UCF-101 16 frames, 64x64, Unconditional	Make-A-Video (ours) vs. CogVideo (Chinese)
UCF-101 16 frames, Unconditional, Single GPU	TGAN-F
LAION-400M	Imagen original (constant=6)
Taichi	StyleSV (256x256)
UCF-101 16 frames, 128x128, Unconditional	TGANv2 (2020)
Kinetics-600 12 frames, 64x64	W.A.L.T-L
TrailerFaces	PG-SWGAN-3D
Kinetics-600 48 frames, 64x64	DVD-GAN
Kinetics-600 12 frames, 128x128	DVD-GAN
How2Sign	INR-V
YouTube Driving	StyleSV
MSR-VTT	VideoAssembler (Zero-Shot, 256x256, class-conditional)

Libraries

Use these libraries to find Video Generation models and implementations.

faceonlive/ai-research • 3 papers • 156 stars

stability-ai/generative-models • 2 papers • 22,280 stars

nvlabs/long-video-gan • 2 papers • 300 stars

Latest papers with no code

Motion Inversion for Video Customization

no code yet • 29 Mar 2024

In this research, we present a novel approach to motion customization in video generation, addressing the widespread gap in the thorough exploration of motion representation within video generative models.

Frame by Familiar Frame: Understanding Replication in Video Diffusion Models

no code yet • 28 Mar 2024

In our paper, we present a systematic investigation into the phenomenon of sample replication in video diffusion models.

A Review of Multi-Modal Large Language and Vision Models

no code yet • 28 Mar 2024

Large Language Models (LLMs) have recently emerged as a focal point of research and application, driven by their unprecedented ability to understand and generate text with human-like quality.

TC4D: Trajectory-Conditioned Text-to-4D Generation

no code yet • 26 Mar 2024

We learn local deformations that conform to the global trajectory using supervision from a text-to-video model.

Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields

no code yet • 26 Mar 2024

It is composed of a denoising diffusion probabilistic model (DDPM) generating high-fidelity synthetic cell microscopy images and a flow prediction model (FPM) predicting the non-rigid transformation between consecutive video frames.

Tutorial on Diffusion Models for Imaging and Vision

no code yet • 26 Mar 2024

The goal of this tutorial is to discuss the essential ideas underlying the diffusion models.

A Survey on Long Video Generation: Challenges, Methods, and Prospects

no code yet • 25 Mar 2024

Video generation is a rapidly advancing research area, garnering significant attention due to its broad range of applications.

TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models

no code yet • 25 Mar 2024

Next, TRIP executes a residual-like dual-path scheme for noise prediction: 1) a shortcut path that directly takes image noise prior as the reference noise of each frame to amplify the alignment between the first frame and subsequent frames; 2) a residual path that employs 3D-UNet over noised video and static image latent codes to enable inter-frame relational reasoning, thereby easing the learning of the residual noise for each frame.

Opportunities and challenges in the application of large artificial intelligence models in radiology

no code yet • 24 Mar 2024

Influenced by ChatGPT, artificial intelligence (AI) large models have witnessed a global upsurge in large model research and development.

Spectral Motion Alignment for Video Motion Transfer using Diffusion Models

no code yet • 22 Mar 2024

The evolution of diffusion models has greatly impacted video generation and understanding.
