Video Generation

248 papers with code • 15 benchmarks • 14 datasets

(Various Video Generation Tasks. GIF credit: MaGViT)

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

gigaai-research/general-world-models-survey 6 May 2024

General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems.

56 stars

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

hvision-nku/storydiffusion 2 May 2024

This module converts the generated sequence of images into videos with smooth transitions and consistent subjects that are significantly more stable than the modules based on latent spaces only, especially in the context of long video generation.

3,656 stars

FlexiFilm: Long Video Generation with Flexible Conditions

Y-ichen/FlexiFilm 29 Apr 2024

Generating long and consistent videos has emerged as a significant yet challenging problem.

12 stars

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

nihaomiao/cvpr23_lfdm 25 Apr 2024

To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.

419 stars

Synthesizing Audio from Silent Video using Sequence to Sequence Modeling

Adam-Haile/vita-research-group 25 Apr 2024

Generating audio from a video's visual context has multiple practical applications in improving how we interact with audio-visual media - for example, enhancing CCTV footage analysis, restoring historical videos (e.g., silent movies), and improving video generation models.

0 stars

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

id-animator/id-animator 23 Apr 2024

Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.

160 stars

TAVGBench: Benchmarking Text to Audible-Video Generation

opennlplab/tavgbench 22 Apr 2024

To support research in this field, we have developed a comprehensive Text to Audible-Video Generation Benchmark (TAVGBench), which contains over 1.7 million clips with a total duration of 11.8 thousand hours.

5 stars

On the Content Bias in Fréchet Video Distance

songweige/tats 18 Apr 2024

We show that FVD with features extracted from the recent large-scale self-supervised video models is less biased toward image quality.

245 stars
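
For context on the FVD entry above: FVD is a Fréchet distance between Gaussian fits to features extracted from real and generated videos. In general, d^2 = ||mu_a - mu_b||^2 + Tr(S_a + S_b - 2 (S_a S_b)^(1/2)). The Python sketch below is illustrative only and is not taken from the paper or from the songweige/tats repository. It assumes diagonal covariances so that it needs nothing beyond the standard library, whereas the full metric uses full covariance matrices and a matrix square root. The function names `feature_stats` and `frechet_distance_diag` are hypothetical, and the feature vectors would come from whichever video model is chosen as the feature extractor.

```python
import math


def feature_stats(features):
    """Per-dimension mean and population variance of a list of feature vectors."""
    n = len(features)
    dims = len(features[0])
    means = [sum(f[d] for f in features) / n for d in range(dims)]
    variances = [
        sum((f[d] - means[d]) ** 2 for f in features) / n for d in range(dims)
    ]
    return means, variances


def frechet_distance_diag(mean_a, var_a, mean_b, var_b):
    """Squared Frechet distance between two Gaussians.

    Simplifying assumption (not part of the real metric): both covariance
    matrices are diagonal, so the trace term reduces to a per-dimension sum.
    """
    if not (len(mean_a) == len(var_a) == len(mean_b) == len(var_b)):
        raise ValueError("all inputs must have the same length")
    # Squared Euclidean distance between the two means.
    mean_term = sum((ma - mb) ** 2 for ma, mb in zip(mean_a, mean_b))
    # Tr(S_a + S_b - 2 (S_a S_b)^(1/2)) for diagonal S_a, S_b.
    cov_term = sum(
        va + vb - 2.0 * math.sqrt(va * vb) for va, vb in zip(var_a, var_b)
    )
    return mean_term + cov_term
```

With this sketch, two feature sets with identical statistics give a distance of zero, and the value grows as either the means or the variances drift apart. Which feature extractor supplies the vectors is exactly the design choice the paper examines.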

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

pku-yuangroup/magictime 7 Apr 2024

Recent advances in Text-to-Video generation (T2V) have achieved remarkable success in synthesizing high-quality general videos from textual descriptions.

1,093 stars

CameraCtrl: Enabling Camera Control for Text-to-Video Generation

hehao13/cameractrl 2 Apr 2024

Controllability plays a crucial role in video generation since it allows users to create desired content.

220 stars