Video Generation

239 papers with code • 15 benchmarks • 14 datasets

( Various Video Generation Tasks. Gif credit: MaGViT )

Libraries

Use these libraries to find Video Generation models and implementations

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

nus-hpc-ai-lab/opendit 15 Mar 2024

Scaling large models with long sequences across applications like language generation, video generation and multimodal tasks requires efficient sequence parallelism.

981
15 Mar 2024

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

mayuelala/followyourclick 13 Mar 2024

Despite recent advances in image-to-video generation, better controllability and local animation are less explored.

700
13 Mar 2024

DragAnything: Motion Control for Anything using Entity Representation

showlab/draganything 12 Mar 2024

We introduce DragAnything, which utilizes a entity representation to achieve motion control for any object in controllable video generation.

283
12 Mar 2024

SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces

shim0114/ssm-meets-video-diffusion-models 12 Mar 2024

In the experiments, we first evaluate our SSM-based model with UCF101, a standard benchmark of video generation.

32
12 Mar 2024

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

wangwenhao0716/vidprom 10 Mar 2024

In this paper, we introduce VidProM, the first large-scale dataset comprising 1. 67 million unique text-to-video prompts from real users.

69
10 Mar 2024

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

ybybzhang/videoelevator 8 Mar 2024

Different from conventional T2V sampling (i. e., temporal and spatial modeling), VideoElevator explicitly decomposes each sampling step into temporal motion refining and spatial quality elevating.

110
08 Mar 2024

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

XuweiyiChen/UniCtrl 4 Mar 2024

Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content.

50
04 Mar 2024

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

lichao-sun/sorareview 27 Feb 2024

Sora is a text-to-video generative AI model, released by OpenAI in February 2024.

445
27 Feb 2024

VGMShield: Mitigating Misuse of Video Generative Models

py85252876/mmvgm 20 Feb 2024

Together with fake video detection and tracing, our multi-faceted set of solutions can effectively mitigate misuse of video generative models.

6
20 Feb 2024

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

guolanqing/self-cascade 16 Feb 2024

Diffusion models have proven to be highly effective in image and video generation; however, they still face composition challenges when generating images of varying sizes due to single-scale training data.

45
16 Feb 2024