Video Prediction

183 papers with code • 19 benchmarks • 24 datasets

Video Prediction is the task of predicting future frames given past video frames.

Source: Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames

Generalized Predictive Model for Autonomous Driving

opendrivelab/driveagi 14 Mar 2024

In this paper, we introduce the first large-scale video prediction model in the autonomous driving discipline.

359
14 Mar 2024

General surgery vision transformer: A video pre-trained foundation model for general surgery

samuelschmidgall/gsvit 9 Mar 2024

The absence of openly accessible data and specialized foundation models is a major barrier for computational research in surgery.

7
09 Mar 2024

Switch EMA: A Free Lunch for Better Flatness and Sharpness

Westlake-AI/openmixup 14 Feb 2024

Exponential Moving Average (EMA) is a widely used weight averaging (WA) regularization that learns flat optima for better generalization at no extra cost in deep neural network (DNN) optimization.

568
14 Feb 2024
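
As a rough illustration of the weight averaging described in this entry, the minimal sketch below applies a plain EMA update to a list of parameters. It is not the Switch EMA method from the paper, nor code from Westlake-AI/openmixup; the function name ema_update, the decay value, and the toy weight trajectory are all assumptions made for illustration.

```python
def ema_update(ema_weights, weights, decay=0.99):
    """One plain EMA step per parameter: decay * ema + (1 - decay) * current."""
    return [decay * e + (1.0 - decay) * w for e, w in zip(ema_weights, weights)]


# Hypothetical trajectory: one parameter vector per optimizer step (made-up numbers).
trajectory = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]

# Start the average at the first set of weights, then fold in each later step.
ema = list(trajectory[0])
for step_weights in trajectory[1:]:
    ema = ema_update(ema, step_weights, decay=0.5)

print(ema)
```

The raw weights keep oscillating between two points, while the averaged copy moves less from step to step. A larger decay would smooth the trajectory further but track recent weights more slowly.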

STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction

xiye20/stdiffproject 11 Dec 2023

Predicting future frames of a video is challenging because it is difficult to learn the uncertainty of the underlying factors influencing their contents.

7
11 Dec 2023

SVQ: Sparse Vector Quantization for Spatiotemporal Forecasting

Pachark/SVQ-Forecasting 6 Dec 2023

Moreover, we approximate the sparse regression process using a blend of a two-layer MLP and an extensive codebook.

2
06 Dec 2023

Rethinking Urban Mobility Prediction: A Super-Multivariate Time Series Forecasting Approach

chengyui/sumformer 4 Dec 2023

To address this challenge, we present the Super-Multivariate Urban Mobility Transformer (SUMformer), which utilizes a specially designed attention mechanism to calculate temporal and cross-variable correlations and reduce computational costs stemming from a large number of time series.

10
04 Dec 2023

Pair-wise Layer Attention with Spatial Masking for Video Prediction

mlvccn/pla_sm_videopred 19 Nov 2023

To this end, we present a Pair-wise Layer Attention with Spatial Masking (PLA-SM) framework for video prediction to capture the spatiotemporal dynamics, which reflect the motion trend.

1
19 Nov 2023

MMVP: Motion-Matrix-based Video Prediction

kay1794/mmvp-motion-matrix-based-video-prediction ICCV 2023

A central challenge of video prediction is that the system must reason about objects' future motions from image frames while keeping their appearances consistent across frames.

26
30 Aug 2023

SwinLSTM: Improving Spatiotemporal Prediction Accuracy using Swin Transformer and LSTM

SongTang-x/SwinLSTM 19 Aug 2023

In this paper, we propose a new recurrent cell, SwinLSTM, which integrates Swin Transformer blocks and the simplified LSTM, an extension that replaces the convolutional structure in ConvLSTM with the self-attention mechanism.

56
19 Aug 2023

Neural Multigrid Memory For Computational Fluid Dynamics

combi2k2/mg-turbulent-flow 21 Jun 2023

Turbulent flow simulation plays a crucial role in various applications, including aircraft and ship design, industrial process optimization, and weather prediction.

1
21 Jun 2023