TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Prediction	KTH	MSPred	LPIPS	0.029	# 1
Video Prediction	KTH	MSPred	PSNR	27.81	# 10
Video Prediction	KTH	MSPred	SSIM	0.951	# 1
Video Prediction	KTH	MSPred	MSE	23.18	# 1
Video Prediction	Moving MNIST	MSPred	MSE	34.44	# 19
Video Prediction	Moving MNIST	MSPred	SSIM	0.975	# 1
Video Prediction	Moving MNIST	MSPred	LPIPS	0.024	# 1
Video Prediction	Moving MNIST	MSPred	PSNR	26.82	# 1
Video Prediction	SynpickVP	MSPred	MSE	53.09	# 3
Video Prediction	SynpickVP	MSPred	PSNR	27.89	# 1
Video Prediction	SynpickVP	MSPred	SSIM	0.881	# 3
Video Prediction	SynpickVP	MSPred	LPIPS	0.033	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-prediction-at-multiple-scales-with/video-prediction-on-kth)](https://paperswithcode.com/sota/video-prediction-on-kth?p=video-prediction-at-multiple-scales-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-prediction-at-multiple-scales-with/video-prediction-on-synpickvp)](https://paperswithcode.com/sota/video-prediction-on-synpickvp?p=video-prediction-at-multiple-scales-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-prediction-at-multiple-scales-with/video-prediction-on-moving-mnist)](https://paperswithcode.com/sota/video-prediction-on-moving-mnist?p=video-prediction-at-multiple-scales-with)`

MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks

17 Mar 2022 · Angel Villar-Corrales, Ani Karapetyan, Andreas Boltres, Sven Behnke ·

Autonomous systems not only need to understand their current environment, but should also be able to predict future actions conditioned on past states, for instance based on captured camera frames. However, existing models mainly focus on forecasting future video frames for short time-horizons, hence being of limited use for long-term action planning. We propose Multi-Scale Hierarchical Prediction (MSPred), a novel video prediction model able to simultaneously forecast future possible outcomes of different levels of granularity at different spatio-temporal scales. By combining spatial and temporal downsampling, MSPred efficiently predicts abstract representations such as human poses or locations over long time horizons, while still maintaining a competitive performance for video frame prediction. In our experiments, we demonstrate that MSPred accurately predicts future video frames as well as high-level representations (e.g. keypoints or semantics) on bin-picking and action recognition datasets, while consistently outperforming popular approaches for future frame prediction. Furthermore, we ablate different modules and design choices in MSPred, experimentally validating that combining features of different spatial and temporal granularity leads to a superior performance. Code and models to reproduce our experiments can be found in https://github.com/AIS-Bonn/MSPred.

PDF Abstract

Code

Add Remove Mark official

AIS-Bonn/MSPred official

Tasks

Add Remove

Video Prediction

Datasets

MNIST

Perceptual Similarity

KTH

Moving MNIST

SynPick

Results from the Paper

Edit

Ranked #1 on Video Prediction on KTH (LPIPS metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Prediction	KTH	MSPred	LPIPS	0.029	# 1	Compare
			PSNR	27.81	# 10	Compare
			SSIM	0.951	# 1	Compare
			MSE	23.18	# 1	Compare
Video Prediction	Moving MNIST	MSPred	MSE	34.44	# 19	Compare
			SSIM	0.975	# 1	Compare
			LPIPS	0.024	# 1	Compare
			PSNR	26.82	# 1	Compare
Video Prediction	SynpickVP	MSPred	MSE	53.09	# 3	Compare
			PSNR	27.89	# 1	Compare
			SSIM	0.881	# 3	Compare
			LPIPS	0.033	# 1	Compare

Methods

Add Remove

ConvLSTM • Convolution • Sigmoid Activation • Tanh Activation

Edit Social Preview

MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove