TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Generation	BAIR Robot Pushing	SAVP (from FVD)	FVD score	116.4	# 12
Video Generation	BAIR Robot Pushing	SAVP (from FVD)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SAVP (from FVD)	Pred	14	# 2
Video Generation	BAIR Robot Pushing	SAVP (from FVD)	Train	14	# 12
Video Generation	BAIR Robot Pushing	SAVP-VAE (from WAM)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SAVP-VAE (from WAM)	SSIM	0.815	# 9
Video Generation	BAIR Robot Pushing	SAVP-VAE (from WAM)	PSNR	19.09	# 6
Video Generation	BAIR Robot Pushing	SAVP-VAE (from WAM)	Pred	28	# 20
Video Generation	BAIR Robot Pushing	SAVP-VAE (from WAM)	Train	14	# 12
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	FVD score	152±9	# 19
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	SSIM	0.7887±0.0092	# 12
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	PSNR	18.44±0.25	# 8
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	LPIPS	0.0634±0.0026	# 3
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	Pred	28	# 20
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	Train	12	# 18
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	FVD score	143.43	# 17
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	SSIM	0.795±0.07	# 11
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	LPIPS	0.062±0.03	# 4
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	Pred	28	# 20
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	Train	10	# 23
Video Prediction	KTH	SAVP (from Grid-keypoints)	LPIPS	0.126	# 9
Video Prediction	KTH	SAVP (from Grid-keypoints)	PSNR	23.79	# 28
Video Prediction	KTH	SAVP (from Grid-keypoints)	FVD	183.7	# 4
Video Prediction	KTH	SAVP (from Grid-keypoints)	SSIM	0.699	# 30
Video Prediction	KTH	SAVP (from Grid-keypoints)	Cond	10	# 1
Video Prediction	KTH	SAVP (from Grid-keypoints)	Pred	40	# 22
Video Prediction	KTH	SAVP (from Grid-keypoints)	Params (M)	17.6	# 6
Video Prediction	KTH	SAVP (from Grid-keypoints)	Train	10	# 1
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	LPIPS	0.116	# 7
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	PSNR	26.00	# 22
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	FVD	145.7	# 2
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	SSIM	0.806	# 17
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	Cond	10	# 1
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	Pred	40	# 22
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	Params (M)	7.3	# 3
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	Train	10	# 1
Video Prediction	KTH	SAVP-VAE	PSNR	27.77	# 11
Video Prediction	KTH	SAVP-VAE	SSIM	0.852	# 9
Video Prediction	KTH	SAVP-VAE	Cond	10	# 1
Video Prediction	KTH	SAVP-VAE	Pred	20	# 1
Video Prediction	KTH	SAVP (from SRVP)	LPIPS	0.1120±0.0039	# 6
Video Prediction	KTH	SAVP (from SRVP)	PSNR	26.51±0.29	# 19
Video Prediction	KTH	SAVP (from SRVP)	FVD	374 ± 3	# 9
Video Prediction	KTH	SAVP (from SRVP)	SSIM	0.7564±0.0062	# 27
Video Prediction	KTH	SAVP (from SRVP)	Cond	10	# 1
Video Prediction	KTH	SAVP (from SRVP)	Pred	30	# 17
Video Prediction	KTH	SAVP (from SRVP)	Train	10	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stochastic-adversarial-video-prediction/video-prediction-on-kth)](https://paperswithcode.com/sota/video-prediction-on-kth?p=stochastic-adversarial-video-prediction)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stochastic-adversarial-video-prediction/video-generation-on-bair-robot-pushing)](https://paperswithcode.com/sota/video-generation-on-bair-robot-pushing?p=stochastic-adversarial-video-prediction)`

Stochastic Adversarial Video Prediction

ICLR 2019 · Alex X. Lee, Richard Zhang, Frederik Ebert, Pieter Abbeel, Chelsea Finn, Sergey Levine ·

Being able to predict what may happen in the future requires an in-depth understanding of the physical and causal rules that govern the world. A model that is able to do so has a number of appealing applications, from robotic planning to representation learning. However, learning to predict raw future observations, such as frames in a video, is exceedingly challenging -- the ambiguous nature of the problem can cause a naively designed model to average together possible futures into a single, blurry prediction. Recently, this has been addressed by two distinct approaches: (a) latent variational variable models that explicitly model underlying stochasticity and (b) adversarially-trained models that aim to produce naturalistic images. However, a standard latent variable model can struggle to produce realistic results, and a standard adversarially-trained model underutilizes latent variables and fails to produce diverse predictions. We show that these distinct methods are in fact complementary. Combining the two produces predictions that look more realistic to human raters and better cover the range of possible futures. Our method outperforms prior and concurrent work in these aspects.

PDF Abstract ICLR 2019 PDF ICLR 2019 Abstract

Code

Add Remove Mark official

alexlee-gk/video_prediction official

300

MIT-Omnipush/video-prediction

kamran0153/impact-of-data-freshness…

Bonennult/video_prediction

Tasks

Add Remove

Representation Learning

Video Generation

Video Prediction

Datasets

KTH BAIR Robot Pushing

Results from the Paper

Add Remove

Ranked #1 on Video Prediction on KTH (Cond metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Generation	BAIR Robot Pushing	SAVP (from FVD)	FVD score	116.4	# 12	Compare
			Cond	2	# 13	Compare
			Pred	14	# 2	Compare
			Train	14	# 12	Compare
Video Generation	BAIR Robot Pushing	SAVP-VAE (from WAM)	Cond	2	# 13	Compare
			SSIM	0.815	# 9	Compare
			PSNR	19.09	# 6	Compare
			Pred	28	# 20	Compare
			Train	14	# 12	Compare
Video Generation	BAIR Robot Pushing	SAVP (from SRVP)	FVD score	152±9	# 19	Compare
			Cond	2	# 13	Compare
			SSIM	0.7887±0.0092	# 12	Compare
			PSNR	18.44±0.25	# 8	Compare
			LPIPS	0.0634±0.0026	# 3	Compare
			Pred	28	# 20	Compare
			Train	12	# 18	Compare
Video Generation	BAIR Robot Pushing	SAVP (from vRNN)	FVD score	143.43	# 17	Compare
			Cond	2	# 13	Compare
			SSIM	0.795±0.07	# 11	Compare
			LPIPS	0.062±0.03	# 4	Compare
			Pred	28	# 20	Compare
			Train	10	# 23	Compare
Video Prediction	KTH	SAVP (from Grid-keypoints)	LPIPS	0.126	# 9	Compare
			PSNR	23.79	# 28	Compare
			FVD	183.7	# 4	Compare
			SSIM	0.699	# 30	Compare
			Cond	10	# 1	Compare
			Pred	40	# 22	Compare
			Params (M)	17.6	# 6	Compare
			Train	10	# 1	Compare
Video Prediction	KTH	SAVP-VAE (from Grid-keypoints)	LPIPS	0.116	# 7	Compare
			PSNR	26.00	# 22	Compare
			FVD	145.7	# 2	Compare
			SSIM	0.806	# 17	Compare
			Cond	10	# 1	Compare
			Pred	40	# 22	Compare
			Params (M)	7.3	# 3	Compare
			Train	10	# 1	Compare
Video Prediction	KTH	SAVP-VAE	PSNR	27.77	# 11	Compare
			SSIM	0.852	# 9	Compare
			Cond	10	# 1	Compare
			Pred	20	# 1	Compare
Video Prediction	KTH	SAVP (from SRVP)	LPIPS	0.1120±0.0039	# 6	Compare
			PSNR	26.51±0.29	# 19	Compare
			FVD	374 ± 3	# 9	Compare
			SSIM	0.7564±0.0062	# 27	Compare
			Cond	10	# 1	Compare
			Pred	30	# 17	Compare
			Train	10	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Stochastic Adversarial Video Prediction

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove