TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	FVD score	256.62	# 24
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	SSIM	0.816±0.07	# 8
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	LPIPS	0.061±0.03	# 5
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	Pred	28	# 20
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	Train	10	# 23
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	FVD score	255±4	# 23
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	SSIM	0.8058±0.0088	# 10
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	PSNR	18.95±0.26	# 7
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	LPIPS	0.0609±0.0034	# 6
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	Pred	28	# 20
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	Train	12	# 18
Video Generation	BAIR Robot Pushing	SVG-FP (from FVD)	FVD score	315.5	# 27
Video Generation	BAIR Robot Pushing	SVG-FP (from FVD)	Cond	2	# 13
Video Generation	BAIR Robot Pushing	SVG-FP (from FVD)	Pred	14	# 2
Video Generation	BAIR Robot Pushing	SVG-FP (from FVD)	Train	14	# 12
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	FVD	1300.26	# 3
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	SSIM	0.574±0.08	# 5
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	LPIPS	0.549 ± 0.06	# 5
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	Cond.	2	# 1
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	Pred	28	# 3
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	Train	10	# 1
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	LPIPS	0.129	# 10
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	PSNR	23.91	# 27
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	FVD	157.9	# 3
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	SSIM	0.800	# 20
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	Cond	10	# 1
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	Pred	40	# 22
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	Params (M)	22.8	# 7
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	Train	10	# 1
Video Prediction	KTH	SVG-LP (from SRVP)	LPIPS	0.0923±0.0038	# 5
Video Prediction	KTH	SVG-LP (from SRVP)	PSNR	28.06±0.29	# 9
Video Prediction	KTH	SVG-LP (from SRVP)	FVD	377 ± 6	# 10
Video Prediction	KTH	SVG-LP (from SRVP)	SSIM	0.8438±0.0054	# 10
Video Prediction	KTH	SVG-LP (from SRVP)	Cond	10	# 1
Video Prediction	KTH	SVG-LP (from SRVP)	Pred	30	# 17
Video Prediction	KTH	SVG-LP (from SRVP)	Train	10	# 1
Video Prediction	SynpickVP	SVG-Det	MSE	60.60	# 5
Video Prediction	SynpickVP	SVG-Det	PSNR	26.92	# 3
Video Prediction	SynpickVP	SVG-Det	SSIM	0.879	# 4
Video Prediction	SynpickVP	SVG-Det	LPIPS	0.068	# 5
Video Prediction	SynpickVP	SVG-LP	MSE	51.82	# 2
Video Prediction	SynpickVP	SVG-LP	PSNR	27..38	# 5
Video Prediction	SynpickVP	SVG-LP	SSIM	0.886	# 2
Video Prediction	SynpickVP	SVG-LP	LPIPS	0.066	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stochastic-video-generation-with-a-learned/video-prediction-on-cityscapes-128x128)](https://paperswithcode.com/sota/video-prediction-on-cityscapes-128x128?p=stochastic-video-generation-with-a-learned)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stochastic-video-generation-with-a-learned/video-prediction-on-kth)](https://paperswithcode.com/sota/video-prediction-on-kth?p=stochastic-video-generation-with-a-learned)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stochastic-video-generation-with-a-learned/video-prediction-on-synpickvp)](https://paperswithcode.com/sota/video-prediction-on-synpickvp?p=stochastic-video-generation-with-a-learned)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stochastic-video-generation-with-a-learned/video-generation-on-bair-robot-pushing)](https://paperswithcode.com/sota/video-generation-on-bair-robot-pushing?p=stochastic-video-generation-with-a-learned)`

Stochastic Video Generation with a Learned Prior

ICML 2018 · Emily Denton, Rob Fergus ·

Generating video frames that accurately predict future world states is challenging. Existing approaches either fail to capture the full distribution of outcomes, or yield blurry generations, or both. In this paper we introduce an unsupervised video generation model that learns a prior model of uncertainty in a given environment. Video frames are generated by drawing samples from this prior and combining them with a deterministic estimate of the future frame. The approach is simple and easily trained end-to-end on a variety of datasets. Sample generations are both varied and sharp, even many frames into the future, and compare favorably to those from existing approaches.

PDF Abstract ICML 2018 PDF ICML 2018 Abstract

Code

Add Remove Mark official

edenton/svg official

182

joelouismarino/amortized-variationa…

MIT-Omnipush/video-prediction

Tasks

Add Remove

Video Generation

Video Prediction

Datasets

Cityscapes

KTH

Moving MNIST BAIR Robot Pushing

SynPick

Results from the Paper

Add Remove

Ranked #3 on Video Prediction on KTH

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Generation	BAIR Robot Pushing	SVG-LP (from vRNN)	FVD score	256.62	# 24	Compare
			Cond	2	# 13	Compare
			SSIM	0.816±0.07	# 8	Compare
			LPIPS	0.061±0.03	# 5	Compare
			Pred	28	# 20	Compare
			Train	10	# 23	Compare
Video Generation	BAIR Robot Pushing	SVG (from SRVP)	FVD score	255±4	# 23	Compare
			Cond	2	# 13	Compare
			SSIM	0.8058±0.0088	# 10	Compare
			PSNR	18.95±0.26	# 7	Compare
			LPIPS	0.0609±0.0034	# 6	Compare
			Pred	28	# 20	Compare
			Train	12	# 18	Compare
Video Generation	BAIR Robot Pushing	SVG-FP (from FVD)	FVD score	315.5	# 27	Compare
			Cond	2	# 13	Compare
			Pred	14	# 2	Compare
			Train	14	# 12	Compare
Video Prediction	Cityscapes 128x128	SVG (from Hier-VRNN)	FVD	1300.26	# 3	Compare
			SSIM	0.574±0.08	# 5	Compare
			LPIPS	0.549 ± 0.06	# 5	Compare
			Cond.	2	# 1	Compare
			Pred	28	# 3	Compare
			Train	10	# 1	Compare
Video Prediction	KTH	SVG-LP (from Grid-keypoints)	LPIPS	0.129	# 10	Compare
			PSNR	23.91	# 27	Compare
			FVD	157.9	# 3	Compare
			SSIM	0.800	# 20	Compare
			Cond	10	# 1	Compare
			Pred	40	# 22	Compare
			Params (M)	22.8	# 7	Compare
			Train	10	# 1	Compare
Video Prediction	KTH	SVG-LP (from SRVP)	LPIPS	0.0923±0.0038	# 5	Compare
			PSNR	28.06±0.29	# 9	Compare
			FVD	377 ± 6	# 10	Compare
			SSIM	0.8438±0.0054	# 10	Compare
			Cond	10	# 1	Compare
			Pred	30	# 17	Compare
			Train	10	# 1	Compare
Video Prediction	SynpickVP	SVG-Det	MSE	60.60	# 5	Compare
			PSNR	26.92	# 3	Compare
			SSIM	0.879	# 4	Compare
			LPIPS	0.068	# 5	Compare
Video Prediction	SynpickVP	SVG-LP	MSE	51.82	# 2	Compare
			PSNR	27..38	# 5	Compare
			SSIM	0.886	# 2	Compare
			LPIPS	0.066	# 4	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Stochastic Video Generation with a Learned Prior

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove