TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Prediction	KTH	LMC	LPIPS	159.8	# 17
Video Prediction	KTH	LMC	PSNR	27.5	# 14
Video Prediction	KTH	LMC	SSIM	0.879	# 3
Video Prediction	KTH	LMC	Cond	10	# 1
Video Prediction	KTH	LMC	Pred	40	# 22
Video Prediction	Moving MNIST	LMC	MSE	41.5	# 22
Video Prediction	Moving MNIST	LMC	SSIM	0.924	# 16
Video Prediction	Moving MNIST	LMC	LPIPS	0.047	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-prediction-recalling-long-term-motion/video-prediction-on-kth)](https://paperswithcode.com/sota/video-prediction-on-kth?p=video-prediction-recalling-long-term-motion)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-prediction-recalling-long-term-motion/video-prediction-on-moving-mnist)](https://paperswithcode.com/sota/video-prediction-on-moving-mnist?p=video-prediction-recalling-long-term-motion)`

Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning

CVPR 2021 · Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro

Our work addresses long-term motion context issues in predicting future frames. To predict the future precisely, it is necessary to capture which long-term motion context (e.g., walking or running) the input motion (e.g., leg movement) belongs to. Two bottlenecks arise when dealing with long-term motion context: (i) how to predict a long-term motion context that naturally matches input sequences with limited dynamics, and (ii) how to predict a long-term motion context with high dimensionality (e.g., complex motion). To address these issues, we propose a novel motion context-aware video prediction method. To solve bottleneck (i), we introduce a long-term motion context memory (LMC-Memory) with memory alignment learning. The proposed memory alignment learning makes it possible to store long-term motion contexts in the memory and to match them with sequences that include limited dynamics. As a result, the long-term context can be recalled from the limited input sequence. In addition, to resolve bottleneck (ii), we propose memory query decomposition, which stores local motion context (i.e., low-dimensional dynamics) and recalls the suitable local context for each local part of the input individually. This boosts the alignment effects of the memory. Experimental results show that the proposed method outperforms other sophisticated RNN-based methods, especially in long-term conditions. Further, we validate the effectiveness of the proposed network designs by conducting ablation studies and memory feature analysis. The source code of this work is available.
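The recall step described in the abstract can be illustrated with a minimal sketch. It assumes a generic soft-attention memory read (dot-product scores followed by a softmax), which the abstract does not confirm as the authors' exact formulation. The function names, the slot count, and the feature sizes are hypothetical and chosen only for illustration. The official implementation is in the sangmin-git/LMC-Memory repository.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def decompose_query(feature_map):
    # Hypothetical stand-in for memory query decomposition:
    # split an (H, W, C) motion feature map into H*W local queries of size C.
    h, w, c = feature_map.shape
    return feature_map.reshape(h * w, c)


def recall_from_memory(queries, memory):
    # queries: (num_parts, dim), memory: (num_slots, dim).
    # Assumed addressing scheme: dot-product similarity + softmax over slots.
    scores = queries @ memory.T            # (num_parts, num_slots)
    weights = softmax(scores, axis=-1)     # each row sums to 1
    recalled = weights @ memory            # (num_parts, dim)
    return recalled, weights


rng = np.random.default_rng(0)
memory = rng.standard_normal((100, 16))        # 100 slots, 16-dim (illustrative sizes)
feature_map = rng.standard_normal((4, 4, 16))  # motion features from a short input clip
queries = decompose_query(feature_map)
recalled, weights = recall_from_memory(queries, memory)
print(recalled.shape, weights.shape)
```

Each local query reads its own weighted combination of memory slots, so a different local context can be recalled for each part of the input. How the slots are trained (the memory alignment learning itself) is not covered by this sketch.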

Code

sangmin-git/LMC-Memory official

Tasks

Video Prediction

Datasets

MNIST

Human3.6M

KTH

Moving MNIST

Results from the Paper

Ranked #1 on Video Prediction on KTH (Cond metric)
