TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Human action generation	Human3.6M	Deep Video Generation, Prediction and Completion of Human Action Sequences	MMDa	0.419	# 5
Human action generation	Human3.6M	Deep Video Generation, Prediction and Completion of Human Action Sequences	MMDs	0.436	# 5
Human action generation	NTU RGB+D 2D	SkeletonGAN	MMDa (CS)	0.698	# 5
Human action generation	NTU RGB+D 2D	SkeletonGAN	MMDs (CS)	0.788	# 5
Human action generation	NTU RGB+D 2D	SkeletonGAN	MMDa (CV)	0.999	# 5
Human action generation	NTU RGB+D 2D	SkeletonGAN	MMDs (CV)	1.311	# 5

Deep Video Generation, Prediction and Completion of Human Action Sequences

ECCV 2018 · Haoye Cai, Chunyan Bai, Yu-Wing Tai, Chi-Keung Tang

Current deep learning results on video generation are limited, there are only a few early results on video prediction, and there are no significant results on video completion. This is due to the severe ill-posedness inherent in these three problems. In this paper, we focus on human action videos and propose a general, two-stage deep framework that generates human action videos under no constraints or an arbitrary number of constraints, and thus uniformly addresses the three problems: video generation given no input frames, video prediction given the first few frames, and video completion given the first and last frames. To make the problem tractable, in the first stage we train a deep generative model that generates a human pose sequence from random noise. In the second stage, we train a skeleton-to-image network that generates a human action video from the complete human pose sequence produced in the first stage. This two-stage strategy sidesteps the original ill-posed problems while producing, for the first time, high-quality video generation, prediction, and completion results of much longer duration. We present quantitative and qualitative evaluations showing that our two-stage approach outperforms state-of-the-art methods in video generation, prediction, and completion. Our video results can be viewed at https://iamacewhite.github.io/supp/index.html
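
The two-stage structure described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the function names (`generate_pose_sequence`, `render_frame`, `generate_video`) are hypothetical, the "pose generator" is plain random noise rather than a trained generative model, the "skeleton-to-image network" is a trivial rasteriser, and imposing constraints by copying the given poses into the sequence is an assumption made only for illustration. It shows just how generation, prediction, and completion differ in which frames are constrained.

```python
import random
from typing import Dict, List, Optional, Tuple

Pose = List[float]         # flattened (x, y) joint coordinates in [0, 1]
Frame = List[List[float]]  # toy "image": a small grid of pixel intensities


def generate_pose_sequence(
    length: int,
    num_joints: int,
    constraints: Optional[Dict[int, Pose]] = None,
    seed: int = 0,
) -> List[Pose]:
    """Stage 1 stand-in: build a pose sequence from random noise.

    Constrained time steps are copied as given (an assumption of this
    sketch); a trained generative model would replace the noise sampling.
    """
    rng = random.Random(seed)
    constraints = constraints or {}
    sequence = []
    for t in range(length):
        if t in constraints:
            sequence.append(list(constraints[t]))
        else:
            sequence.append([rng.uniform(0.0, 1.0) for _ in range(2 * num_joints)])
    return sequence


def render_frame(pose: Pose, size: int = 8) -> Frame:
    """Stage 2 stand-in: draw each joint as one lit pixel on a size x size grid."""
    frame = [[0.0] * size for _ in range(size)]
    for x, y in zip(pose[0::2], pose[1::2]):
        col = max(0, min(int(x * size), size - 1))
        row = max(0, min(int(y * size), size - 1))
        frame[row][col] = 1.0
    return frame


def generate_video(
    length: int,
    num_joints: int,
    constraints: Optional[Dict[int, Pose]] = None,
    seed: int = 0,
) -> Tuple[List[Pose], List[Frame]]:
    """Run stage 1 (poses), then stage 2 (one frame per pose)."""
    poses = generate_pose_sequence(length, num_joints, constraints, seed)
    return poses, [render_frame(p) for p in poses]


first_pose = [0.5, 0.5, 0.5, 0.5]     # hypothetical 2-joint pose
last_pose = [0.25, 0.25, 0.25, 0.25]  # hypothetical 2-joint pose

generation = generate_video(6, 2)                                   # no input frames
prediction = generate_video(6, 2, {0: first_pose})                  # first frame given
completion = generate_video(6, 2, {0: first_pose, 5: last_pose})    # first and last given
```

In this sketch the three tasks share one code path and differ only in the `constraints` argument, which mirrors the abstract's framing of a single framework with zero or more constraints.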

Code

No code implementations yet.

Tasks

Human action generation

Video Generation

Video Prediction

Datasets

Human3.6M

NTU RGB+D 2D

Results from the Paper

Ranked #5 on Human action generation on NTU RGB+D 2D

Methods

No methods listed for this paper.
