TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	3DPW	DST-VIBE	PA-MPJPE	50.3	# 55
3D Human Pose Estimation	3DPW	DST-VIBE	MPJPE	76.7	# 47
3D Human Pose Estimation	3DPW	DST-VIBE	MPVPE	93.5	# 44
3D Human Pose Estimation	3DPW	DST-VIBE	Acceleration Error	11	# 12
3D Human Pose Estimation	Human3.6M	DST-VIBE	Average MPJPE (mm)	60.5	# 247
3D Human Pose Estimation	Human3.6M	DST-VIBE	PA-MPJPE	39.3	# 49
3D Human Pose Estimation	Human3.6M	DST-VIBE	Acceleration Error	5	# 8
3D Human Pose Estimation	MPI-INF-3DHP	DST-VIBE	MPJPE	93.4	# 50
3D Human Pose Estimation	MPI-INF-3DHP	DST-VIBE	PA-MPJPE	62.2	# 8
3D Human Pose Estimation	MPI-INF-3DHP	DST-VIBE	Acceleration Error	11.9	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-two-stream-video-inference-for-human/3d-human-pose-estimation-on-3dpw)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-3dpw?p=deep-two-stream-video-inference-for-human)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-two-stream-video-inference-for-human/3d-human-pose-estimation-on-mpi-inf-3dhp)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-mpi-inf-3dhp?p=deep-two-stream-video-inference-for-human)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-two-stream-video-inference-for-human/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=deep-two-stream-video-inference-for-human)`

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

22 Oct 2021 · Ziwen Li, Bo Xu, Han Huang, Cheng Lu, Yandong Guo ·

Several video-based 3D pose and shape estimation algorithms have been proposed to resolve the temporal inconsistency of single-image-based methods. However it still remains challenging to have stable and accurate reconstruction. In this paper, we propose a new framework Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation (DTS-VIBE), to generate 3D human pose and mesh from RGB videos. We reformulate the task as a multi-modality problem that fuses RGB and optical flow for more reliable estimation. In order to fully utilize both sensory modalities (RGB or optical flow), we train a two-stream temporal network based on transformer to predict SMPL parameters. The supplementary modality, optical flow, helps to maintain temporal consistency by leveraging motion knowledge between two consecutive frames. The proposed algorithm is extensively evaluated on the Human3.6 and 3DPW datasets. The experimental results show that it outperforms other state-of-the-art methods by a significant margin.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

3D Human Pose Estimation

Optical Flow Estimation

Datasets

Human3.6M

3DPW

AMASS

MPI-INF-3DHP

Results from the Paper

Edit

Ranked #44 on 3D Human Pose Estimation on 3DPW

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	3DPW	DST-VIBE	PA-MPJPE	50.3	# 55	Compare
			MPJPE	76.7	# 47	Compare
			MPVPE	93.5	# 44	Compare
			Acceleration Error	11	# 12	Compare
3D Human Pose Estimation	Human3.6M	DST-VIBE	Average MPJPE (mm)	60.5	# 247	Compare
			PA-MPJPE	39.3	# 49	Compare
			Acceleration Error	5	# 8	Compare
3D Human Pose Estimation	MPI-INF-3DHP	DST-VIBE	MPJPE	93.4	# 50	Compare
			PA-MPJPE	62.2	# 8	Compare
			Acceleration Error	11.9	# 10	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove