Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video

Learning to capture human motion is essential to 3D human pose and shape estimation from monocular video. However, existing methods mainly rely on recurrent or convolutional operations to model such temporal information, which limits their ability to capture non-local context relations of human motion. To address this problem, we propose a motion pose and shape network (MPS-Net) to effectively capture humans in motion and estimate accurate, temporally coherent 3D human pose and shape from a video. Specifically, we first propose a motion continuity attention (MoCA) module that leverages visual cues observed from human motion to adaptively recalibrate the range that needs attention in the sequence, so as to better capture the motion continuity dependencies. Then, we develop a hierarchical attentive feature integration (HAFI) module that effectively combines adjacent past and future feature representations to strengthen temporal correlation and refine the feature representation of the current frame. By coupling the MoCA and HAFI modules, the proposed MPS-Net excels at estimating 3D human pose and shape in video. Though conceptually simple, our MPS-Net not only outperforms the state-of-the-art methods on the 3DPW, MPI-INF-3DHP, and Human3.6M benchmark datasets, but also uses fewer network parameters. The video demos can be found at https://mps-net.github.io/MPS-Net/.
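For readers who prefer code, below is a minimal, hypothetical sketch of the two ideas the abstract describes: a non-local temporal attention block with a learned gate standing in for MoCA's adaptive recalibration of the attention range, and a small attentive fusion of each frame with its immediate past and future neighbours standing in for HAFI. The module names, dimensions, window size, and the MoCA-then-HAFI ordering are assumptions for illustration; this is not the official MPS-Net implementation.

```python
# Sketch only, NOT the official MPS-Net code. Assumes per-frame features of
# shape (B, T, D) from some backbone; single-head attention; 3-frame window.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoCASketch(nn.Module):
    """Non-local self-attention over time, with a gated residual update that
    crudely mimics recalibrating how strongly non-local context is used."""
    def __init__(self, dim):
        super().__init__()
        self.q = nn.Linear(dim, dim)
        self.k = nn.Linear(dim, dim)
        self.v = nn.Linear(dim, dim)
        self.gate = nn.Linear(dim, dim)  # stand-in for the adaptive attention range

    def forward(self, x):                # x: (B, T, D)
        attn = torch.softmax(
            self.q(x) @ self.k(x).transpose(1, 2) / x.size(-1) ** 0.5, dim=-1)
        out = attn @ self.v(x)           # non-local temporal context
        return x + torch.sigmoid(self.gate(x)) * out

class HAFISketch(nn.Module):
    """Attentively fuse each frame with its immediate past and future frames."""
    def __init__(self, dim):
        super().__init__()
        self.score = nn.Linear(dim, 1)

    def forward(self, x):                # x: (B, T, D)
        pad = F.pad(x, (0, 0, 1, 1))     # zero-pad one frame on each temporal side
        windows = torch.stack([pad[:, :-2], pad[:, 1:-1], pad[:, 2:]], dim=2)  # (B, T, 3, D)
        w = torch.softmax(self.score(windows), dim=2)   # per-neighbour weights
        return (w * windows).sum(dim=2)  # refined per-frame feature

# Usage: a batch of 2 clips, 16 frames, 256-d features per frame.
x = torch.randn(2, 16, 256)
y = HAFISketch(256)(MoCASketch(256)(x))
print(y.shape)                           # torch.Size([2, 16, 256])
```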


Results from the Paper


Task: 3D Human Pose Estimation — Model: MPS-Net (T=16)

Dataset      | Metric                       | Value | Global Rank
-------------|------------------------------|-------|------------
3DPW         | PA-MPJPE (mm)                | 52.1  | #67
3DPW         | MPJPE (mm)                   | 84.3  | #69
3DPW         | MPVPE (mm)                   | 99.7  | #52
3DPW         | Acceleration Error (mm/s²)   | 7.4   | #7
3DPW         | Number of parameters (M)     | 39.63 | #1
3DPW         | FLOPs (G)                    | 4.45  | #1
Human3.6M    | Average MPJPE (mm)           | 69.4  | #276
Human3.6M    | PA-MPJPE (mm)                | 47.4  | #87
Human3.6M    | Acceleration Error (mm/s²)   | 3.6   | #3
MPI-INF-3DHP | MPJPE (mm)                   | 96.7  | #59
MPI-INF-3DHP | PA-MPJPE (mm)                | 62.8  | #11
MPI-INF-3DHP | Acceleration Error (mm/s²)   | 9.6   | #7
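The metrics in the table follow the standard definitions used in the 3D human pose literature. The sketch below shows how MPJPE, PA-MPJPE, and acceleration error are commonly computed from predicted and ground-truth 3D joints; it reflects the usual conventions rather than MPS-Net's exact evaluation script, and the function names are illustrative.

```python
# Hedged sketch of the standard metric definitions (not the paper's exact code).
import numpy as np

def mpjpe(pred, gt):
    """Mean per-joint position error in mm. pred, gt: (T, J, 3) joint positions."""
    return np.linalg.norm(pred - gt, axis=-1).mean()

def pa_mpjpe(pred, gt):
    """MPJPE after per-frame rigid Procrustes alignment (scale, rotation, translation)."""
    errs = []
    for p, g in zip(pred, gt):
        p0, g0 = p - p.mean(0), g - g.mean(0)
        u, s, vt = np.linalg.svd(p0.T @ g0)
        if np.linalg.det((u @ vt).T) < 0:       # avoid reflections
            vt[-1] *= -1
            s[-1] *= -1
        r = (u @ vt).T                          # optimal rotation
        scale = s.sum() / (p0 ** 2).sum()       # optimal scale
        aligned = scale * p0 @ r.T + g.mean(0)
        errs.append(np.linalg.norm(aligned - g, axis=-1).mean())
    return float(np.mean(errs))

def accel_error(pred, gt):
    """Mean difference between predicted and ground-truth joint accelerations,
    approximated by second-order finite differences over frames."""
    acc_p = pred[:-2] - 2 * pred[1:-1] + pred[2:]
    acc_g = gt[:-2] - 2 * gt[1:-1] + gt[2:]
    return np.linalg.norm(acc_p - acc_g, axis=-1).mean()
```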
