Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks

CVPR 2021  ·  Yu Cheng, Bo Wang, Bo Yang, Robby T. Tan

In monocular video 3D multi-person pose estimation, inter-person occlusion and close interactions can make human detection erroneous and human-joint grouping unreliable. Existing top-down methods rely on human detection and thus suffer from these problems. Existing bottom-up methods do not use human detection, but they process all persons at once at a single scale, making them sensitive to scale variations across persons. To address these challenges, we propose integrating the top-down and bottom-up approaches to exploit their complementary strengths. Our top-down network estimates the joints of all persons within an image patch rather than only one person's, making it robust to erroneous bounding boxes. Our bottom-up network incorporates human-detection-based normalized heatmaps, making it more robust to scale variations. The estimated 3D poses from the top-down and bottom-up networks are then fed into our integration network to produce the final 3D poses. Beyond this integration, and unlike existing pose discriminators that are designed for a single person and consequently cannot assess natural inter-person interactions, we propose a two-person pose discriminator that enforces natural two-person interactions. Lastly, we apply a semi-supervised method to overcome the scarcity of 3D ground-truth data. Our quantitative and qualitative evaluations show the effectiveness of our method compared to state-of-the-art baselines.
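
The abstract describes fusing per-person 3D pose estimates from the two branches via an integration network. Below is a minimal, hypothetical PyTorch sketch of that fusion step only; the names (IntegrationNet, NUM_JOINTS) and the MLP layout are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

NUM_JOINTS = 15  # assumption: a MuPoTS-style skeleton with 15 joints


class IntegrationNet(nn.Module):
    """Hypothetical fusion module: takes one person's 3D pose estimate
    from the top-down branch and one from the bottom-up branch, and
    regresses a final fused 3D pose."""

    def __init__(self, num_joints: int = NUM_JOINTS):
        super().__init__()
        in_dim = 2 * num_joints * 3  # concatenated (x, y, z) from both branches
        self.num_joints = num_joints
        self.mlp = nn.Sequential(
            nn.Linear(in_dim, 256),
            nn.ReLU(),
            nn.Linear(256, num_joints * 3),
        )

    def forward(self, pose_td: torch.Tensor, pose_bu: torch.Tensor) -> torch.Tensor:
        # pose_td, pose_bu: (batch, num_joints, 3) per-person 3D estimates
        x = torch.cat([pose_td.flatten(1), pose_bu.flatten(1)], dim=1)
        return self.mlp(x).view(-1, self.num_joints, 3)


# Toy usage: fuse stand-in branch outputs for a batch of 4 detected persons.
net = IntegrationNet()
pose_td = torch.randn(4, NUM_JOINTS, 3)  # placeholder top-down output
pose_bu = torch.randn(4, NUM_JOINTS, 3)  # placeholder bottom-up output
final_pose = net(pose_td, pose_bu)       # shape: (4, 15, 3)
```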

Task                                             Dataset    Model     Metric  Value  Global Rank
3D Multi-Person Pose Estimation (root-relative)  MuPoTS-3D  TDBU_Net  3DPCK   89.6   #1
3D Multi-Person Pose Estimation (absolute)       MuPoTS-3D  TDBU_Net  3DPCK   48.0   #2
