TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Keypoint Detection	COCO test-dev	DirectPose (ResNet-101)	APL	71.5	# 12
Keypoint Detection	COCO test-dev	DirectPose (ResNet-101)	APM	60.4	# 13
Keypoint Detection	COCO test-dev	DirectPose (ResNet-101)	AP50	87.8	# 8
Keypoint Detection	COCO test-dev	DirectPose (ResNet-101)	AP75	71.1	# 10
Keypoint Detection	COCO test-dev	DirectPose (ResNet-101)	AP	64.8	# 5
Pose Estimation	COCO test-dev	DirectPose (ResNet-101)	AP	63.3	# 41
Pose Estimation	COCO test-dev	DirectPose (ResNet-101)	AP50	86.7	# 36
Pose Estimation	COCO test-dev	DirectPose (ResNet-101)	AP75	69.4	# 38
Pose Estimation	COCO test-dev	DirectPose (ResNet-101)	APL	71.2	# 34
Pose Estimation	COCO test-dev	DirectPose (ResNet-101)	APM	57.8	# 35

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/directpose-direct-end-to-end-multi-person/keypoint-detection-on-coco-test-dev)](https://paperswithcode.com/sota/keypoint-detection-on-coco-test-dev?p=directpose-direct-end-to-end-multi-person)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/directpose-direct-end-to-end-multi-person/pose-estimation-on-coco-test-dev)](https://paperswithcode.com/sota/pose-estimation-on-coco-test-dev?p=directpose-direct-end-to-end-multi-person)`

DirectPose: Direct End-to-End Multi-Person Pose Estimation

18 Nov 2019 · Zhi Tian, Hao Chen, Chunhua Shen ·

We propose the first direct end-to-end multi-person pose estimation framework, termed DirectPose. Inspired by recent anchor-free object detectors, which directly regress the two corners of target bounding-boxes, the proposed framework directly predicts instance-aware keypoints for all the instances from a raw input image, eliminating the need for heuristic grouping in bottom-up methods or bounding-box detection and RoI operations in top-down ones. We also propose a novel Keypoint Alignment (KPAlign) mechanism, which overcomes the main difficulty: lack of the alignment between the convolutional features and predictions in this end-to-end framework. KPAlign improves the framework's performance by a large margin while still keeping the framework end-to-end trainable. With the only postprocessing non-maximum suppression (NMS), our proposed framework can detect multi-person keypoints with or without bounding-boxes in a single shot. Experiments demonstrate that the end-to-end paradigm can achieve competitive or better performance than previous strong baselines, in both bottom-up and top-down methods. We hope that our end-to-end approach can provide a new perspective for the human pose estimation task.

PDF Abstract

Code

Add Remove Mark official

aim-uofa/adet

3,325

aim-uofa/AdelaiDet

3,325

IDEA-Research/UniPose

228

Pxtri2156/AdelaiDet_v2

blueardour/AdelaiDet

See all 8 implementations

Tasks

Add Remove

Multi-Person Pose Estimation

Pose Estimation

Datasets

MS COCO

Results from the Paper

Edit

Ranked #13 on Keypoint Detection on COCO test-dev

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Keypoint Detection	COCO test-dev	DirectPose (ResNet-101)	APL	71.5	# 12	Compare
			APM	60.4	# 13	Compare
			AP50	87.8	# 8	Compare
			AP75	71.1	# 10	Compare
			AP	64.8	# 5	Compare
Pose Estimation	COCO test-dev	DirectPose (ResNet-101)	AP	63.3	# 41	Compare
			AP50	86.7	# 36	Compare
			AP75	69.4	# 38	Compare
			APL	71.2	# 34	Compare
			APM	57.8	# 35	Compare

Methods

Add Remove

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet

Edit Social Preview

DirectPose: Direct End-to-End Multi-Person Pose Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove