Greedy Offset-Guided Keypoint Grouping for Human Pose Estimation

7 Jul 2021  ·  Jia Li, Linhua Xiang, Jiwei Chen, Zengfu Wang ·

We propose a simple yet reliable bottom-up approach with a good trade-off between accuracy and efficiency for the problem of multi-person pose estimation. Given an image, we employ an Hourglass Network to infer all the keypoints from different persons indiscriminately as well as the guiding offsets connecting the adjacent keypoints belonging to the same persons. Then, we greedily group the candidate keypoints into multiple human poses (if any), utilizing the predicted guiding offsets. And we refer to this process as greedy offset-guided keypoint grouping (GOG). Moreover, we revisit the encoding-decoding method for the multi-person keypoint coordinates and reveal some important facts affecting accuracy. Experiments have demonstrated the obvious performance improvements brought by the introduced components. Our approach is comparable to the state of the art on the challenging COCO dataset under fair conditions. The source code and our pre-trained model are publicly available online.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Multi-Person Pose Estimation COCO test-dev Hourglass-104 AP 65.6 # 12
APL 68.8 # 9
APM 63.3 # 7
Pose Estimation CrowdPose Hourglass-104 AP 65.2 # 10
AP50 85.9 # 4
AP75 69.5 # 4
APM 66.2 # 5
Multi-Person Pose Estimation CrowdPose Hourglass-104 mAP @0.5:0.95 65.2 # 15
AP Easy 73.8 # 13
AP Medium 66.2 # 14
AP Hard 54.8 # 13
FPS 14.7 (21.4) # 2

Methods


No methods listed for this paper. Add relevant methods here