Keypoint Detection

150 papers with code • 7 benchmarks • 11 datasets

Keypoint Detection involves simultaneously detecting people and localizing their keypoints. Keypoints, also called interest points, are spatial locations in the image that mark what is interesting or what stands out. Ideally, they are invariant to image transformations such as rotation, scaling (shrinkage), translation, and distortion, so that the same points are found again after the image has been transformed.
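As an illustration of what "localizing keypoints" can mean in practice, the sketch below assumes a heatmap-based detector: the model outputs one score map per keypoint type, and each keypoint is read off as the location of the map's peak. This is one common decoding scheme, not the method of any particular model listed on this page; the function name `decode_heatmaps` and the toy values are hypothetical.

```python
import numpy as np

def decode_heatmaps(heatmaps):
    """Return peak (x, y) coordinates and peak scores for K heatmaps.

    heatmaps: array of shape (K, H, W), one score map per keypoint type
    (an assumed output format, chosen for illustration).
    """
    heatmaps = np.asarray(heatmaps, dtype=float)
    k, h, w = heatmaps.shape
    flat = heatmaps.reshape(k, -1)
    idx = flat.argmax(axis=1)             # flat index of each map's peak
    scores = flat[np.arange(k), idx]      # peak value, usable as a confidence
    ys, xs = np.unravel_index(idx, (h, w))
    coords = np.stack([xs, ys], axis=1)   # shape (K, 2), ordered as (x, y)
    return coords, scores

# Toy input: two 5x7 maps with a single peak each (hypothetical values).
toy = np.zeros((2, 5, 7))
toy[0, 1, 4] = 0.9   # keypoint 0 peaks at x=4, y=1
toy[1, 3, 2] = 0.6   # keypoint 1 peaks at x=2, y=3
coords, scores = decode_heatmaps(toy)
print(coords.tolist(), scores.tolist())
```

Taking a plain argmax gives integer pixel positions at heatmap resolution; real systems typically refine the peak and rescale it to the input image, which this sketch leaves out.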

(Image credit: PifPaf: Composite Fields for Human Pose Estimation; "Learning to surf" by fotologic, license: CC-BY-2.0)

Benchmarks

These leaderboards are used to track progress in Keypoint Detection.

Dataset	Best Model
MS COCO	4xRSN-50(384×288)
COCO test-dev	HRNet*
MPII Multi-Person	AlphaPose
OCHuman	MIPNet (HRNet-W48)
COCO test-challenge	Simple Base+*
Pascal3D+	ConvNet + deformable shape model
ApolloCar3D	GSNet
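The page does not say how these leaderboards are scored. For the COCO keypoint benchmarks, the standard evaluation reports average precision based on object keypoint similarity (OKS), which compares predicted and ground-truth keypoints while normalizing by object size. The sketch below is a minimal version of that similarity for a single person; the function name `oks`, the three-keypoint layout, and the per-keypoint constants in `sigmas` are hypothetical values chosen for illustration, not the official COCO constants.

```python
import numpy as np

def oks(pred, gt, visible, area, sigmas):
    """Object keypoint similarity between one prediction and one ground truth.

    pred, gt: (K, 2) arrays of (x, y) positions.
    visible:  (K,) flags; keypoints with flag 0 are unlabeled and ignored.
    area:     ground-truth object area in pixels squared.
    sigmas:   (K,) per-keypoint falloff constants (hypothetical here).
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    visible = np.asarray(visible)
    sigmas = np.asarray(sigmas, dtype=float)
    d2 = ((pred - gt) ** 2).sum(axis=1)   # squared pixel distances
    k = 2.0 * sigmas
    e = d2 / (2.0 * area * k ** 2)        # distance normalized by size
    labeled = visible > 0
    if not labeled.any():
        return 0.0
    return float(np.exp(-e[labeled]).mean())

# Toy example with three keypoints; the third one is unlabeled.
gt = np.array([[10.0, 10.0], [20.0, 20.0], [30.0, 30.0]])
visible = np.array([2, 2, 0])
sigmas = np.array([0.05, 0.05, 0.05])    # hypothetical constants
area = 400.0
perfect = oks(gt, gt, visible, area, sigmas)
shifted = oks(gt + [1.0, 0.0], gt, visible, area, sigmas)
print(perfect, shifted)
```

A perfect prediction scores 1, and the score decays toward 0 as keypoints move away from the ground truth, more slowly for larger objects. Other datasets in the table may use different metrics.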

Libraries

Use these libraries to find Keypoint Detection models and implementations

open-mmlab/mmpose • 12 papers • 4,957 stars
osmr/imgclsmob • 6 papers • 2,917 stars
PaddlePaddle/PaddleDetection • 5 papers • 12,022 stars
CMU-Perceptual-Computing-Lab/openpo… • 3 papers • 29,793 stars

See all 10 libraries.

Most implemented papers

Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters

axelBarroso/Key.Net • ICCV 2019

We introduce a novel approach to the keypoint detection task that combines handcrafted and learned CNN filters within a shallow multi-scale architecture.

GLAMpoints: Greedily Learned Accurate Match points

PruneTruong/GLAMpoints_pytorch • ICCV 2019

We introduce a novel CNN-based feature point detector - GLAMpoints - learned in a semi-supervised manner.

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

ethnhe/PVN3D • CVPR 2020

Our method is a natural extension of 2D-keypoint approaches that successfully work on RGB based 6DoF estimation.

Improving Convolutional Networks With Self-Calibrated Convolutions

MCG-NKU/SCNet • CVPR 2020

Recent advances on CNNs are mostly devoted to designing more complex architectures to enhance their representation learning capacity.

HoughNet: Integrating near and long-range evidence for visual detection

giddyyupp/coco-minitrain • 14 Apr 2021

This paper presents HoughNet, a one-stage, anchor-free, voting-based, bottom-up object detection method.

RegionViT: Regional-to-Local Attention for Vision Transformers

IBM/RegionViT • ICLR 2022

The regional-to-local attention includes two steps: first, the regional self-attention extracts global information among all regional tokens; then, the local self-attention exchanges information between one regional token and its associated local tokens.

Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation

idea-research/ed-pose • 3 Feb 2023

This paper presents ED-Pose, a novel end-to-end framework with Explicit box Detection for multi-person Pose estimation, which unifies the contextual learning between human-level (global) and keypoint-level (local) information.

PifPaf: Composite Fields for Human Pose Estimation

thanhtrung98/human_pose_estimation • CVPR 2019

We propose a new bottom-up method for multi-person 2D human pose estimation that is particularly well suited for urban mobility such as self-driving cars and delivery robots.

Pose Neural Fabrics Search

yangsenius/PoseNFS • 16 Sep 2019

Neural Architecture Search (NAS) technologies have emerged in many domains to jointly learn the architectures and weights of the neural network.

R2D2: Reliable and Repeatable Detector and Descriptor

naver/r2d2 • NeurIPS 2019

We thus propose to jointly learn keypoint detection and description together with a predictor of the local descriptor discriminativeness.
