Keypoint Detection
150 papers with code • 7 benchmarks • 11 datasets
Keypoint Detection involves simultaneously detecting people and localizing their keypoints. Keypoints are the same thing as interest points. They are spatial locations, or points in the image that define what is interesting or what stand out in the image. They are invariant to image rotation, shrinkage, translation, distortion, and so on.
( Image credit: PifPaf: Composite Fields for Human Pose Estimation; "Learning to surf" by fotologic, license: CC-BY-2.0 )
Libraries
Use these libraries to find Keypoint Detection models and implementationsDatasets
Latest papers with no code
Open-Vocabulary Animal Keypoint Detection with Semantic-feature Matching
Current image-based keypoint detection methods for animal (including human) bodies and faces are generally divided into full-supervised and few-shot class-agnostic approaches.
RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy
While most classical methods achieve rotation-equivariant detection and invariant description by design, many learning-based approaches learn to be robust only up to a certain degree.
SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras
Intelligent transportation systems (ITS) have revolutionized modern road infrastructure, providing essential functionalities such as traffic monitoring, road safety assessment, congestion reduction, and law enforcement.
ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment
We present ClothesNet: a large-scale dataset of 3D clothes objects with information-rich annotations.
ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition
Visual chart recognition systems are gaining increasing attention due to the growing demand for automatically identifying table headers and values from chart images.
CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild
Non-intrusive, real-time analysis of the dynamics of the eye region allows us to monitor humans' visual attention allocation and estimate their mental state during the performance of real-world tasks, which can potentially benefit a wide range of human-computer interaction (HCI) applications.
Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images
Keypoint detection and matching is a fundamental task in many computer vision problems, from shape reconstruction, to structure from motion, to AR/VR applications and robotics.
Human Pose Estimation in Monocular Omnidirectional Top-View Images
In our work we propose a new dataset for training and evaluation of CNNs for the task of keypoint detection in omnidirectional images.
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Unlike current deep keypoint detectors that are trained to recognize limited number of body parts, few-shot keypoint detection (FSKD) attempts to localize any keypoints, including novel or base keypoints, depending on the reference samples.
NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud
We show that a simple network based on NerVE can already outperform the previous state-of-the-art methods by a great margin.