Pose Estimation
1339 papers with code • 28 benchmarks • 113 datasets
Pose Estimation is a computer vision task whose goal is to detect the position and orientation of a person or an object. In Human Pose Estimation, this is usually done by predicting the locations of specific keypoints such as the hands, head, and elbows.
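Many 2D pose estimators output one heatmap per keypoint and recover coordinates by taking each map's peak. The sketch below shows that decoding step in plain NumPy; the function name and toy heatmap are illustrative, not from any particular library.

```python
import numpy as np

def keypoints_from_heatmaps(heatmaps):
    """Decode (K, H, W) per-keypoint heatmaps into (K, 2) pixel
    coordinates by taking the argmax of each map -- the typical final
    step of a heatmap-based 2D pose estimator."""
    k, h, w = heatmaps.shape
    flat_idx = heatmaps.reshape(k, -1).argmax(axis=1)
    ys, xs = np.unravel_index(flat_idx, (h, w))
    return np.stack([xs, ys], axis=1)  # one (x, y) pair per keypoint

# toy example: a single "elbow" heatmap peaked at (x=12, y=5)
hm = np.zeros((1, 32, 32))
hm[0, 5, 12] = 1.0
print(keypoints_from_heatmaps(hm))  # [[12  5]]
```

Real systems refine the argmax with sub-pixel offsets, but the argmax itself is the common core.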
A common benchmark for this task is MPII Human Pose.
(Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose)
Libraries
Use these libraries to find Pose Estimation models and implementations.
Subtasks
- 3D Human Pose Estimation
- Keypoint Detection
- 3D Pose Estimation
- 6D Pose Estimation
- Hand Pose Estimation
- 6D Pose Estimation using RGB
- Multi-Person Pose Estimation
- Head Pose Estimation
- Human Pose Forecasting
- 6D Pose Estimation using RGBD
- Animal Pose Estimation
- Vehicle Pose Estimation
- RF-based Pose Estimation
- Car Pose Estimation
- Hand Joint Reconstruction
- Activeness Detection
- Semi-supervised 2D and 3D landmark labeling
Latest papers
KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation
This paper presents a novel Kinematics and Trajectory Prior Knowledge-Enhanced Transformer (KTPFormer), which overcomes a weakness of existing transformer-based methods for 3D human pose estimation: the Q, K, and V vectors in their self-attention mechanisms are all derived by simple linear mappings.
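For context, the "simple linear mapping" the abstract refers to is the vanilla self-attention design, where Q, K, and V are each just a matrix product with the input tokens. A minimal NumPy sketch of that baseline (not KTPFormer's prior-enhanced variant) looks like this:

```python
import numpy as np

def self_attention(x, wq, wk, wv):
    """Vanilla single-head self-attention over tokens x of shape (N, D).
    Q, K, V come from plain linear maps of x -- the design that
    prior-knowledge-enhanced variants seek to improve upon."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(k.shape[-1])      # scaled dot-product
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=-1, keepdims=True)      # row-wise softmax
    return attn @ v

# two 2-D tokens with identity projections, for illustration only
out = self_attention(np.eye(2), np.eye(2), np.eye(2), np.eye(2))
```

Each output row is a convex combination of the value vectors, so with value rows summing to 1 the output rows do too.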
Video-Based Human Pose Regression via Decoupled Space-Time Aggregation
In light of this, we propose a novel Decoupled Space-Time Aggregation network (DSTA) to separately capture the spatial contexts between adjacent joints and the temporal cues of each individual joint, thereby avoiding the conflation of spatiotemporal dimensions.
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
(2) The second design is a Geometric-Aware Feature Aggregation module, which can efficiently integrate the local and global geometric information into keypoint features.
Object Pose Estimation via the Aggregation of Diffusion Features
To achieve this, we propose three distinct architectures that can effectively capture and aggregate diffusion features of different granularity, greatly improving the generalizability of object pose estimation.
A Survey on 3D Egocentric Human Pose Estimation
Egocentric human pose estimation aims to estimate human body poses and develop body representations from a first-person camera perspective.
GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction
High Dynamic Range (HDR) content (i.e., images and videos) has a broad range of applications.
YOLOv5-6D: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries
We propose a general-purpose data-acquisition approach for 6-DoF pose estimation tasks in X-ray systems, a novel general-purpose YOLOv5-6D pose architecture for accurate and fast object pose estimation, and a complete method for surgical screw pose estimation from a monocular cone-beam X-ray image that accounts for the acquisition geometry.
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation.
Meta-Point Learning and Refining for Category-Agnostic Pose Estimation
Existing methods rely only on the features extracted at support keypoints to predict or refine the keypoints on the query image, but a few support feature vectors are local and inadequate for CAPE.
WHAC: World-grounded Humans and Cameras
In this study, we aim to recover expressive parametric human models (i.e., SMPL-X) and corresponding camera poses jointly, by leveraging the synergy between three critical players: the world, the human, and the camera.