3D Human Pose Estimation

309 papers with code • 25 benchmarks • 47 datasets

3D Human Pose Estimation is a computer vision task that involves estimating the 3D positions and orientations of body joints and bones from 2D images or videos. The goal is to reconstruct the 3D pose of a person in real-time, which can be used in a variety of applications, such as virtual reality, human-computer interaction, and motion analysis.

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Human Pose Estimation

Dataset	Best Model	Compare
Human3.6M	BCP+VHA R152 384x384	See all
3DPW	WHAM (ViT)	See all
MPI-INF-3DHP	MotionAGFormer-L (T=81)	See all
HumanEva-I	Ours (T=27, GT)	See all
Total Capture	AdaFuse	See all
H3WB	3D-LFM	See all
AGORA	NIKI (Twist-and-Swing)	See all
EMDB	TRAM	See all
Panoptic	TesseTrack Multi-View (5 views)	See all
Surreal	VirtualMarker	See all
3D Poses in the Wild Challenge	BeyondWeak	See all
AIST++	RobustCap	See all
SLOPER4D	LiDAR-HMR	See all
UBody	Multi-HMR	See all
SkiPose	CanonPose	See all
Geometric Pose Affordance	ResNet-F	See all
DHP19	Point Transformer	See all
RICH	IPMAN-R	See all
SPEC-MTP	W-HMR	See all
CHALL H80K	ResNet	See all
Geometric Pose Affordance	SIM-G-F	See all
JTA	Dual network	See all
HSPACE	T-THUNDR (HITI + HSPACE)	See all
3DOH50K	OOH	See all
Waymo Open Dataset	VoxelKP	See all

Show all 25 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find 3D Human Pose Estimation models and implementations

open-mmlab/mmpose

9 papers

5,026

ailingzengzzz/Split-and-Recombine-N…

3 papers

sjtuxcx/ITES

3 papers

osmr/imgclsmob

2 papers

2,918

See all 8 libraries.

Datasets

Subtasks

3D human pose and shape estimation

Weakly-supervised 3D Human Pose Estimation

Egocentric Pose Estimation

3D Absolute Human Pose Estimation

Multi-Hypotheses 3D Human Pose Estimation

Global 3D Human Pose Estimation

Most implemented papers

Most implemented Social Latest No code

Multi-Garment Net: Learning to Dress 3D People from Images

bharat-b7/MultiGarmentNetwork • • ICCV 2019

We present Multi-Garment Network (MGN), a method to predict body shape and clothing, layered on top of the SMPL model from a few frames (1-8) of a video.

Paper
Code

CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation

huawei-noah/noah-research • • 1 Aug 2022

Top-down methods dominate the field of 3D human pose and shape estimation, because they are decoupled from human detection and allow researchers to focus on the core problem.

Paper
Code

MogaNet: Multi-order Gated Aggregation Network

chengtan9907/OpenSTL • • 7 Nov 2022

Notably, MogaNet hits 80. 0\% and 87. 8\% accuracy with 5. 2M and 181M parameters on ImageNet-1K, outperforming ParC-Net and ConvNeXt-L, while saving 59\% FLOPs and 17M parameters, respectively.

Paper
Code

V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map

mks0601/V2V-PoseNet_RELEASE • • CVPR 2018

To overcome these weaknesses, we firstly cast the 3D hand and human pose estimation problem from a single depth map into a voxel-to-voxel prediction that uses a 3D voxelized grid and estimates the per-voxel likelihood for each keypoint.

Paper
Code

Semantic Graph Convolutional Networks for 3D Human Pose Regression

garyzhao/SemGCN • • CVPR 2019

In this paper, we study the problem of learning Graph Convolutional Networks (GCNs) for regression.

Paper
Code

VIBE: Video Inference for Human Body Pose and Shape Estimation

mkocabas/VIBE • • CVPR 2020

Human motion is fundamental to understanding behavior.

Paper
Code

XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

rwightman/pytorch-image-models • • 1 Jul 2019

The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals. We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy.

Paper
Code

Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image

mks0601/3DMPPE_POSENET_RELEASE • • ICCV 2019

Although significant improvement has been achieved recently in 3D human pose estimation, most of the previous methods only treat a single-person case.

Paper
Code

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

geopavlakos/c2f-vol-demo • • CVPR 2017

This paper addresses the challenge of 3D human pose estimation from a single color image.

Paper
Code

PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization

facebookresearch/pifuhd • • CVPR 2020

Although current approaches have demonstrated the potential in real world settings, they still fail to produce reconstructions with the level of detail often present in the input images.

Paper
Code

3D Human Pose Estimation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result