3D Human Pose Estimation
311 papers with code • 25 benchmarks • 47 datasets
3D Human Pose Estimation is a computer vision task that involves estimating the 3D positions and orientations of body joints and bones from 2D images or videos. The goal is to reconstruct the 3D pose of a person, often in real time, for applications such as virtual reality, human-computer interaction, and motion analysis.
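To make the geometry of the task concrete, here is a minimal numpy sketch (not any specific method from the papers below; the intrinsics `f`, `cx`, `cy` and the toy skeleton are made-up values). It projects 3D joints through a pinhole camera and shows that the 2D observation can only be inverted once per-joint depth is known, which is exactly the quantity a 3D pose estimator must infer:

```python
import numpy as np

# Hypothetical camera intrinsics (assumed values for illustration)
f, cx, cy = 1000.0, 640.0, 360.0

def project(joints_3d):
    """Pinhole projection: camera-space 3D joints (N, 3) -> 2D pixels (N, 2)."""
    X, Y, Z = joints_3d.T
    return np.stack([f * X / Z + cx, f * Y / Z + cy], axis=1)

def back_project(joints_2d, depth):
    """Invert the projection given per-joint depth (N,).

    In practice depth is unknown and must be estimated; here we pass
    the ground-truth depth to show the mapping is exactly invertible.
    """
    u, v = joints_2d.T
    X = (u - cx) * depth / f
    Y = (v - cy) * depth / f
    return np.stack([X, Y, depth], axis=1)

# Toy 3-joint skeleton in camera coordinates (metres), ~3 m from the camera
joints = np.array([[0.0, -0.5, 3.0], [0.2, 0.0, 3.1], [-0.2, 0.0, 2.9]])
px = project(joints)
recovered = back_project(px, joints[:, 2])  # exact recovery with known depth
```

The ambiguity this exposes (many 3D poses share one 2D projection) is why the methods below lean on priors such as body models, temporal context, or multi-hypothesis generation.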
Latest papers
Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey
To the best of our knowledge, this survey is arguably the first to comprehensively cover deep learning methods for 3D human pose estimation, including both single-person and multi-person approaches, as well as human mesh recovery, encompassing methods based on explicit models and implicit representations.
Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot
We present Multi-HMR, a strong single-shot model for multi-person 3D human mesh recovery from a single RGB image.
Lester: rotoscope animation through video object segmentation and tracking
This article introduces Lester, a novel method to automatically synthesize retro-style 2D animations from videos.
Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers
Due to the challenges of data collection, mainstream 3D human pose estimation datasets are primarily composed of multi-view video data captured in laboratory environments, which contains rich spatial-temporal correlation information beyond the image frame content itself.
Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework
However, there is still ample room for improvement as these methods often overlook the exploration of correlation between the 2D and 3D joint-level features.
Diffusion-based Pose Refinement and Multi-hypothesis Generation for 3D Human Pose Estimation
To address these two challenges, we propose a diffusion-based refinement framework called DRPose, which refines the output of deterministic models by reverse diffusion and achieves more suitable multi-hypothesis prediction for the current pose benchmark by multi-step refinement with multiple noises.
STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion
This method can remarkably improve the smoothness of recovery results from video.
3D-LFM: Lifting Foundation Model
The lifting of 3D structure and camera from 2D landmarks is a cornerstone of the entire discipline of computer vision.
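A classical instance of this lifting problem is rigid structure-from-motion by rank-3 factorization (Tomasi-Kanade). The sketch below uses synthetic data of our own making, not anything from the 3D-LFM paper: zero-mean 3D landmarks are observed under orthographic projection in several frames, and a truncated SVD recovers cameras and structure jointly, up to a 3x3 linear ambiguity:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: N zero-mean 3D landmarks observed in F frames under
# orthographic projection (frame orientations are random for illustration).
N, F = 15, 8
S = rng.standard_normal((3, N))
S -= S.mean(axis=1, keepdims=True)           # centre the structure

def random_basis(rng):
    """Random orthonormal 3x3 basis via QR (sign-fixed for uniqueness)."""
    Q, R = np.linalg.qr(rng.standard_normal((3, 3)))
    return Q * np.sign(np.diag(R))

# Stack the per-frame 2D observations into the 2F x N measurement matrix W;
# each frame contributes the first two rows of its orthonormal basis.
W = np.vstack([random_basis(rng)[:2] @ S for _ in range(F)])

# Rank-3 factorization W = M @ S_hat: in the noise-free case W has rank 3,
# so the truncated SVD reconstructs it exactly.
U, s, Vt = np.linalg.svd(W, full_matrices=False)
M = U[:, :3] * np.sqrt(s[:3])                # 2F x 3 stacked camera rows
S_hat = np.sqrt(s[:3])[:, None] * Vt[:3]     # 3 x N recovered structure
```

With noisy observations the same truncated SVD gives the best rank-3 approximation in the least-squares sense, which is why factorization remains a useful baseline for landmark lifting.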
WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion
We address these limitations with WHAM (World-grounded Humans with Accurate Motion), which accurately and efficiently reconstructs 3D human motion in a global coordinate system from video.
VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data
To the best of our knowledge, VoxelKP is the first single-staged, fully sparse network that is specifically designed for addressing the challenging task of 3D keypoint estimation from LiDAR data, achieving state-of-the-art performances.