Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation

We propose a robust and accurate method for estimating the 3D poses of two hands in close interaction from a single color image. This is a very challenging problem: occlusions are large, and the joints of the two hands are easily confused with one another. State-of-the-art methods address it by regressing a heatmap for each joint, which requires solving two problems simultaneously: localizing the joints and recognizing them. In this work, we propose to separate these tasks by relying on a CNN to first localize joints as 2D keypoints, and on self-attention between the CNN features at these keypoints to associate each keypoint with the corresponding hand joint. The resulting architecture, which we call the "Keypoint Transformer", is highly efficient: it achieves state-of-the-art performance on the InterHand2.6M dataset with roughly half the number of model parameters. We also show that it can be easily extended to accurately estimate the 3D pose of an object manipulated by one or two hands. Moreover, we created a new dataset of more than 75,000 images of two hands manipulating an object, fully annotated in 3D, which we will make publicly available.
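The localize-then-identify idea can be sketched as follows: given CNN features sampled at detected 2D keypoints, a self-attention step lets every keypoint exchange information with all others before a per-keypoint classifier assigns a joint identity. This is a minimal NumPy sketch only; the single-head, single-layer setup, all weight names, and all shapes are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def associate_keypoints(features, w_q, w_k, w_v, w_cls):
    """Toy single-head self-attention over per-keypoint CNN features,
    followed by a per-keypoint joint-identity classifier.
    features: (n_keypoints, d); returns one joint label per keypoint."""
    q, k, v = features @ w_q, features @ w_k, features @ w_v
    attn = softmax(q @ k.T / np.sqrt(k.shape[-1]))   # (n_kp, n_kp)
    ctx = attn @ v                   # each keypoint attends to all others
    logits = ctx @ w_cls             # (n_kp, n_joints) class scores
    return logits.argmax(axis=-1)    # predicted joint identity per keypoint

rng = np.random.default_rng(0)
n_kp, d, n_joints = 42, 16, 42       # e.g. 21 joints per hand, two hands
feats = rng.standard_normal((n_kp, d))
ids = associate_keypoints(feats,
                          rng.standard_normal((d, d)),
                          rng.standard_normal((d, d)),
                          rng.standard_normal((d, d)),
                          rng.standard_normal((d, n_joints)))
print(ids.shape)  # (42,): one joint label per detected keypoint
```

The key design point, as in the abstract, is that detection (where is a keypoint?) and identification (which joint is it?) are decoupled: the CNN only localizes, and the attention step resolves identities using global context.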

CVPR 2022

Results from the Paper

Hand-object pose, HO-3D, model Keypoint-Trans:
    Average MPJPE (mm): 25.5, global rank #3
    ST-MPJPE (mm): 25.7, global rank #4
    PA-MPJPE (mm): 10.8, global rank #5
    OME: 68.0, global rank #5
    ADD-S: 21.4, global rank #3

3D Hand Pose Estimation, HO-3D, model Keypoint Transformer:
    Average MPJPE (mm): 25.5, global rank #5
    ST-MPJPE (mm): 25.7, global rank #9
    PA-MPJPE (mm): 10.8, global rank #10

3D Interacting Hand Pose Estimation, InterHand2.6M, model Keypoint Transformer:
    MPJPE (Test): 12.78, global rank #5
    MRRPE (Test): 29.63, global rank #4
    MPVPE (Test): -, global rank #6
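For reference, MPJPE (Mean Per-Joint Position Error) is the average Euclidean distance, here in millimeters, between predicted and ground-truth 3D joint positions; PA-MPJPE applies the same measure after Procrustes alignment of the prediction to the ground truth. A minimal sketch of the base metric (the function name, shapes, and toy data are illustrative assumptions):

```python
import numpy as np

def mpjpe(pred, gt):
    """Mean Per-Joint Position Error: average Euclidean distance between
    predicted and ground-truth joints. pred, gt: (n_joints, 3) arrays
    in the same units (assumed mm here)."""
    return np.linalg.norm(pred - gt, axis=-1).mean()

gt = np.zeros((21, 3))                    # toy single-hand ground truth
pred = gt + np.array([3.0, 0.0, 4.0])     # constant 5 mm offset per joint
print(mpjpe(pred, gt))  # 5.0
```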