EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation

26 Mar 2024 · Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali, Linguang Zhang, Elliot J. Crowley, Cem Keskin

We present EgoPoseFormer, a simple yet effective transformer-based model for stereo egocentric human pose estimation. The main challenge in egocentric pose estimation is overcoming joint invisibility, which is caused by self-occlusion or the limited field of view (FOV) of head-mounted cameras. Our approach overcomes this challenge with a two-stage pose estimation paradigm: in the first stage, our model leverages global information to estimate each joint's coarse location; in the second stage, it employs a DETR-style transformer to refine the coarse locations by exploiting fine-grained stereo visual features. In addition, we present a deformable stereo operation that lets our transformer process multi-view features effectively and localize each joint accurately in the 3D world. We evaluate our method on the stereo UnrealEgo dataset and show that it significantly outperforms previous approaches while being computationally efficient: it improves MPJPE by 27.4 mm (a 45% improvement) with only 7.9% of the model parameters and 13.1% of the FLOPs of the state of the art. Surprisingly, with proper training techniques, we find that even our first-stage pose proposal network can outperform prior methods. We also show that our method can be seamlessly extended to monocular settings, where it achieves state-of-the-art performance on the SceneEgo dataset, improving MPJPE by 25.5 mm (a 21% improvement) over the best existing method with only 60.7% of the model parameters and 36.4% of the FLOPs.
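The second stage described in the abstract refines coarse 3D joint locations using features from both views. The NumPy sketch below illustrates only the underlying geometric idea: project each coarse joint into two views and bilinearly sample a feature vector at the projected pixel. It is not the paper's deformable stereo operation or its DETR-style transformer. The pinhole camera model, the rig-to-camera poses, the averaging across views, and all function names (`project_pinhole`, `bilinear_sample`, `stereo_joint_features`) are assumptions made for illustration.

```python
import numpy as np

def project_pinhole(points_cam, focal, center):
    """Project (J, 3) camera-frame points to (J, 2) pixel coordinates.

    Assumes a simple pinhole model and points in front of the camera (z > 0).
    """
    x, y, z = points_cam[:, 0], points_cam[:, 1], points_cam[:, 2]
    u = focal * x / z + center[0]
    v = focal * y / z + center[1]
    return np.stack([u, v], axis=-1)

def bilinear_sample(feat, uv):
    """Bilinearly sample an (H, W, C) feature map at (J, 2) pixel coordinates.

    uv[:, 0] is the column (u) and uv[:, 1] is the row (v).
    Coordinates are clamped to the image bounds.
    """
    h, w, _ = feat.shape
    u = np.clip(uv[:, 0], 0, w - 1)
    v = np.clip(uv[:, 1], 0, h - 1)
    u0 = np.floor(u).astype(int)
    v0 = np.floor(v).astype(int)
    u1 = np.minimum(u0 + 1, w - 1)
    v1 = np.minimum(v0 + 1, h - 1)
    wu = (u - u0)[:, None]
    wv = (v - v0)[:, None]
    top = feat[v0, u0] * (1 - wu) + feat[v0, u1] * wu
    bottom = feat[v1, u0] * (1 - wu) + feat[v1, u1] * wu
    return top * (1 - wv) + bottom * wv

def stereo_joint_features(joints, cam_poses, feats, focal, center):
    """Gather one feature vector per joint by averaging samples over views.

    joints:    (J, 3) coarse joint locations in a shared rig frame.
    cam_poses: list of (R, t) with X_cam = R @ X_rig + t (hypothetical convention).
    feats:     list of (H, W, C) feature maps, one per view.
    """
    per_view = []
    for (rot, trans), feat in zip(cam_poses, feats):
        pts_cam = joints @ rot.T + trans
        uv = project_pinhole(pts_cam, focal, center)
        per_view.append(bilinear_sample(feat, uv))
    return np.mean(per_view, axis=0)
```

In a refinement loop of this general shape, the sampled features would feed a network that predicts an offset to each coarse joint. Averaging across views is the simplest possible fusion and is used here only to keep the sketch short.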

Code

No code implementations yet.

Tasks

3D Human Pose Estimation

Egocentric Pose Estimation

Pose Estimation

Datasets

xR-EgoPose

UnrealEgo

SceneEgo

Results from the Paper

Ranked #1 on Egocentric Pose Estimation on UnrealEgo

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Egocentric Pose Estimation	SceneEgo	EgoPoseFormer	Average MPJPE (mm)	93.0	#3
Egocentric Pose Estimation	SceneEgo	EgoPoseFormer	PA-MPJPE	74.3	#3
Egocentric Pose Estimation	UnrealEgo	EgoPoseFormer	Average MPJPE (mm)	33.4	#1
Egocentric Pose Estimation	UnrealEgo	EgoPoseFormer	PA-MPJPE	32.7	#1
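
The two metrics in the table can be sketched in NumPy as follows. MPJPE is the mean Euclidean distance between predicted and ground-truth joints; PA-MPJPE computes the same error after a Procrustes (similarity) alignment of the prediction to the ground truth. This is a minimal single-pose sketch using the common definitions, not the benchmarks' evaluation code: the function names are hypothetical, and each benchmark's protocol (averaging over frames, any root alignment) may differ.

```python
import numpy as np

def mpjpe(pred, gt):
    """Mean per-joint position error for one pose; pred and gt are (J, 3)."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())

def procrustes_align(pred, gt):
    """Align pred to gt with the best-fit scale, rotation, and translation."""
    mu_p, mu_g = pred.mean(axis=0), gt.mean(axis=0)
    p, g = pred - mu_p, gt - mu_g
    # Cross-covariance between centered prediction and ground truth.
    h = p.T @ g
    u, s, vt = np.linalg.svd(h)
    # Flip the last axis if needed so the result is a rotation, not a reflection.
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    diag = np.array([1.0, 1.0, d])
    rot = vt.T @ np.diag(diag) @ u.T
    scale = (s * diag).sum() / (p ** 2).sum()
    return scale * p @ rot.T + mu_g

def pa_mpjpe(pred, gt):
    """MPJPE after Procrustes alignment of pred to gt."""
    return mpjpe(procrustes_align(pred, gt), gt)
```

Because the alignment removes global rotation, translation, and scale, PA-MPJPE measures errors in the articulated pose itself, which is why it is never larger than the best achievable MPJPE under a similarity transform.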

Methods

Absolute Position Encodings • Adam • BPE • Convolution • Dense Connections • DETR • Dropout • Feedforward Network • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
