PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation

CVPR 2021  ยท  Kehong Gong, Jianfeng Zhang, Jiashi Feng ยท

Existing 3D human pose estimators suffer poor generalization performance to new datasets, largely due to the limited diversity of 2D-3D pose pairs in the training data. To address this problem, we present PoseAug, a new auto-augmentation framework that learns to augment the available training poses towards a greater diversity and thus improve generalization of the trained 2D-to-3D pose estimator. Specifically, PoseAug introduces a novel pose augmentor that learns to adjust various geometry factors (e.g., posture, body size, view point and position) of a pose through differentiable operations. With such differentiable capacity, the augmentor can be jointly optimized with the 3D pose estimator and take the estimation error as feedback to generate more diverse and harder poses in an online manner. Moreover, PoseAug introduces a novel part-aware Kinematic Chain Space for evaluating local joint-angle plausibility and develops a discriminative module accordingly to ensure the plausibility of the augmented poses. These elaborate designs enable PoseAug to generate more diverse yet plausible poses than existing offline augmentation methods, and thus yield better generalization of the pose estimator. PoseAug is generic and easy to be applied to various 3D pose estimators. Extensive experiments demonstrate that PoseAug brings clear improvements on both intra-scenario and cross-scenario datasets. Notably, it achieves 88.6% 3D PCK on MPI-INF-3DHP under cross-dataset evaluation setup, improving upon the previous best data augmentation based method by 9.1%. Code can be found at: https://github.com/jfzhang95/PoseAug.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Results from the Paper


 Ranked #1 on Monocular 3D Human Pose Estimation on Human3.6M (Use Video Sequence metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Human Pose Estimation 3DPW HR-Net+ST-GCN+PoseAug PA-MPJPE 73.2 # 117
Monocular 3D Human Pose Estimation Human3.6M HR-Net+VPose+PoseAug Average MPJPE (mm) 50.2 # 21
PA-MPJPE 39.1 # 4
Weakly-supervised 3D Human Pose Estimation Human3.6M PoseAug Number of Frames Per View 1 # 1
3D Annotations S1 # 1
Monocular 3D Human Pose Estimation Human3.6M PoseAug Use Video Sequence No # 1
Frames Needed 1 # 1
Need Ground Truth 2D Pose No # 1
3D Human Pose Estimation Human3.6M 2DGT+VPose+PoseAug (GTi) Average MPJPE (mm) 38.2 # 65
Using 2D ground-truth joints Yes # 2
Multi-View or Monocular Monocular # 1
3D Human Pose Estimation Human3.6M HR-Net+VPose+PoseAug Average MPJPE (mm) 50.2 # 167
Using 2D ground-truth joints No # 2
Multi-View or Monocular Monocular # 1
Weakly-supervised 3D Human Pose Estimation Human3.6M PoseAug Average MPJPE (mm) 56.7 # 7
Number of Views 1 # 1
Number of Frames Per View 1 # 1
3D Annotations S1 # 1
3D Human Pose Estimation Human3.6M 2DGT+ST-GCN+PoseAug (GTi) Average MPJPE (mm) 36.9 # 61
Using 2D ground-truth joints Yes # 2
Multi-View or Monocular Monocular # 1
3D Human Pose Estimation Human3.6M HR-Net+ST-GCN+PoseAug Average MPJPE (mm) 50.8 # 176
Using 2D ground-truth joints No # 2
Multi-View or Monocular Monocular # 1
3D Human Pose Estimation MPI-INF-3DHP PoseAug (+Extra2D) AUC 57.9 # 26
MPJPE 71.1 # 26
PCK 89.2 # 28
3D Human Pose Estimation MPI-INF-3DHP HR-Net+VPose+PoseAug MPJPE 73.2 # 28
3D Human Pose Estimation MPI-INF-3DHP HR-Net+ST-GCN+PoseAug MPJPE 76.6 # 32
3D Human Pose Estimation MPI-INF-3DHP VPose+PoseAug AUC 57.3 # 28
MPJPE 73 # 27
PCK 88.6 # 31

Methods


No methods listed for this paper. Add relevant methods here