A Data-Driven Approach to Improve 3D Head-Pose Estimation

ISVC 2022  ·  Nima Aghli, Eraldo Ribeiro ·

Head-pose estimation from images is an important research topic in computer vision. Its many applications include detecting focus of attention, tracking driver behavior, and human-computer interaction. Recent research on head-pose estimation has focused on developing models based on deep convolutional neural networks (CNNs). These models are trained using transfer-learning and image augmentation to achieve better initiation states and robustness against occlusions. However, methods that use transfer-learning networks are usually aimed at general image recognition and offer no in-depth study of transfer learning from more task-related networks. Additionally, for the head-pose estimation, robustness against heavy occlusion, and noise such as motion blur and low-brightness are vital. In this paper, we propose a new image-augmentation approach that significantly improves the estimation accuracy of the head-pose model. We also propose a task-related weight initialization to further improve the estimation accuracy by studying internal activations of models trained for face-related tasks such as face-recognition. We test our head-pose estimation model on three challenging test sets and achieve better results to state-of-the-art methods.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Head Pose Estimation AFLW2000 DDD-Pose MAE 4.22 # 12
Head Pose Estimation BIWI DDD-Pose MAE (trained with BIWI data) 2.80 # 4
MAE (trained with other data) 4.52 # 12

Methods


No methods listed for this paper. Add relevant methods here