PISEP^2: Pseudo Image Sequence Evolution based 3D Pose Prediction

Pose prediction is to predict future poses given a window of previous poses. In this paper, we propose a new problem that predicts poses using 3D joint coordinate sequences. Different from the traditional pose prediction based on Mocap frames, this problem is convenient to use in real applications due to its simple sensors to capture data. We also present a new framework, PISEP^2 (Pseudo Image Sequence Evolution based 3D Pose Prediction), to address this new problem. Specifically, a skeletal representation is proposed by transforming the joint coordinate sequence into an image sequence, which can model the different correlations of different joints. With this image based skeletal representation, we model the pose prediction as the evolution of image sequence. Moreover, a novel inference network is proposed to predict all future poses in one step by decoupling the decoders in a non-recursive manner. Compared with the recursive sequence to sequence model, we can improve the computational efficiency and avoid error accumulation significantly. Extensive experiments are carried out on two benchmark datasets (e.g. G3D and FNTU). The proposed method achieves the state-of-the-art performance on both datasets, which demonstrates the effectiveness of our proposed method.

PDF Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Pose Prediction Filtered NTU RGB+D PISEP^2 (L1 norm) MSE 0.1210 # 1
MAE 1.1651 # 1
Pose Prediction Filtered NTU RGB+D PISEP^2 (L2) MSE 0.1354 # 2
MAE 1.4063 # 2
Pose Prediction Gaming 3D (G3D) PISEP^2 (L1 norm) MSE 0.1199 # 1
MAE 1.1101 # 1
Pose Prediction Gaming 3D (G3D) PISEP^2 (L2) MSE 0.1434 # 2
MAE 1.4038 # 2

Methods


No methods listed for this paper. Add relevant methods here