GestureGAN for Hand Gesture-to-Gesture Translation in the Wild

14 Aug 2018 · Hao Tang, Wei Wang, Dan Xu, Yan Yan, Nicu Sebe

Hand gesture-to-gesture translation in the wild is a challenging task, since hand gestures can have arbitrary poses, sizes, locations, and self-occlusions. The task therefore requires a high-level understanding of the mapping between the input source gesture and the output target gesture. To tackle this problem, we propose a novel hand Gesture Generative Adversarial Network (GestureGAN). GestureGAN consists of a single generator $G$ and a discriminator $D$; the generator takes as input a conditional hand image and a target hand skeleton image. GestureGAN utilizes the hand skeleton information explicitly and learns the gesture-to-gesture mapping through two novel losses: a color loss and a cycle-consistency loss. The proposed color loss addresses the "channel pollution" issue that arises while back-propagating the gradients. In addition, we present the Fréchet ResNet Distance (FRD) to evaluate the quality of generated images. Extensive experiments on two widely used benchmark datasets, NTU Hand Digit and Senz3D, demonstrate that GestureGAN achieves state-of-the-art performance on the unconstrained hand gesture-to-gesture translation task. Moreover, the generated images are of high quality and photo-realistic enough to serve as data augmentation that improves the performance of a hand gesture classifier. Our model and code are available at https://github.com/Ha0Tang/GestureGAN.
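The abstract only names the two losses; as a rough sketch of the idea (not the authors' implementation), the color loss can be read as an L1 reconstruction term computed per RGB channel, so each channel back-propagates its own error and one channel does not "pollute" the others, while the cycle-consistency term requires that translating back to the source skeleton recovers the input image. All function names and the loss weighting below are assumptions:

```python
import torch
import torch.nn.functional as F

def color_loss(fake: torch.Tensor, real: torch.Tensor) -> torch.Tensor:
    # fake, real: (N, 3, H, W) images. One L1 term per color channel, so
    # each channel's gradient comes from its own reconstruction error
    # (the mixing a single whole-image loss would cause is the
    # "channel pollution" the paper refers to).
    return sum(F.l1_loss(fake[:, c], real[:, c]) for c in range(fake.size(1)))

def cycle_loss(cycled: torch.Tensor, original: torch.Tensor) -> torch.Tensor:
    # Translating to the target gesture and back should recover the input.
    return F.l1_loss(cycled, original)

# Hypothetical training step, assuming a generator G(image, skeleton):
#   x : source gesture image        s_x : source hand skeleton
#   y : ground-truth target image   s_y : target hand skeleton
# y_fake = G(x, s_y)                # source -> target gesture
# x_cyc  = G(y_fake, s_x)           # target -> back to source
# loss_G = color_loss(y_fake, y) + 10.0 * cycle_loss(x_cyc, x)
```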

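The abstract introduces FRD but gives no formula. By analogy with the Fréchet Inception Distance, one plausible reading is a Fréchet distance between Gaussians fitted to ResNet features of real and generated images; the sketch below follows that assumption (the ResNet variant and feature layer are my choices, not specified here):

```python
import numpy as np
from scipy import linalg
import torch
import torchvision.models as models

# Feature extractor: a pretrained ResNet with its classifier removed.
# (ResNet-50 pool5 features are an assumption, not the paper's spec.)
resnet = models.resnet50(weights="IMAGENET1K_V1")
resnet.fc = torch.nn.Identity()
resnet.eval()

@torch.no_grad()
def extract_features(images: torch.Tensor) -> np.ndarray:
    # images: (N, 3, H, W), already resized and normalized for ResNet.
    return resnet(images).cpu().numpy()

def frechet_distance(real_feats: np.ndarray, fake_feats: np.ndarray) -> float:
    # Fréchet distance between two Gaussians fitted to the features:
    # ||mu_r - mu_f||^2 + Tr(C_r + C_f - 2 (C_r C_f)^{1/2})
    mu_r, mu_f = real_feats.mean(0), fake_feats.mean(0)
    cov_r = np.cov(real_feats, rowvar=False)
    cov_f = np.cov(fake_feats, rowvar=False)
    covmean, _ = linalg.sqrtm(cov_r @ cov_f, disp=False)
    if np.iscomplexobj(covmean):  # drop tiny imaginary parts from sqrtm
        covmean = covmean.real
    diff = mu_r - mu_f
    return float(diff @ diff + np.trace(cov_r + cov_f - 2.0 * covmean))
```

Lower FRD means the generated-image feature distribution sits closer to the real one, mirroring how FID is interpreted.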

Datasets


NTU Hand Digit · Senz3D
| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Gesture-to-Gesture Translation | NTU Hand Digit | GestureGAN | MSE | 105.7286 | #1 |
| Gesture-to-Gesture Translation | NTU Hand Digit | GestureGAN | PSNR | 32.6091 | #2 |
| Gesture-to-Gesture Translation | NTU Hand Digit | GestureGAN | IS | 2.5532 | #1 |
| Gesture-to-Gesture Translation | NTU Hand Digit | GestureGAN | AMT | 26.1 | #2 |
| Gesture-to-Gesture Translation | Senz3D | GestureGAN | MSE | 169.9219 | #1 |
| Gesture-to-Gesture Translation | Senz3D | GestureGAN | PSNR | 27.9749 | #2 |
| Gesture-to-Gesture Translation | Senz3D | GestureGAN | IS | 3.4107 | #1 |
| Gesture-to-Gesture Translation | Senz3D | GestureGAN | AMT | 22.6 | #2 |
