Generative Adversarial Graph Convolutional Networks for Human Action Synthesis

21 Oct 2021  ·  Bruno Degardin, João Neves, Vasco Lopes, João Brito, Ehsan Yaghoubi, Hugo Proença ·

Synthesising the spatial and temporal dynamics of the human body skeleton remains a challenging task, not only in terms of the quality of the generated shapes, but also of their diversity, particularly to synthesise realistic body movements of a specific action (action conditioning). In this paper, we propose Kinetic-GAN, a novel architecture that leverages the benefits of Generative Adversarial Networks and Graph Convolutional Networks to synthesise the kinetics of the human body. The proposed adversarial architecture can condition up to 120 different actions over local and global body movements while improving sample quality and diversity through latent space disentanglement and stochastic variations. Our experiments were carried out in three well-known datasets, where Kinetic-GAN notably surpasses the state-of-the-art methods in terms of distribution quality metrics while having the ability to synthesise more than one order of magnitude regarding the number of different actions. Our code and models are publicly available at

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Human action generation Human3.6M Kinetic-GAN MMDa 0.071 # 1
MMDs 0.082 # 1
Human action generation NTU RGB+D Kinetic-GAN FID (CS) 3.618 # 1
FID (CV) 4.235 # 1
Human action generation NTU RGB+D 120 Kinetic-GAN FID (CS) 5.967 # 1
FID (CV) 6.751 # 1
Human action generation NTU RGB+D 2D Kinetic-GAN MMDa (CS) 0.256 # 1
MMDs (CS) 0.273 # 1
MMDa (CV) 0.295 # 1
MMDs (CV) 0.310 # 1