no code implementations • CVPR 2021 • Sijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang
In this paper, we address the problem of referring expression comprehension in videos, which is challenging due to complex expression and scene dynamics.
1 code implementation • 12 Oct 2020 • Lilang Lin, Sijie Song, Wenhan Yan, Jiaying Liu
To realize this goal, we integrate motion prediction, jigsaw puzzle recognition, and contrastive learning to learn skeleton features from different aspects.
no code implementations • 31 Mar 2020 • Wen-Huang Cheng, Sijie Song, Chieh-Yun Chen, Shintami Chusnul Hidayati, Jiaying Liu
Fashion is the way we present ourselves to the world and has become one of the world's largest industries.
no code implementations • 31 Jan 2020 • Sijie Song, Jiaying Liu, Yanghao Li, Zongming Guo
In this work, we propose a Modality Compensation Network (MCN) to explore the relationships of different modalities, and boost the representations for human action recognition.
1 code implementation • CVPR 2019 • Sijie Song, Wei zhang, Jiaying Liu, Tao Mei
Firstly, a semantic generative network is proposed to transform between semantic parsing maps, in order to simplify the non-rigid deformation learning.
no code implementations • 25 Nov 2018 • Yanghao Li, Sijie Song, Yuqi Li, Jiaying Liu
Temporal modeling in videos is a fundamental yet challenging problem in computer vision.
no code implementations • 22 Mar 2017 • Chunhui Liu, Yueyu Hu, Yanghao Li, Sijie Song, Jiaying Liu
Despite the fact that many 3D human activity benchmarks being proposed, most existing action datasets focus on the action recognition tasks for the segmented videos.
no code implementations • 18 Nov 2016 • Sijie Song, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jiaying Liu
In this work, we propose an end-to-end spatial and temporal attention model for human action recognition from skeleton data.
Ranked #110 on Skeleton Based Action Recognition on NTU RGB+D