no code implementations • 11 Nov 2022 • Yuren Cong, Jinhui Yi, Bodo Rosenhahn, Michael Ying Yang
A semantic scene graph-to-video synthesis framework (SSGVS), based on the pre-trained VSG encoder, VQ-VAE, and auto-regressive Transformer, is proposed to synthesize a video given an initial scene image and a non-fixed number of semantic scene graphs.
no code implementations • ICCV 2021 • Shijie Li, Yanying Zhou, Jinhui Yi, Juergen Gall
Trajectory forecasting is a crucial step for autonomous vehicles and mobile robots in order to navigate and interact safely.
no code implementations • 14 Oct 2020 • Shijie Li, Jinhui Yi, Yazan Abu Farha, Juergen Gall
To this end, the network first refines the poses before they are further processed to recognize the action.