no code implementations • 13 Oct 2023 • Feng Jiang, Chaoping Tu, Gang Zhang, Jun Li, Hanqing Huang, Junyu Lin, Di Feng, Jian Pu
LiDAR and camera are two critical sensors for multi-modal 3D semantic segmentation, and they must be fused efficiently and robustly to ensure safety in various real-world scenarios.
no code implementations • 9 Jul 2020 • Chaoping Tu, Yin Zhao, Longjun Cai
Person re-identification (re-ID) is a challenging task in the real world.
no code implementations • 25 Sep 2019 • Yin Zhao, Longjun Cai, Chaoping Tu, Jie Zhang, Wu Wei
Feature extraction, multi-modal fusion, and temporal context fusion are crucial stages for predicting the valence and arousal values of emotional impact, but they have not been successfully exploited.
no code implementations • 1 Sep 2019 • Jie Zhang, Yin Zhao, Longjun Cai, Chaoping Tu, Wu Wei
We select the most suitable modalities for the valence and arousal tasks, respectively, and each modal feature is extracted using a modality-specific deep model pre-trained on a large generic dataset.