no code implementations • 25 Apr 2024 • Xiaotong Yu, Chang-Wen Chen
Efficient visual perception using mobile systems is crucial, particularly in unknown environments such as search and rescue operations, where swift and comprehensive perception of objects of interest is essential.
no code implementations • 15 Mar 2024 • Xiaotong Yu, Ruihan Xie, Zhihe Zhao, Chang-Wen Chen
While we enjoy the richness and informativeness of multimodal data, it also introduces interference and redundancy of information.
1 code implementation • CVPR 2023 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
Specifically, cheap scene graph supervision data can be easily obtained by parsing image language descriptions into semantic graphs.
no code implementations • 22 Aug 2022 • Lingbo Liu, Jianlong Chang, Bruce X. B. Yu, Liang Lin, Qi Tian, Chang-Wen Chen
Previous methods usually fine-tuned the entire networks for each specific dataset, which will be burdensome to store massive parameters of these networks.
1 code implementation • CVPR 2022 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
Such design decomposes the process of HOI set prediction into two subsequent phases, i. e., an interaction proposal generation is first performed, and then followed by transforming the non-parametric interaction proposals into HOI predictions via a structure-aware Transformer.
Ranked #3 on Human-Object Interaction Detection on V-COCO