no code implementations • 1 Apr 2024 • Jialou Wang, Manli Zhu, Yulei Li, Honglei Li, Longzhi Yang, Wai Lok Woo
As a result, Detect2Interact achieves consistent qualitative results on object key field detection across extensive test cases and outperforms the existing VQA system with object detection by providing a more reasonable and finer visual representation.
1 code implementation • 18 Aug 2022 • Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum
As a result, we propose a solution that explicitly takes both individual joint features and inter-joint features as input, relieving the system of the need to discover more complicated features from small data.
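To illustrate what combining individual joint features with inter-joint features can look like, here is a minimal sketch. It is not the authors' implementation: the function names are hypothetical, and the use of pairwise Euclidean distance as the inter-joint feature is an assumption made only for illustration.

```python
import math
from itertools import combinations


def joint_features(joints):
    """Flatten raw (x, y, z) joint positions into one per-frame vector."""
    return [coord for joint in joints for coord in joint]


def inter_joint_features(joints):
    """Euclidean distance for every unordered pair of joints.

    Assumption: pairwise distance is just one possible inter-joint feature.
    """
    return [math.dist(a, b) for a, b in combinations(joints, 2)]


def frame_features(joints):
    """Concatenate individual joint features with inter-joint features."""
    return joint_features(joints) + inter_joint_features(joints)


# Toy skeleton with three joints (hypothetical coordinates).
toy_skeleton = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (0.0, 0.0, 2.0)]
features = frame_features(toy_skeleton)
```

For a skeleton with J joints in 3D, this produces 3J position values plus J(J-1)/2 distances, so the relational information is supplied directly rather than left for the model to discover from limited data.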
1 code implementation • 11 Jul 2022 • Manli Zhu, Edmond S. L. Ho, Hubert P. H. Shum
Our network exploits the spatial connections between human keypoints and object keypoints to capture their fine-grained structural interactions via graph convolutions.
Ranked #22 on Human-Object Interaction Detection on V-COCO
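The idea of passing information along connections between human keypoints and object keypoints can be sketched with a single graph-convolution step. This is a generic, simplified layer (mean aggregation over neighbours with self-loops, a linear projection, then ReLU), not the network from the paper. The node layout, edge list, and weight matrix below are all hypothetical.

```python
def normalized_adjacency(num_nodes, edges):
    """Row-normalized adjacency matrix with self-loops for an undirected graph."""
    adj = [[1.0 if i == j else 0.0 for j in range(num_nodes)]
           for i in range(num_nodes)]
    for i, j in edges:
        adj[i][j] = 1.0
        adj[j][i] = 1.0
    return [[value / sum(row) for value in row] for row in adj]


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def graph_conv(features, adj_norm, weight):
    """One step: average over neighbours, project linearly, apply ReLU."""
    aggregated = matmul(adj_norm, features)
    projected = matmul(aggregated, weight)
    return [[max(0.0, value) for value in row] for row in projected]


# Nodes 0 and 1 stand for human keypoints, node 2 for an object keypoint.
keypoints = [[0.0, 0.0], [2.0, 0.0], [4.0, 2.0]]  # hypothetical 2D positions
edges = [(0, 1), (1, 2)]  # one skeleton edge, one human-object edge
identity_weight = [[1.0, 0.0], [0.0, 1.0]]  # stand-in for learned weights

out = graph_conv(keypoints, normalized_adjacency(3, edges), identity_weight)
```

With the identity weight, each output row is simply the mean of a node's own position and those of its neighbours, which makes it easy to see how the human-object edge lets the object keypoint influence the human keypoint it is connected to.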
no code implementations • 8 Jun 2021 • Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum
To highlight the capacity of the deep network in modelling input features, we utilize raw joint positions instead of hand-crafted features.