2 code implementations • 12 Dec 2023 • Weiguang Zhao, Guanyu Yang, Rui Zhang, Chenru Jiang, Chaolong Yang, Yuyao Yan, Amir Hussain, Kaizhu Huang
To this end, we propose a more realistic and challenging scenario named open-pose 3D zero-shot classification, focusing on the recognition of 3D objects regardless of their orientation.
no code implementations • 13 Dec 2022 • Chaolong Yang, Yuyao Yan, Weiguang Zhao, Jianan Ye, Xi Yang, Amir Hussain, Kaizhu Huang
On the one hand, the unidirectional projection enforces our model focused more on the core task, i. e., 3D segmentation; on the other hand, unlocking the bidirectional to unidirectional projection enables a deeper cross-domain semantic alignment and enjoys the flexibility to fuse better and complicated features from very different spaces.
1 code implementation • 27 Oct 2022 • Zhaorui Tan, Xi Yang, Zihan Ye, Qiufeng Wang, Yuyao Yan, Anh Nguyen, Kaizhu Huang
Generating consistent and high-quality images from given texts is essential for visual-language understanding.
1 code implementation • ICCV 2023 • Weiguang Zhao, Yuyao Yan, Chaolong Yang, Jianan Ye, Xi Yang, Kaizhu Huang
Due to the uneven distribution of offset points, these existing methods can hardly cluster all instance points.
Ranked #3 on 3D Instance Segmentation on S3DIS
1 code implementation • 22 Jul 2022 • Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang
Although deep-learning based methods for monocular pedestrian detection have made great progress, they are still vulnerable to heavy occlusions.
Ranked #2 on Multiview Detection on Wildtrack (using extra training data)
no code implementations • 24 May 2022 • Zixian Su, Kai Yao, Xi Yang, Qiufeng Wang, Yuyao Yan, Jie Sun, Kaizhu Huang
This combination of global and local alignment can precisely localize the crucial regions in segmentation target while preserving the overall semantic consistency.
1 code implementation • 8 Apr 2022 • Weiguang Zhao, Chaolong Yang, Jianan Ye, Rui Zhang, Yuyao Yan, Xi Yang, Bin Dong, Amir Hussain, Kaizhu Huang
While weakly supervised multi-view face reconstruction (MVR) is garnering increased attention, one critical issue still remains open: how to effectively fuse multiple image information to reconstruct high-precision 3D models.
1 code implementation • 27 Jan 2022 • Penglei Gao, Xi Yang, Rui Zhang, John Y. Goulermas, Yujie Geng, Yuyao Yan, Kaizhu Huang
In this paper, we develop a novel transformer-based generative adversarial neural network called U-Transformer for generalised image outpainting problem.