no code implementations • 16 Apr 2024 • Kang Liao, Zongsheng Yue, Zhonghua Wu, Chen Change Loy
To our knowledge, this is the first work that solves multiple practical warping tasks in one single model.
no code implementations • CVPR 2024 • Xiangxi Shi, Zhonghua Wu, Stefan Lee
In this paper we investigate the significance of viewpoint information in 3D visual grounding -- introducing a model that explicitly predicts the speaker's viewpoint based on the referring expression and scene.
1 code implementation • NeurIPS 2023 • Hui EnPang, Zhongang Cai, Lei Yang, Qingyi Tao, Zhonghua Wu, Tianwei Zhang, Ziwei Liu
Whole-body pose and shape estimation aims to jointly predict different behaviors (e. g., pose, hand gesture, facial expression) of the entire human body from a monocular image.
no code implementations • 28 Nov 2023 • Xiaojing Zhong, Xinyi Huang, Zhonghua Wu, Guosheng Lin, Qingyao Wu
To address this problem, we propose a novel Spatial Alignment and Region-Adaptive normalization method (SARA) in this paper.
no code implementations • 28 Nov 2023 • Xiaojing Zhong, Yukun Su, Zhonghua Wu, Guosheng Lin, Qingyao Wu
3D virtual try-on enjoys many potential applications and hence has attracted wide attention.
no code implementations • 13 Sep 2023 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
In this work, we tackle the challenging problem of long-tailed image recognition.
1 code implementation • 10 Jul 2023 • Yicheng Wu, Zhonghua Wu, Hengcan Shi, Bjoern Picker, Winston Chong, Jianfei Cai
Moreover, a simple and effective relation regularization is proposed to ensure the longitudinal relations among the three outputs to improve the model learning.
1 code implementation • 24 Mar 2023 • Weide Liu, Zhonghua Wu, Yang Zhao, Yuming Fang, Chuan-Sheng Foo, Jun Cheng, Guosheng Lin
Current methods for few-shot segmentation (FSSeg) have mainly focused on improving the performance of novel classes while neglecting the performance of base classes.
no code implementations • 14 Mar 2023 • Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Zhonghua Wu, Qingyi Tao, Lewei Lu, Shijian Lu
The task of 3D single object tracking (SOT) with LiDAR point clouds is crucial for various applications, such as autonomous driving and robotics.
1 code implementation • 9 Mar 2023 • Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai
Weakly-supervised point cloud segmentation with extremely limited labels is highly desirable to alleviate the expensive costs of collecting densely annotated 3D points.
no code implementations • 19 Jul 2022 • Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian
Weakly supervised point cloud segmentation, i. e. semantically segmenting a point cloud with only a few labeled points in the whole 3D scene, is highly desirable due to the heavy burden of collecting abundant dense annotations for the model training.
no code implementations • 2 Jun 2022 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training.
Ranked #9 on Long-tail Learning on CIFAR-10-LT (ρ=10)
2 code implementations • 2 Mar 2022 • Yicheng Wu, Zhonghua Wu, Qianyi Wu, ZongYuan Ge, Jianfei Cai
The pixel-level smoothness forces the model to generate invariant results under adversarial perturbations.
no code implementations • 17 Aug 2021 • Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu
With the development of Generative Adversarial Network, image-based virtual try-on methods have made great progress.
1 code implementation • 11 Aug 2021 • Weide Liu, Zhonghua Wu, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
To this end, we first propose a prior extractor to learn the query information from the unlabeled images with our proposed global-local contrastive learning.
1 code implementation • ICCV 2021 • Zhonghua Wu, Xiangxi Shi, Guosheng Lin, Jianfei Cai
To explicitly learn meta-class representations in few-shot segmentation task, we propose a novel Meta-class Memory based few-shot segmentation method (MM-Net), where we introduce a set of learnable memory embeddings to memorize the meta-class information during the base class training and transfer to novel classes during the inference stage.
no code implementations • 27 Jul 2021 • Xiangxi Shi, Zhonghua Wu, Guosheng Lin, Jianfei Cai, Shafiq Joty
Therefore, in this paper, we propose a memory-based Image Manipulation Network (MIM-Net), where a set of memories learned from images is introduced to synthesize the texture information with the guidance of the textual description.
no code implementations • CVPR 2020 • Zhonghua Wu, Qingyi Tao, Guosheng Lin, Jianfei Cai
To reduce the human labeling effort, we propose a novel webly supervised object detection (WebSOD) method for novel classes which only requires the web images without further annotations.
no code implementations • 21 Nov 2018 • Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai
Instead, we present a novel virtual Try-On network, M2E-Try On Net, which transfers the clothes from a model image to a person image without the need of any clean product images.
no code implementations • 14 Sep 2018 • Zhonghua Wu, Guosheng Lin, Jianfei Cai
We develop an iterative learning method to generate pseudo part segmentation masks from keypoint labels.