1 code implementation • 27 Mar 2025 • Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li, Chen Change Loy
Unifying visual understanding and generation within a single multimodal framework remains a significant challenge, as the two inherently heterogeneous tasks require representations at different levels of granularity.
no code implementations • 24 Jan 2025 • Peiqing Yang, Shangchen Zhou, Jixin Zhao, Qingyi Tao, Chen Change Loy
For robust training, we present a larger, high-quality, and diverse dataset for video matting.
1 code implementation • NeurIPS 2023 • Hui En Pang, Zhongang Cai, Lei Yang, Qingyi Tao, Zhonghua Wu, Tianwei Zhang, Ziwei Liu
Whole-body pose and shape estimation aims to jointly predict different behaviors (e.g., pose, hand gesture, facial expression) of the entire human body from a monocular image.
1 code implementation • NeurIPS 2023 • Peiqing Yang, Shangchen Zhou, Qingyi Tao, Chen Change Loy
When combined with a diffusion prior, this partial guidance can deliver appealing results across a range of restoration tasks.
no code implementations • 14 Mar 2023 • Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Zhonghua Wu, Qingyi Tao, Lewei Lu, Shijian Lu
The task of 3D single object tracking (SOT) with LiDAR point clouds is crucial for various applications, such as autonomous driving and robotics.
1 code implementation • 28 Feb 2022 • Chawan Piansaddhayanon, Sakun Santisukwongchote, Shanop Shuangshoti, Qingyi Tao, Sira Sriswasdi, Ekapol Chuangsuwanich
Existing approaches utilize a two-stage pipeline: the detection stage for identifying the locations of potential mitotic cells and the classification stage for refining prediction confidences.
no code implementations • 19 Nov 2021 • Zhihong Lin, Donghao Zhang, Qingyi Tao, Danli Shi, Gholamreza Haffari, Qi Wu, Mingguang He, ZongYuan Ge
Medical Visual Question Answering (VQA) is a combination of medical artificial intelligence and popular VQA challenges.
no code implementations • 4 Aug 2021 • Wei Feng, Lie Ju, Lin Wang, Kaimin Song, Xin Wang, Xin Zhao, Qingyi Tao, ZongYuan Ge
In this work, we explore unsupervised domain adaptation in retinal vessel segmentation by using entropy-based adversarial learning and a transfer normalization layer to train a segmentation network that generalizes well across domains and requires no annotation of the target domain.
no code implementations • CVPR 2020 • Zhonghua Wu, Qingyi Tao, Guosheng Lin, Jianfei Cai
To reduce the human labeling effort, we propose a novel webly supervised object detection (WebSOD) method for novel classes which only requires the web images without further annotations.
1 code implementation • 9 Jul 2019 • Qingyi Tao, ZongYuan Ge, Jianfei Cai, Jianxiong Yin, Simon See
Secondly, in CT scans, the lesions are often indistinguishable from the background since the lesion and non-lesion areas may have very similar appearances.
no code implementations • 21 Nov 2018 • Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai
Instead, we present a novel virtual Try-On network, M2E-Try On Net, which transfers the clothes from a model image to a person image without the need for any clean product images.
no code implementations • ECCV 2018 • Qing Li, Qingyi Tao, Shafiq Joty, Jianfei Cai, Jiebo Luo
Most existing works in visual question answering (VQA) are dedicated to improving the accuracy of predicted answers, while disregarding the explanations.
Ranked #4 on Explanatory Visual Question Answering on GQA-REX
Explanatory Visual Question Answering • Multi-Task Learning
no code implementations • ECCV 2018 • Qingyi Tao, Hao Yang, Jianfei Cai
Object detection is one of the major problems in computer vision, and has been extensively studied.
no code implementations • 27 Jul 2017 • Qingyi Tao, Hao Yang, Jianfei Cai
Object detection methods without bounding box annotations, i.e., weakly supervised detection methods, are still lagging far behind.
Ranked #23 on Weakly Supervised Object Detection on PASCAL VOC 2012 test (using extra training data)