no code implementations • CVPR 2023 • Mengyao Xie, Zongbo Han, Changqing Zhang, Yichen Bai, QinGhua Hu
Second, the quality of the imputed data itself is of high uncertainty.
no code implementations • 9 Apr 2023 • Zongbo Han, Zhipeng Liang, Fan Yang, Liu Liu, Lanqing Li, Yatao Bian, Peilin Zhao, QinGhua Hu, Bingzhe Wu, Changqing Zhang, Jianhua Yao
Subpopulation shift exists widely in many real-world applications, which refers to the training and test distributions that contain the same subpopulation groups but with different subpopulation proportions.
2 code implementations • 23 Mar 2023 • Huan Ma, Changqing Zhang, Yatao Bian, Lemao Liu, Zhirui Zhang, Peilin Zhao, Shu Zhang, Huazhu Fu, QinGhua Hu, Bingzhe Wu
Large language models have demonstrated surprising ability to perform in-context learning, i. e., these models can be directly applied to solve numerous downstream tasks by conditioning on a prompt constructed by a few input-output examples.
1 code implementation • 2 Feb 2023 • Yiming Sun, Bing Cao, Pengfei Zhu, QinGhua Hu
The MoLE performs specialized learning of multi-modal local features, prompting the fused images to retain the local information in a sample-adaptive manner, while the MoGE focuses on the global information that complements the fused image with overall texture detail and contrast.
no code implementations • CVPR 2023 • Zixuan Qin, Liu Yang, Qilong Wang, Yahong Han, QinGhua Hu
When there are large differences in data distribution among clients, it is crucial for federated learning to design a reliable client selection strategy and an interpretable client communication framework to better utilize group knowledge.
no code implementations • 13 Dec 2022 • Qinghe Wang, Lijie Liu, Miao Hua, Qian He, Pengfei Zhu, Bing Cao, QinGhua Hu
We blend the semantic layouts of source head and source body, and then inpaint the transition region by the semantic layout generator, achieving a coarse-grained head swapping.
1 code implementation • ACMMM 2022 • Yiming Sun, Bing Cao, Pengfei Zhu, QinGhua Hu
We cascade the image fusion network with the detection networks of both modalities and use the detection loss of the fused images to provide guidance on task-related information for the optimization of the image fusion network.
no code implementations • 29 Aug 2022 • Pengfei Zhu, Xinjie Yao, Yu Wang, Meng Cao, Binyuan Hui, Shuai Zhao, QinGhua Hu
Multi-view learning has progressed rapidly in recent years.
no code implementations • 5 Jul 2022 • Hongzhi Huang, Yu Wang, QinGhua Hu, Ming-Ming Cheng
In this study, we propose a novel method, called Class-Specific Semantic Reconstruction (CSSR), that integrates the power of AE and prototype learning.
1 code implementation • 30 Jun 2022 • Huitong Chen, Yu Wang, QinGhua Hu
Re-balancing methods are used to alleviate the influence of data imbalance; however, we empirically discover that they would under-fit new classes.
no code implementations • 25 Apr 2022 • Ruize Han, Wei Feng, Qing Guo, QinGhua Hu
Visual object tracking is an important task in computer vision, which has many real-world applications, e. g., video surveillance, visual navigation.
1 code implementation • 19 Mar 2022 • Junwen Pan, Pengfei Zhu, Kaihua Zhang, Bing Cao, Yu Wang, Dingwen Zhang, Junwei Han, QinGhua Hu
Semantic segmentation with limited annotations, such as weakly supervised semantic segmentation (WSSS) and semi-supervised semantic segmentation (SSSS), is a challenging task that has attracted much attention recently.
Ranked #21 on
Weakly-Supervised Semantic Segmentation
on COCO 2014 val
no code implementations • 15 Jan 2022 • Yu Geng, Zongbo Han, Changqing Zhang, QinGhua Hu
Under the help of uncertainty, DUA-Nets weigh each view of individual sample according to data quality so that the high-quality samples (or views) can be fully exploited while the effects from the noisy samples (or views) will be alleviated.
no code implementations • 23 Nov 2021 • Pengfei Zhu, Hongtao Yu, Kaihua Zhang, Yu Wang, Shuai Zhao, Lei Wang, Tianzhu Zhang, QinGhua Hu
To address this issue, segmentation-based trackers have been proposed that employ per-pixel matching to improve the tracking performance of deformable objects effectively.
1 code implementation • NeurIPS 2021 • Huan Ma, Zongbo Han, Changqing Zhang, Huazhu Fu, Joey Tianyi Zhou, QinGhua Hu
Multimodal regression is a fundamental task, which integrates the information from different sources to improve the performance of follow-up applications.
1 code implementation • NeurIPS 2021 • Zilin Gao, Qilong Wang, Bingbing Zhang, QinGhua Hu, Peihua Li
Then, a temporal covariance pooling performs temporal pooling of the attentive covariance representations to characterize both intra-frame correlations and inter-frame cross-correlations of the calibrated features.
1 code implementation • ICCV 2021 • Ruihuang Li, Xu Jia, Jianzhong He, Shuaijun Chen, QinGhua Hu
Most existing domain adaptation methods focus on adaptation from only one source domain, however, in practice there are a number of relevant sources that could be leveraged to help improve performance on target domain.
Ranked #2 on
Unsupervised Domain Adaptation
on PACS
1 code implementation • 19 Jul 2021 • Dawei Du, Longyin Wen, Pengfei Zhu, Heng Fan, QinGhua Hu, Haibin Ling, Mubarak Shah, Junwen Pan, Ali Al-Ali, Amr Mohamed, Bakour Imene, Bin Dong, Binyu Zhang, Bouchali Hadia Nesma, Chenfeng Xu, Chenzhen Duan, Ciro Castiello, Corrado Mencar, Dingkang Liang, Florian Krüger, Gennaro Vessio, Giovanna Castellano, Jieru Wang, Junyu Gao, Khalid Abualsaud, Laihui Ding, Lei Zhao, Marco Cianciotta, Muhammad Saqib, Noor Almaadeed, Omar Elharrouss, Pei Lyu, Qi Wang, Shidong Liu, Shuang Qiu, Siyang Pan, Somaya Al-Maadeed, Sultan Daud Khan, Tamer Khattab, Tao Han, Thomas Golda, Wei Xu, Xiang Bai, Xiaoqing Xu, Xuelong Li, Yanyun Zhao, Ye Tian, Yingnan Lin, Yongchao Xu, Yuehan Yao, Zhenyu Xu, Zhijian Zhao, Zhipeng Luo, Zhiwei Wei, Zhiyuan Zhao
Crowd counting on the drone platform is an interesting topic in computer vision, which brings new challenges such as small object inference, background clutter and wide viewpoint.
1 code implementation • CVPR 2021 • Longyin Wen, Dawei Du, Pengfei Zhu, QinGhua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu
To promote the developments of object detection, tracking and counting algorithms in drone-captured videos, we construct a benchmark with a new drone-captured largescale dataset, named as DroneCrowd, formed by 112 video clips with 33, 600 HD frames in various scenarios.
no code implementations • 1 Jan 2021 • Zongbo Han, Changqing Zhang, Huazhu Fu, QinGhua Hu, Joey Tianyi Zhou
Learning effective representations for data with multiple views is crucial in machine learning and pattern recognition.
no code implementations • 12 Nov 2020 • Changqing Zhang, Yajie Cui, Zongbo Han, Joey Tianyi Zhou, Huazhu Fu, QinGhua Hu
Although multi-view learning has made signifificant progress over the past few decades, it is still challenging due to the diffificulty in modeling complex correlations among different views, especially under the context of view missing.
no code implementations • ECCV 2020 • Junbing Li, Changqing Zhang, Pengfei Zhu, Baoyuan Wu, Lei Chen, QinGhua Hu
Although significant progress achieved, multi-label classification is still challenging due to the complexity of correlations among different labels.
no code implementations • 13 Jul 2020 • Yucan Zhou, Yu Wang, Jianfei Cai, Yu Zhou, QinGhua Hu, Weiping Wang
Some works in the optimization of deep neural networks have shown that a better arrangement of training data can make the classifier converge faster and perform better.
6 code implementations • 7 May 2020 • Zhaohui Zheng, Ping Wang, Dongwei Ren, Wei Liu, Rongguang Ye, QinGhua Hu, WangMeng Zuo
In this paper, we propose Complete-IoU (CIoU) loss and Cluster-NMS for enhancing geometric factors in both bounding box regression and Non-Maximum Suppression (NMS), leading to notable gains of average precision (AP) and average recall (AR), without the sacrifice of inference efficiency.
1 code implementation • CVPR 2020 • Qilong Wang, Li Zhang, Banggu Wu, Dongwei Ren, Peihua Li, WangMeng Zuo, QinGhua Hu
Recent works have demonstrated that global covariance pooling (GCP) has the ability to improve performance of deep convolutional neural networks (CNNs) on visual classification task.
1 code implementation • 16 Mar 2020 • Pengfei Zhu, Jiayu Zheng, Dawei Du, Longyin Wen, Yiming Sun, QinGhua Hu
Moreover, an agent sharing network (ASNet) is proposed by self-supervised template sharing and view-aware fusion of the target from multiple drones, which can improve the tracking accuracy significantly compared with single drone tracking.
2 code implementations • 5 Mar 2020 • Yiming Sun, Bing Cao, Pengfei Zhu, QinGhua Hu
To address this dilemma, we further propose an uncertainty-aware cross-modality vehicle detection (UA-CMDet) framework to extract complementary information from cross-modal images, which can significantly improve the detection performance in low light conditions.
2 code implementations • 16 Jan 2020 • Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Heng Fan, QinGhua Hu, Haibin Ling
We provide a large-scale drone captured dataset, VisDrone, which includes four tracks, i. e., (1) image object detection, (2) video object detection, (3) single object tracking, and (4) multi-object tracking.
1 code implementation • 4 Dec 2019 • Longyin Wen, Dawei Du, Pengfei Zhu, QinGhua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu
This paper proposes a space-time multi-scale attention network (STANet) to solve density map estimation, localization and tracking in dense crowds of video clips captured by drones with arbitrary crowd density, perspective, and flight altitude.
1 code implementation • NeurIPS 2019 • Changqing Zhang, Zongbo Han, Yajie Cui, Huazhu Fu, Joey Tianyi Zhou, QinGhua Hu
Despite multi-view learning progressed fast in past decades, it is still challenging due to the difficulty in modeling complex correlation among different views, especially under the context of view missing.
1 code implementation • International Conference on Computer Vision Workshops 2019 • Dawei Du, Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Lin, QinGhua Hu, Tao Peng, Jiayu Zheng, Xinyao Wang, Yue Zhang, Liefeng Bo, Hailin Shi, Rui Zhu, Aashish Kumar, Aijin Li, Almaz Zinollayev, Anuar Askergaliyev, Arne Schumann, Binjie Mao, Byeongwon Lee, Chang Liu, Changrui Chen, Chunhong Pan, Chunlei Huo, Da Yu, Dechun Cong, Dening Zeng, Dheeraj Reddy Pailla, Di Li, Dong Wang, Donghyeon Cho, Dongyu Zhang, Furui Bai, George Jose, Guangyu Gao, Guizhong Liu, Haitao Xiong, Hao Qi, Haoran Wang, Heqian Qiu, Hongliang Li, Huchuan Lu, Ildoo Kim, Jaekyum Kim, Jane Shen, Jihoon Lee, Jing Ge, Jingjing Xu, Jingkai Zhou, Jonas Meier, Jun Won Choi, Junhao Hu, Junyi Zhang, Junying Huang, Kaiqi Huang, Keyang Wang, Lars Sommer, Lei Jin, Lei Zhang
Results of 33 object detection algorithms are presented.
10 code implementations • CVPR 2020 • Qilong Wang, Banggu Wu, Pengfei Zhu, Peihua Li, WangMeng Zuo, QinGhua Hu
By dissecting the channel attention module in SENet, we empirically show avoiding dimensionality reduction is important for learning channel attention, and appropriate cross-channel interaction can preserve performance while significantly decreasing model complexity.
Ranked #657 on
Image Classification
on ImageNet
1 code implementation • 6 Aug 2019 • Pengfei Zhu, Binyuan Hui, Changqing Zhang, Dawei Du, Longyin Wen, QinGhua Hu
2) The end-to-end learning manner of deep learning is not well used in multi-view clustering.
Ranked #1 on
Multi-view Subspace Clustering
on ORL
1 code implementation • CVPR 2020 • Dongwei Ren, Kai Zhang, Qilong Wang, QinGhua Hu, WangMeng Zuo
To connect MAP and deep models, we in this paper present two generative networks for respectively modeling the deep priors of clean image and blur kernel, and propose an unconstrained neural optimization solution to blind deconvolution.
4 code implementations • CVPR 2019 • Dongwei Ren, WangMeng Zuo, QinGhua Hu, Pengfei Zhu, Deyu Meng
To handle this issue, this paper provides a better and simpler baseline deraining network by considering network architecture, input and output, and loss functions.
Ranked #1 on
Single Image Deraining
on Rain1400
no code implementations • 12 Jan 2019 • Qing Yin, Guan Luo, Xiaodong Zhu, QinGhua Hu, Ou wu
Question answering (QA) is an important natural language processing (NLP) task and has received much attention in academic research and industry communities.
no code implementations • 11 Dec 2018 • Tianyu Zhao, Wenqi Ren, Changqing Zhang, Dongwei Ren, QinGhua Hu
Specifically, we propose a degradation network to model the real-world degradation process from HR to LR via generative adversarial networks, and these generated realistic LR images paired with real-world HR images are exploited for training the SR reconstruction network, forming the first cycle.
no code implementations • 21 Sep 2018 • Meijun Sun, Ziqi Zhou, QinGhua Hu, Zheng Wang, Jianmin Jiang
To this end, we propose a novel and efficient video eye fixation detection model to improve the saliency detection performance.
no code implementations • 2 Jun 2018 • Pinlong Zhao, Zhouyu Fu, Ou wu, QinGhua Hu, Jun Wang
In contrast to existing defense methods, the proposed method does not require knowledge of the process for generating adversarial examples and can be applied to defend against different types of attacks.
no code implementations • 20 Apr 2018 • Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Ling, QinGhua Hu
In this paper we present a large-scale visual object detection and tracking benchmark, named VisDrone2018, aiming at advancing visual understanding tasks on the drone platform.
no code implementations • CVPR 2017 • Changqing Zhang, QinGhua Hu, Huazhu Fu, Pengfei Zhu, Xiaochun Cao
In this paper, we propose a novel Latent Multi-view Subspace Clustering (LMSC) method, which clusters data points with latent representation and simultaneously explores underlying complementary information from multiple views.
no code implementations • 23 Jan 2014 • Linhao Li, Ping Wang, QinGhua Hu, Sijia Cai
A cyclic iteration process is then proposed to extract the background from the discriminative frame set.