1 code implementation • ECCV 2020 • Guangrui Li, Guoliang Kang, Wu Liu, Yunchao Wei, Yi Yang
The goal of CCM is to acquire synthetic images that share a similar distribution with the real ones in the target domain, so that the domain gap can be naturally alleviated by employing the content-consistent synthetic images for training.
Ranked #12 on Semantic Segmentation on GTAV-to-Cityscapes Labels
no code implementations • 14 Apr 2025 • Huijie Liu, Bingcan Wang, Jie Hu, Xiaoming Wei, Guoliang Kang
In general cases, existing text-to-image generation models excel in producing high-quality images; however, they struggle to capture diverse characteristics and faithful details of specific domains, particularly Chinese dishes.
no code implementations • 29 Mar 2025 • Yuxiang Bao, Huijie Liu, Xun Gao, Huan Fu, Guoliang Kang
It is motivated by the statistical observation that an ensemble of DDIM inversion processes over multiple trajectories yields a smaller trajectory mismatch error in expectation.
no code implementations • 5 Feb 2025 • Jingyun Wang, Cilin Yan, Guoliang Kang
As the representation of each patch is finally determined by the attention weights and the Value embeddings, we propose to reshape the last-block attention and Value embeddings to aggregate useful global context into final features.
Open Vocabulary Semantic Segmentation
Open-Vocabulary Semantic Segmentation
no code implementations • 28 Jan 2025 • Huijie Liu, Jingyun Wang, Shuai Ma, Jie Hu, Xiaoming Wei, Guoliang Kang
Extensive experiments demonstrate that compared to previous works, our method can generate videos with appearance more aligned with the text descriptions and motion more consistent with the reference videos.
1 code implementation • 6 Jan 2025 • Yuxiang Bao, Guoliang Kang, Linlin Yang, Xiaoyue Duan, Bo Zhao, Baochang Zhang
Differently, in this paper, we identify that the bias towards the frequent class may be encoded into features, i.e., the rare-specific features which play a key role in discriminating the rare class are much weaker than the frequent-specific features.
no code implementations • 10 Dec 2024 • Yingfan Wang, Guoliang Kang
In this paper, we propose a new perspective to harness CLIP for DG, i.e., attention head purification.
no code implementations • 12 Nov 2024 • Cilin Yan, Jingyun Wang, Lin Zhang, Ruihui Zhao, Xiaopu Wu, Kai Xiong, Qingsong Liu, Guoliang Kang, Yangyang Kang
In this work, we propose an Exemplar-Guided Reflection with Memory mechanism (ERM) to realize more efficient and accurate prompt optimization.
no code implementations • 23 Oct 2024 • Yu Wang, Xiaobao Wei, Ming Lu, Guoliang Kang
In this paper, we propose a new method called PLGS that enables 3DGS to generate consistent panoptic segmentation masks from noisy 2D segmentation masks while maintaining superior efficiency compared to NeRF-based methods.
1 code implementation • 15 Aug 2024 • Gengwei Zhang, Liyuan Wang, Guoliang Kang, Ling Chen, Yunchao Wei
Considering that the overly fast representation learning and the biased classification layer constitute this particular problem, we introduce the advanced Slow Learner with Classifier Alignment (SLCA++) framework to unleash the power of Seq FT, serving as a strong baseline approach for CLPT.
1 code implementation • 13 Aug 2024 • Jingyun Wang, Guoliang Kang
In this paper, we propose to explicitly model and rectify the bias existing in CLIP to facilitate the unsupervised semantic segmentation task.
2 code implementations • 16 Jul 2024 • Cilin Yan, Haochen Wang, Shilin Yan, XiaoLong Jiang, Yao Hu, Guoliang Kang, Weidi Xie, Efstratios Gavves
In this paper, we introduce a new task, Reasoning Video Object Segmentation (ReasonVOS).
Ranked #3 on Referring Video Object Segmentation on ReVOS
no code implementations • 17 Jun 2024 • Cilin Yan, Haochen Wang, XiaoLong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves
Specifically, we adopt a transformer module which takes the visual feature as "Query", the text features of the anchors as "Key" and the similarity matrix between the text features of anchor and target classes as "Value".
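The Query/Key/Value assignment described above can be illustrated with a minimal sketch of single-head scaled dot-product attention. This is not the authors' implementation: it assumes plain Python lists, omits any learned projections or multi-head structure, and the function and argument names (`anchor_attention`, `visual_query`, `anchor_text_keys`, `anchor_target_similarity`) are hypothetical. Because each Value row holds one anchor's similarities to the target classes, the output is one score per target class.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def anchor_attention(visual_query, anchor_text_keys, anchor_target_similarity):
    """Attend from one visual feature (Query) over anchor text features (Keys).

    anchor_target_similarity[i][j] is the similarity between anchor i and
    target class j (the Values). Returns one score per target class.
    All names here are illustrative assumptions, not the paper's API.
    """
    scale = math.sqrt(len(visual_query))
    # Attention weight of each anchor for this visual feature.
    weights = softmax([dot(visual_query, k) / scale for k in anchor_text_keys])
    num_targets = len(anchor_target_similarity[0])
    # Weighted sum of the anchor-to-target similarity rows.
    return [
        sum(w * row[j] for w, row in zip(weights, anchor_target_similarity))
        for j in range(num_targets)
    ]
```

For example, a visual feature that matches two anchors equally receives the average of their two similarity rows as its target-class scores.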
1 code implementation • CVPR 2024 • Jingyun Wang, Guoliang Kang
In this paper, we propose to explicitly model and rectify the bias existing in CLIP to facilitate unsupervised semantic segmentation.
no code implementations • 22 Dec 2023 • Xiaoyue Duan, Shuhao Cui, Guoliang Kang, Baochang Zhang, Zhengcong Fei, Mingyuan Fan, Junshi Huang
Consistent editing of real images is a challenging task, as it requires performing non-rigid edits (e.g., changing postures) to the main objects in the input image without changing their identity or attributes.
no code implementations • 1 Nov 2023 • Yuxiang Bao, Di Qiu, Guoliang Kang, Baochang Zhang, Bo Jin, Kaiye Wang, Pengfei Yan
As a result, the corresponding regions across the adjacent frames can share closely-related query tokens and attention outputs, which can further improve latent-level consistency to enhance visual temporal coherence of generated videos.
1 code implementation • CVPR 2023 • Runqi Wang, Xiaoyue Duan, Guoliang Kang, Jianzhuang Liu, Shaohui Lin, Songcen Xu, Jinhu Lv, Baochang Zhang
Text consists of a category name and a fixed number of learnable parameters which are selected from our designed attribute word bank and serve as attributes.
1 code implementation • 23 Apr 2023 • Cilin Yan, Haochen Wang, Jie Liu, XiaoLong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves
Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing.
2 code implementations • ICCV 2023 • Gengwei Zhang, Liyuan Wang, Guoliang Kang, Ling Chen, Yunchao Wei
The goal of continual learning is to improve the performance of recognition models in learning sequentially arriving data.
no code implementations • CVPR 2023 • Guangrui Li, Guoliang Kang, Xiaohan Wang, Yunchao Wei, Yi Yang
With the help of adversarial training, the masking module can learn to generate source masks to mimic the pattern of irregular target noise, thereby narrowing the domain gap.
no code implementations • 28 Nov 2022 • Xiaoyue Duan, Guoliang Kang, Runqi Wang, Shumin Han, Song Xue, Tian Wang, Baochang Zhang
Based on this observation, we propose a simple strategy, i.e., increasing the number of training shots, to mitigate the loss of intrinsic dimension caused by robustness-promoting regularization.
no code implementations • 17 Jan 2022 • Mengshu Sun, Haoyu Ma, Guoliang Kang, Yifan Jiang, Tianlong Chen, Xiaolong Ma, Zhangyang Wang, Yanzhi Wang
To the best of our knowledge, this is the first time quantization has been incorporated into ViT acceleration on FPGAs with the help of a fully automatic framework to guide the quantization strategy on the software side and the accelerator implementations on the hardware side given the target frame rate.
1 code implementation • CVPR 2021 • Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang
To better exploit the intrinsic structure of the target domain, we propose Domain Consensus Clustering (DCC), which exploits the domain consensus knowledge to discover discriminative clusters on both common samples and private ones.
Ranked #4 on Partial Domain Adaptation on Office-31
2 code implementations • NeurIPS 2021 • Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei
Directly performing cross-attention may aggregate these features from support to query and bias the query features.
Ranked #59 on Few-Shot Semantic Segmentation on PASCAL-5i (1-Shot)
1 code implementation • NeurIPS 2020 • Guoliang Kang, Yunchao Wei, Yi Yang, Yueting Zhuang, Alexander G. Hauptmann
The conventional solution to this task is to minimize the discrepancy between source and target to enable effective knowledge transfer.
Ranked #27 on Synthetic-to-Real Translation on SYNTHIA-to-Cityscapes
1 code implementation • Proceedings of the IEEE Winter Conference on Applications of Computer Vision Workshops 2020 • Wenhe Liu, Guoliang Kang, Po-Yao Huang, Xiaojun Chang, Yijun Qian, Junwei Liang, Liangke Gui, Jing Wen, Peng Chen
We propose an Efficient Activity Detection System, Argus, for Extended Video Analysis in the surveillance scenario.
no code implementations • 1 Feb 2020 • Lijun Yu, Peng Chen, Wenhe Liu, Guoliang Kang, Alexander G. Hauptmann
To deal with the aforementioned problems, in this paper, we propose a training-free monocular 3D event detection system for traffic surveillance.
1 code implementation • ICCV 2019 • Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang
In this paper, we exploit the semantic structure of open set data from two aspects: 1) Semantic Categorical Alignment, which aims to achieve good separability of target known classes by categorically aligning the centroid of target with the source.
6 code implementations • 17 Apr 2019 • Chao Li, Zhiyuan Liu, Mengmeng Wu, Yuchi Xu, Pipei Huang, Huan Zhao, Guoliang Kang, Qiwei Chen, Wei Li, Dik Lun Lee
Industrial recommender systems usually consist of a matching stage and a ranking stage, in order to handle billion-scale users and items.
Ranked #1 on Information Retrieval on Amazon
1 code implementation • 13 Apr 2019 • Guoliang Kang, Jun Li, DaCheng Tao
Dropout has played an essential role in many successful deep neural networks, by inducing regularization in the model training.
2 code implementations • CVPR 2019 • Guoliang Kang, Lu Jiang, Yi Yang, Alexander G. Hauptmann
Unsupervised Domain Adaptation (UDA) makes predictions for the target domain data while manual annotations are only available in the source domain.
Ranked #9 on Domain Adaptation on Office-31
2 code implementations • 22 Aug 2018 • Yang He, Xuanyi Dong, Guoliang Kang, Yanwei Fu, Chenggang Yan, Yi Yang
With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable.
6 code implementations • 21 Aug 2018 • Yang He, Guoliang Kang, Xuanyi Dong, Yanwei Fu, Yi Yang
Therefore, the network trained by our method has a larger model capacity to learn from the training data.
1 code implementation • ECCV 2018 • Xiaolin Zhang, Yunchao Wei, Guoliang Kang, Yi Yang, Thomas Huang
A stagewise approach is proposed to incorporate highly confident object regions to learn the SPG masks.
Ranked #1 on Weakly-Supervised Object Localization on ILSVRC 2015
no code implementations • ECCV 2018 • Guoliang Kang, Liang Zheng, Yan Yan, Yi Yang
Second, we estimate the posterior label distribution of the unlabeled data for target network training.
2 code implementations • CVPR 2018 • Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao
To this end, we propose to preserve two types of unsupervised similarities, 1) self-similarity of an image before and after translation, and 2) domain-dissimilarity of a translated source image and a target image.
no code implementations • 22 Sep 2017 • Xuanyi Dong, Guoliang Kang, Kun Zhan, Yi Yang
For most state-of-the-art architectures, the Rectified Linear Unit (ReLU) has become a standard component accompanying each layer.
Ranked #11 on Image Classification on SVHN
18 code implementations • 16 Aug 2017 • Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, Yi Yang
In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN).
Ranked #2 on Image Classification on Fashion-MNIST
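As a rough illustration of the Random Erasing idea, the sketch below overwrites one randomly placed rectangle of a training image with random pixel values. It is a minimal pure-Python sketch under stated assumptions, not the authors' code: the image is a grayscale list of rows with values in 0..255, and the parameter names and defaults (`p`, `area_range`, `min_aspect`, `max_attempts`) are illustrative choices rather than values taken from this listing.

```python
import math
import random


def random_erasing(image, p=0.5, area_range=(0.02, 0.4), min_aspect=0.3,
                   max_attempts=100, rng=None):
    """Return a copy of `image` with one random rectangle filled with noise.

    `image` is a list of rows of ints in 0..255 (assumed grayscale).
    With probability 1 - p the copy is returned unchanged.
    """
    rng = rng or random.Random()
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]  # never mutate the caller's image
    if rng.random() >= p:
        return out
    for _ in range(max_attempts):
        # Sample the rectangle's area and aspect ratio, then derive its size.
        target_area = rng.uniform(*area_range) * height * width
        aspect = rng.uniform(min_aspect, 1.0 / min_aspect)
        h = int(round(math.sqrt(target_area * aspect)))
        w = int(round(math.sqrt(target_area / aspect)))
        if 0 < h < height and 0 < w < width:
            top = rng.randint(0, height - h)
            left = rng.randint(0, width - w)
            for y in range(top, top + h):
                for x in range(left, left + w):
                    out[y][x] = rng.randint(0, 255)
            return out
    return out  # no rectangle fit within max_attempts; leave image as is
```

In a training loop, such a function would typically be applied per sample after other augmentations, so that each epoch sees a differently occluded version of the same image.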
no code implementations • 22 Jul 2017 • Guoliang Kang, Xuanyi Dong, Liang Zheng, Yi Yang
This paper focuses on regularizing the training of the convolutional neural network (CNN).