1 code implementation • 28 Nov 2023 • Zhihe Lu, Jiawang Bai, Xin Li, Zeyu Xiao, Xinchao Wang
However, performance advancements are limited when relying solely on intricate algorithmic designs for a single model, even one exhibiting strong performance, e.g., CLIP-ViT-B/16.
Ranked #2 on Prompt Engineering on ImageNet
1 code implementation • NeurIPS 2023 • Xin Li, Dongze Lian, Zhihe Lu, Jiawang Bai, Zhibo Chen, Xinchao Wang
To mitigate that, we propose an effective adapter-style tuning strategy, dubbed GraphAdapter, which performs the textual adapter by explicitly modeling the dual-modality structure knowledge (i.e., the correlation of different semantics/classes in textual and visual modalities) with a dual knowledge graph.
1 code implementation • 3 Jun 2023 • Zhihe Lu, Shuicheng Yan, Xinchao Wang
In this paper, we investigate, for the first time, efficient multi-grained knowledge reuse for CISS and propose a novel method, Evolving kNowleDge minING (ENDING), which employs a frozen backbone.
Class-Incremental Semantic Segmentation • Knowledge Distillation
no code implementations • 23 May 2023 • Zeyu Xiao, Jiawang Bai, Zhihe Lu, Zhiwei Xiong
This motivates investigating and incorporating prior knowledge to effectively constrain the solution space and enhance the quality of the restored images.
no code implementations • 11 May 2023 • Zhihe Lu, Zeyu Xiao, Jiawang Bai, Zhiwei Xiong, Xinchao Wang
To use the SAM-based prior, we propose a simple yet effective module, the SAM-guidEd refinEment Module (SEEM), which uses semantic information to enhance both the alignment and fusion procedures.
1 code implementation • CVPR 2023 • Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang
Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned general visual representations and broad visual concepts.
no code implementations • 15 Oct 2022 • Zhihe Lu, Sen He, Da Li, Yi-Zhe Song, Tao Xiang
To ensure that the fused scores are not biased to either the base or novel classes, a new Transformer-based calibration module is introduced.
Generalized Few-Shot Semantic Segmentation • Semantic Segmentation
1 code implementation • ICCV 2021 • Zhihe Lu, Sen He, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang
A few-shot semantic segmentation model is typically composed of a CNN encoder, a CNN decoder and a simple classifier (separating foreground and background pixels).
2 code implementations • CVPR 2020 • Zhihe Lu, Yongxin Yang, Xiatian Zhu, Cong Liu, Yi-Zhe Song, Tao Xiang
A common strategy adopted by existing state-of-the-art unsupervised domain adaptation (UDA) methods is to employ two classifiers to identify the misaligned local regions between the source and target domains.
no code implementations • 10 Dec 2017 • Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan
An expression-invariant face recognition experiment is also performed to further show the advantages of our proposed method.
no code implementations • 15 Jun 2017 • Zhihe Lu, Zhihang Li, Jie Cao, Ran He, Zhenan Sun
Face synthesis has been a fascinating yet challenging problem in computer vision and machine learning.