Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models

1 code implementation 28 Nov 2023 Zhihe Lu, Jiawang Bai, Xin Li, Zeyu Xiao, Xinchao Wang

However, performance advancements are limited when relying solely on intricate algorithmic designs for a single model, even one exhibiting strong performance, e.g., CLIP-ViT-B/16.

Prompt Engineering

GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph

1 code implementation NeurIPS 2023 Xin Li, Dongze Lian, Zhihe Lu, Jiawang Bai, Zhibo Chen, Xinchao Wang

To mitigate that, we propose an effective adapter-style tuning strategy, dubbed GraphAdapter, which performs the textual adapter by explicitly modeling the dual-modality structure knowledge (i.e., the correlation of different semantics/classes in textual and visual modalities) with a dual knowledge graph.

Transfer Learning

Evolving Knowledge Mining for Class Incremental Segmentation

1 code implementation 3 Jun 2023 Zhihe Lu, Shuicheng Yan, Xinchao Wang

In this paper, we investigate, for the first time, efficient multi-grained knowledge reuse for CISS, and propose a novel method, Evolving kNowleDge minING (ENDING), which employs a frozen backbone.

Class-Incremental Semantic Segmentation, Knowledge Distillation

A Dive into SAM Prior in Image Restoration

no code implementations 23 May 2023 Zeyu Xiao, Jiawang Bai, Zhihe Lu, Zhiwei Xiong

This motivates the investigation and incorporation of prior knowledge in order to effectively constrain the solution space and enhance the quality of the restored images.

Color Image Denoising, Image Denoising, +2

Can SAM Boost Video Super-Resolution?

no code implementations 11 May 2023 Zhihe Lu, Zeyu Xiao, Jiawang Bai, Zhiwei Xiong, Xinchao Wang

To use the SAM-based prior, we propose a simple yet effective module, the SAM-guidEd refinEment Module (SEEM), which can enhance both the alignment and fusion procedures by utilizing semantic information.

Optical Flow Estimation, Video Super-Resolution

Task Residual for Tuning Vision-Language Models

1 code implementation CVPR 2023 Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang

Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned general visual representations and broad visual concepts.

Transfer Learning

Stochastic Classifiers for Unsupervised Domain Adaptation

2 code implementations CVPR 2020 Zhihe Lu, Yongxin Yang, Xiatian Zhu, Cong Liu, Yi-Zhe Song, Tao Xiang

A common strategy adopted by existing state-of-the-art unsupervised domain adaptation (UDA) methods is to employ two classifiers to identify the misaligned local regions between the source and target domains.

Image Classification, Semantic Segmentation, +1

Geometry Guided Adversarial Facial Expression Synthesis

no code implementations 10 Dec 2017 Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan

An expression-invariant face recognition experiment is also performed to further show the advantages of our proposed method.

Face Recognition, Face Transfer, +2

Recent Progress of Face Image Synthesis

no code implementations 15 Jun 2017 Zhihe Lu, Zhihang Li, Jie Cao, Ran He, Zhenan Sun

Face synthesis has been a fascinating yet challenging problem in computer vision and machine learning.

Face Generation, Face Recognition

