1 code implementation • 18 Dec 2024 • Rui Cai, Zhiyu Dong, Jianfeng Dong, Xun Wang
A common parameter-efficient solution is to use adapter modules to transfer the vision-language alignment ability of Vision-Language Pretraining (VLP) models from a source language to a target language.
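
To make the idea of an adapter module concrete, here is a minimal sketch of a generic bottleneck adapter with a residual connection, written in plain Python. It is not the method of this paper: the class name `BottleneckAdapter`, the dimensions, and the zero initialization of the up-projection are all illustrative assumptions. In this kind of setup the pretrained VLP weights would stay frozen and only the small adapter matrices would be trained on target-language data.

```python
import random


def relu(v):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, x) for x in v]


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


class BottleneckAdapter:
    """Hypothetical adapter: h + W_up * relu(W_down * h).

    Assumption: W_up starts at zero, so the adapter is the identity
    before training and leaves the frozen model's features unchanged.
    """

    def __init__(self, dim, bottleneck, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        self.bottleneck = bottleneck
        # Down-projection: dim -> bottleneck, small random values.
        self.W_down = [[rng.gauss(0.0, 0.02) for _ in range(dim)]
                       for _ in range(bottleneck)]
        # Up-projection: bottleneck -> dim, zero-initialized.
        self.W_up = [[0.0] * bottleneck for _ in range(dim)]

    def __call__(self, h):
        z = relu(matvec(self.W_down, h))   # compress to the bottleneck
        delta = matvec(self.W_up, z)       # expand back to model width
        return [x + d for x, d in zip(h, delta)]  # residual connection

    def num_params(self):
        """Trainable parameters: two projection matrices, no biases."""
        return 2 * self.dim * self.bottleneck


adapter = BottleneckAdapter(dim=8, bottleneck=2)
hidden = [1.0] * 8
output = adapter(hidden)
```

With a hidden width of 8 and a bottleneck of 2, the adapter adds 32 trainable parameters, which illustrates why the approach is called parameter-efficient: the bottleneck width, not the size of the backbone, controls the training cost.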