no code implementations • 18 Jun 2024 • Hongpeng Pan, Shifeng Yi, Shouwei Yang, Lei Qi, Bing Hu, Yi Xu, Yang Yang
This misalignment hinders the zero-shot performance of VLM and the application of fine-tuning methods based on pseudo-labels.
Few-Shot Object Detection Language Modelling +4