no code implementations • 12 Jan 2024 • Guiming Cao, Kaize Shi, Hong Fu, Huaiwen Zhang, Guandong Xu
Pre-trained Vision-Language (V-L) models set the benchmark for generalization to downstream tasks among the noteworthy contenders.