28 Dec 2023 • Nir Yellinek, Leonid Karlinsky, Raja Giryes
Vision-Language models (VLMs) have proved effective at aligning image and text representations, producing superior zero-shot results when transferred to many downstream tasks.