no code implementations • 1 Apr 2024 • Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon, Shunghyun Choi, Yeong Hyeon Gu
Vision-language models (VLMs) have made significant strides in cross-modal understanding through large-scale paired datasets.
1 code implementation • 1 Apr 2024 • Chull Hwan Song, Jooyoung Yoon, Taebaek Hwang, Shunghyun Choi, Yeong Hyeon Gu, Yannis Avrithis
How important is it for training and evaluation sets to not have class overlap in image retrieval?
no code implementations • ICCV 2023 • Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon, Shunghyun Choi, Yeong Hyeon Gu
Many studies in vision tasks have aimed to create effective embedding spaces for single-label object prediction within an image.