Search Results for author: Yeong Hyeon Gu

SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining

Vision-language models (VLMs) have made significant strides in cross-modal understanding through large-scale paired datasets.

Paper
Add Code

How important is it for training and evaluation sets to not have class overlap in image retrieval?

Paper
Code

Many studies in vision tasks have aimed to create effective embedding spaces for single-label object prediction within an image.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.