Search Results for author: Zhengyuan Xie

Found 1 paper, 1 paper with code

GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery

1 code implementation • 15 Mar 2024 • Enguang Wang, Zhimao Peng, Zhengyuan Xie, Xialei Liu, Ming-Ming Cheng

Specifically, our TES leverages the property that CLIP can generate aligned vision-language features, converting visual embeddings into tokens for CLIP's text encoder to generate pseudo text embeddings.
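
The data flow described above can be illustrated with a toy sketch. This is not the paper's implementation: every function name, dimension, and weight below is a hypothetical placeholder, the projections are random rather than learned, and the "text encoder" is a simple mean-pool-and-project stand-in for CLIP's transformer. It only shows the shape of the idea: one visual embedding is mapped to a short sequence of pseudo tokens, and those tokens are encoded into a pseudo text embedding.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Random stand-in weights (assumption: a real model would learn these)."""
    scale = 1.0 / math.sqrt(cols)
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a rows-by-cols matrix with a vector of length cols."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def l2_normalize(vector):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def visual_to_pseudo_tokens(visual_embedding, projections):
    """Map one visual embedding to a token sequence, one token per projection."""
    return [matvec(projection, visual_embedding) for projection in projections]


def toy_text_encoder(tokens, output_weights):
    """Hypothetical stand-in for a text encoder: mean-pool tokens, then project."""
    dim = len(tokens[0])
    pooled = [sum(token[i] for token in tokens) / len(tokens) for i in range(dim)]
    return l2_normalize(matvec(output_weights, pooled))


rng = random.Random(0)
VISUAL_DIM, TOKEN_DIM, NUM_TOKENS = 8, 6, 4  # illustrative sizes only

projections = [make_matrix(TOKEN_DIM, VISUAL_DIM, rng) for _ in range(NUM_TOKENS)]
output_weights = make_matrix(VISUAL_DIM, TOKEN_DIM, rng)

visual_embedding = l2_normalize([rng.gauss(0.0, 1.0) for _ in range(VISUAL_DIM)])
pseudo_tokens = visual_to_pseudo_tokens(visual_embedding, projections)
pseudo_text_embedding = toy_text_encoder(pseudo_tokens, output_weights)
```

Here the pseudo text embedding is projected back to the same dimensionality as the visual embedding so the two could be compared in a shared space, which mirrors the aligned vision-language property the sentence refers to; how the actual method trains and uses these components is not specified in this listing.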
