no code implementations • 22 Jan 2024 • Liping Qiu, Qin Zhang, Xiaojun Chen, Shaotian Cai
Recently, the cross-modal pretraining model has been employed to produce meaningful pseudo-labels to supervise the training of an image clustering model.
no code implementations • 21 Aug 2022 • Shaotian Cai, Liping Qiu, Xiaojun Chen, Qin Zhang, Longteng Chen
In this paper, we propose to investigate the task of image clustering with the help of a visual-language pre-training model.
no code implementations • 17 Mar 2022 • Qinghong Lin, Xiaojun Chen, Qin Zhang, Shaotian Cai, Wenzhe Zhao, Hongfa Wang
Firstly, DSCH constructs a semantic component structure by uncovering the fine-grained semantics components of images with a Gaussian Mixture Modal~(GMM), where an image is represented as a mixture of multiple components, and the semantics co-occurrence are exploited.