1 code implementation • 8 Aug 2023 • Yizhuo Lu, Changde Du, Qiongyi Zhou, Dianpeng Wang, Huiguang He
In Stage 2, we utilize the CLIP visual feature decoded from fMRI as supervisory information, and continually adjust the two feature vectors decoded in Stage 1 through backpropagation to align the structural information.
no code implementations • 24 Mar 2023 • Yizhuo Lu, Changde Du, Dianpeng Wang, Huiguang He
In Stage 1, the VQ-VAE latent representations and the CLIP text embeddings decoded from fMRI are put into the image-to-image process of Stable Diffusion, which yields a preliminary image that contains semantic and structural information.