no code implementations • 23 Aug 2024 • Zitao Shuai, Chenwei Wu, Zhengxu Tang, Bowen Song, Liyue Shen
Through our investigation of DiT's latent space, we have uncovered key findings that unlock the potential for zero-shot fine-grained semantic editing: (1) Both the text and image spaces in DiTs are inherently decomposable.