no code implementations • 18 Sep 2024 • Jaehoon Joo, Taejin Jeong, Seongjae Hwang
That is, by extracting textual guidance from semantic information regions and visual guidance from perceptual information regions, Brain-Streams provides accurate multi-modal guidance to LDMs.
1 code implementation • 25 Jul 2024 • Gayoon Choi, Taejin Jeong, Sujung Hong, Jaehoon Joo, Seong Jae Hwang
A significant aspect that remains unexplored is the interaction between text and image embeddings.