no code implementations • 7 Apr 2024 • Xiaoteng Shen, Rui Zhang, Xiaoyan Zhao, Jieming Zhu, Xi Xiao
Such user preferences are then fed into a generator, such as a multimodal LLM or diffusion model, to produce personalized content.
multimodal generation Reading Comprehension +1