no code implementations • 26 Aug 2024 • Chaohua Shi, Xuan Wang, Si Shi, Xule Wang, Mingrui Zhu, Nannan Wang, Xinbo Gao
However, existing diffusion models face challenges in processing and fusing information from multiple images and lack access to high-quality publicly available datasets, which prevents the application of diffusion models in food image composition.
1 code implementation • 10 Jan 2024 • Zhiqiang Guo, GuoHui Li, Jianjun Li, Chaoyang Wang, Si Shi
To address this problem, we propose a Dual Disentangled Variational AutoEncoder (DualVAE) for collaborative recommendation, which combines disentangled representation learning with variational inference to facilitate the generation of implicit interaction data.
1 code implementation • 27 Dec 2023 • Zhiqiang Guo, Jianjun Li, GuoHui Li, Chaoyang Wang, Si Shi, Bin Ruan
The multimodal recommendation has gradually become the infrastructure of online media platforms, enabling them to provide personalized service to users through a joint modeling of user historical behaviors (e. g., purchases, clicks) and item various modalities (e. g., visual and textual).