no code implementations • 23 Apr 2024 • Hongyu Chen, Yiqi Gao, Min Zhou, Peng Wang, Xubin Li, Tiezheng Ge, Bo Zheng
Meanwhile, a network, dubbed as Masked ControlNet, is designed to utilize these object masks for object generation in the misaligned visual control region.
no code implementations • 22 Apr 2024 • Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang
Diffusion models have significantly advanced the state of the art in image, audio, and video generation tasks.
no code implementations • 22 Apr 2024 • Chengrui Wang, PengFei Liu, Min Zhou, Ming Zeng, Xubin Li, Tiezheng Ge, Bo Zheng
The style guidance is a hand image, e. g., the malformed hand itself, and is employed to furnish the style reference for hand refining.
no code implementations • 23 Oct 2023 • Xian Li, Hongguang Shi, Yunfei Wang, Yeqin Zhang, Xubin Li, Cam-Tu Nguyen
Specifically, the recommendation predicts the long-term recommendation target based on the conversational context and the user history.
no code implementations • 29 Jun 2023 • Yu Tian, Bofang Li, Si Chen, Xubin Li, Hongbo Deng, Jian Xu, Bo Zheng, Qian Wang, Chenliang Li
Recently, Multi-Scenario Learning (MSL) is widely used in recommendation and retrieval systems in the industry because it facilitates transfer learning from different scenarios, mitigating data sparsity and reducing maintenance cost.
no code implementations • 9 May 2022 • Weixin Feng, Xingyuan Bu, Chenchen Zhang, Xubin Li
In this paper, we take advantage of language prompt to introduce effective and unbiased linguistic supervision into object detection, and propose a new mechanism called multimodal knowledge learning (\textbf{MKL}), which is required to learn knowledge from language supervision.
no code implementations • 9 May 2022 • Si Chen, Chen Lin, Wanxian Guan, Jiayi Wei, Xingyuan Bu, He guo, Hui Li, Xubin Li, Jian Xu, Bo Zheng
In this paper, we present a visual encoding framework for CTR prediction to overcome these problems.
1 code implementation • CVPR 2021 • Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song
Based on the adversarial losses of the generator and discriminator, we categorize GANs into two classes, Symmetric GANs and Asymmetric GANs, and introduce a novel gradient decomposition method to unify the two, allowing us to train both classes in one stage and hence alleviate the training effort.
no code implementations • 15 Oct 2018 • Yuan Gao, Xingyuan Bu, Yang Hu, Hui Shen, Ti Bai, Xubin Li, Shilei Wen
This report demonstrates our solution for the Open Images 2018 Challenge.