no code implementations • 16 Dec 2020 • Yuting Chen, Yanshi Wang, Yabo Ni, An-Xiang Zeng, Lanfen Lin
Finally, we employ a novel mutual unit to adaptively learn the similarity between various scenarios and incorporate it into multi-branch network.
Cultural Vocal Bursts Intensity Prediction Recommendation Systems
no code implementations • 25 May 2020 • Jianxiong Wei, An-Xiang Zeng, Yueqiu Wu, Peng Guo, Qingsong Hua, Qingpeng Cai
In this paper, we present a novel Generator and Critic slate re-ranking approach, where the Critic evaluates the slate and the Generator ranks the items by the reinforcement learning approach.
no code implementations • 25 Mar 2020 • Guangda Huzhang, Zhen-Jia Pang, Yongqing Gao, Yawen Liu, Weijie Shen, Wen-Ji Zhou, Qing Da, An-Xiang Zeng, Han Yu, Yang Yu, Zhi-Hua Zhou
The framework consists of an evaluator that generalizes to evaluate recommendations involving the context, and a generator that maximizes the evaluator score by reinforcement learning, and a discriminator that ensures the generalization of the evaluator.
no code implementations • 27 Aug 2019 • Qizhi Zhang, Yi Lin, Kangle Wu, Yongliang Li, An-Xiang Zeng
But this method repudiate the variousness of the interest of user in a session.
no code implementations • 18 Nov 2018 • Feiyang Pan, Qingpeng Cai, An-Xiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang
Model-free reinforcement learning methods such as the Proximal Policy Optimization algorithm (PPO) have successfully applied in complex decision-making problems such as Atari games.
no code implementations • ECCV 2018 • Jie Song, Chengchao Shen, Jie Lei, An-Xiang Zeng, Kairi Ou, DaCheng Tao, Mingli Song
We propose a selective zero-shot classifier based on both the human defined and the automatically discovered residual attributes.
no code implementations • 2 Jul 2018 • Hua-Lin He, Chun-Xiang Pan, Qing Da, An-Xiang Zeng
In a large E-commerce platform, all the participants compete for impressions under the allocation mechanism of the platform.
no code implementations • 28 May 2018 • Yabo Ni, Dan Ou, Shichen Liu, Xiang Li, Wenwu Ou, An-Xiang Zeng, Luo Si
In this work, we propose to learn universal user representations across multiple tasks for more e ective personalization.
1 code implementation • NeurIPS 2018 • Zunlei Feng, Xinchao Wang, Chenglong Ke, An-Xiang Zeng, DaCheng Tao, Mingli Song
To achieve disentangling using the labeled pairs, we follow a "encoding-swap-decoding" process, where we first swap the parts of their encodings corresponding to the shared attribute and then decode the obtained hybrid codes to reconstruct the original input pairs.
1 code implementation • 25 May 2018 • Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, An-Xiang Zeng
Applying reinforcement learning in physical-world tasks is extremely challenging.
1 code implementation • 2 Mar 2018 • Yujing Hu, Qing Da, An-Xiang Zeng, Yang Yu, Yinghui Xu
For better utilizing the correlation between different ranking steps, in this paper, we propose to use reinforcement learning (RL) to learn an optimal ranking policy which maximizes the expected accumulative rewards in a search session.