Search Results for author: Haikai Chen

Found 2 papers, 0 papers with code

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang

First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.

Paper
Add Code

Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication

no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Rundong Wang, Xinrun Wang, Runsheng Yu, Xin Li, Zhirong Wang

Thus, the global policy of the whole page could be sub-optimal.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.