no code implementations • 6 Nov 2019 • Fan Wang, Xiaomin Fang, Lihang Liu, Hao Tian, Zhiming Peng
The proposed method takes advantage of the characteristics of recommender systems and draws ideas from the model-based reinforcement learning method for higher sample efficiency.
1 code implementation • 1 Feb 2019 • Fan Wang, Xiaomin Fang, Lihang Liu, Yaxue Chen, Jiucheng Tao, Zhiming Peng, Cihang Jin, Hao Tian
On the one hand of this framework, an evaluation model is trained to evaluate the expected overall utility, by fully considering the user, item information and the correlations among the co-exposed items.