Search Results for author: Morgane Lustman

Found 1 papers, 1 papers with code

Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology

3 code implementations • 29 May 2019 • Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Morgane Lustman, Vince Gatto, Paul Covington, Jim McFadden, Tushar Chandra, Craig Boutilier

(i) We develop SLATEQ, a decomposition of value-based temporal-difference and Q-learning that renders RL tractable with slates.

Q-Learning Recommendation Systems +2

30,954

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.