no code implementations • 15 Aug 2017 • Li Ziming, Kiseleva Julia, de Rijke Maarten, Grotov Artem
It is natural to represent the behavior of users who are engaging with interactive systems such as a search engine or a recommender system, as a sequence of actions where each next action depends on the current situation and the user reward of taking a particular action.