no code implementations • 1 Sep 2020 • Philomène Chagniot, Flavian vasile, David Rohde
Recommender systems are often optimised for short-term reward: a recommendation is considered successful if a reward (e. g. a click) can be observed immediately after the recommendation.