no code implementations • 3 Aug 2023 • Otmane Sakhi, David Rohde, Nicolas Chopin
We compare our method to the commonly adopted Plackett-Luce policy class and demonstrate the effectiveness of our approach on problems with action space sizes in the order of millions.
no code implementations • 24 Oct 2022 • Otmane Sakhi, Pierre Alquier, Nicolas Chopin
This paper introduces a new principled approach for off-policy learning in contextual bandits.
no code implementations • 18 Sep 2022 • Imad Aouali, Amine Benhalloum, Martin Bompaire, Benjamin Heymann, Olivier Jeunen, David Rohde, Otmane Sakhi, Flavian vasile
Naturally, the reason for this is that we can directly measure utility metrics that rely on interventions, being the recommendations that are being shown to users.
no code implementations • 10 Aug 2022 • Imad Aouali, Achraf Ait Sidi Hammou, Sergey Ivanov, Otmane Sakhi, David Rohde, Flavian vasile
We introduce Probabilistic Rank and Reward (PRR), a scalable probabilistic model for personalized slate recommendation.
no code implementations • 8 Aug 2022 • Otmane Sakhi, David Rohde, Alexandre Gilotte
Personalised interactive systems such as recommender systems require selecting relevant items from massive catalogs dependent on context.
no code implementations • 13 Nov 2020 • Otmane Sakhi, Louis Faury, Flavian vasile
Our approach relies on the construction of asymptotic confidence intervals for offline contextual bandits through the DRO framework.
no code implementations • 28 Aug 2020 • Otmane Sakhi, Stephen Bonner, David Rohde, Flavian vasile
In contrast, the organic signal is typically strong and covers most items, but is not always relevant to the recommendation task.
no code implementations • 2 Oct 2019 • Otmane Sakhi, Stephen Bonner, David Rohde, Flavian vasile
The combination of the re-parameterization trick with the use of variational auto-encoders has caused a sensation in Bayesian deep learning, allowing the training of realistic generative models of images and has considerably increased our ability to use scalable latent variable models.