Search Results for author: David Rohde

Found 22 papers, 2 papers with code

Bayesian Off-Policy Evaluation and Learning for Large Action Spaces

no code implementations22 Feb 2024 Imad Aouali, Victor-Emmanuel Brunel, David Rohde, Anna Korba

In this framework, we propose sDM, a generic Bayesian approach designed for OPE and OPL, grounded in both algorithmic and theoretical foundations.

Computational Efficiency Off-policy evaluation

Fast Slate Policy Optimization: Going Beyond Plackett-Luce

no code implementations3 Aug 2023 Otmane Sakhi, David Rohde, Nicolas Chopin

We compare our method to the commonly adopted Plackett-Luce policy class and demonstrate the effectiveness of our approach on problems with action space sizes in the order of millions.

Information Retrieval Recommendation Systems +1

Exponential Smoothing for Off-Policy Learning

no code implementations25 May 2023 Imad Aouali, Victor-Emmanuel Brunel, David Rohde, Anna Korba

In particular, it is also valid for standard IPS without making the assumption that the importance weights are bounded.

valid

Learning from aggregated data with a maximum entropy model

1 code implementation5 Oct 2022 Alexandre Gilotte, Ahmed Ben Yahmed, David Rohde

Aggregating a dataset, then injecting some noise, is a simple and common way to release differentially private data. However, aggregated data -- even without noise -- is not an appropriate input for machine learning classifiers. In this work, we show how a new model, similar to a logistic regression, may be learned from aggregated data only by approximating the unobserved feature distribution with a maximum entropy hypothesis.

regression

Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation

no code implementations18 Sep 2022 Imad Aouali, Amine Benhalloum, Martin Bompaire, Benjamin Heymann, Olivier Jeunen, David Rohde, Otmane Sakhi, Flavian vasile

Naturally, the reason for this is that we can directly measure utility metrics that rely on interventions, being the recommendations that are being shown to users.

counterfactual Recommendation Systems

Probabilistic Rank and Reward: A Scalable Model for Slate Recommendation

no code implementations10 Aug 2022 Imad Aouali, Achraf Ait Sidi Hammou, Sergey Ivanov, Otmane Sakhi, David Rohde, Flavian vasile

We introduce Probabilistic Rank and Reward (PRR), a scalable probabilistic model for personalized slate recommendation.

Recommendation Systems

Fast Offline Policy Optimization for Large Scale Recommendation

no code implementations8 Aug 2022 Otmane Sakhi, David Rohde, Alexandre Gilotte

Personalised interactive systems such as recommender systems require selecting relevant items from massive catalogs dependent on context.

Recommendation Systems

Causal Inference, is just Inference: A beautifully simple idea that not everyone accepts

no code implementations NeurIPS Workshop ICBINB 2021 David Rohde

It is often argued that causal inference is a step that follows probabilistic estimation in a two step procedure, with a separate statistical estimation and causal inference step and each step is governed by its own principles.

Causal Inference

Combining Reward and Rank Signals for Slate Recommendation

no code implementations26 Jul 2021 Imad Aouali, Sergey Ivanov, Mike Gartrell, David Rohde, Flavian vasile, Victor Zaytsev, Diego Legrand

In this paper, we formulate several Bayesian models that incorporate the reward signal (Reward model), the rank signal (Rank model), or both (Full model), for non-personalized slate recommendation.

Recommendation Systems

From Clicks to Conversions: Recommendation for long-term reward

no code implementations1 Sep 2020 Philomène Chagniot, Flavian vasile, David Rohde

Recommender systems are often optimised for short-term reward: a recommendation is considered successful if a reward (e. g. a click) can be observed immediately after the recommendation.

Recommendation Systems

BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals

no code implementations28 Aug 2020 Otmane Sakhi, Stephen Bonner, David Rohde, Flavian vasile

In contrast, the organic signal is typically strong and covers most items, but is not always relevant to the recommendation task.

Recommendation Systems

Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks

no code implementations2 Oct 2019 Otmane Sakhi, Stephen Bonner, David Rohde, Flavian vasile

The combination of the re-parameterization trick with the use of variational auto-encoders has caused a sensation in Bayesian deep learning, allowing the training of realistic generative models of images and has considerably increased our ability to use scalable latent variable models.

Binary Classification General Classification +1

Learning from Bandit Feedback: An Overview of the State-of-the-art

no code implementations18 Sep 2019 Olivier Jeunen, Dmytro Mykhaylov, David Rohde, Flavian vasile, Alexandre Gilotte, Martin Bompaire

In order to handle this "bandit-feedback" setting, several Counterfactual Risk Minimisation (CRM) methods have been proposed in recent years, that attempt to estimate the performance of different policies on historical data.

counterfactual Recommendation Systems

On the Value of Bandit Feedback for Offline Recommender System Evaluation

no code implementations26 Jul 2019 Olivier Jeunen, David Rohde, Flavian vasile

The procedure aims to give an answer to the question: "Given the natural sequence of user-item interactions up to time t, can we predict which item the user will interact with at time t+1?".

Recommendation Systems

Replacing the do-calculus with Bayes rule

no code implementations17 Jun 2019 Finnian Lattimore, David Rohde

The concept of causality has a controversial history.

A Bayesian Solution to the M-Bias Problem

no code implementations17 Jun 2019 David Rohde

It is common practice in using regression type models for inferring causal effects, that inferring the correct causal relationship requires extra covariates are included or ``adjusted for''.

Causal Inference

Latent Variable Session-Based Recommendation

no code implementations pproximateinference AABI Symposium 2019 David Rohde, Stephen Bonner

An attractive feature of the latent variable approach is that, as the user continues to act, the posterior on the user's state tightens reflecting the recommender system's increased knowledge about that user.

Feature Engineering Session-Based Recommendations

Three Methods for Training on Bandit Feedback

no code implementations24 Apr 2019 Dmytro Mykhaylov, David Rohde, Flavian vasile, Martin Bompaire, Olivier Jeunen

There are three quite distinct ways to train a machine learning model on recommender system logs.

Recommendation Systems

RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising

1 code implementation2 Aug 2018 David Rohde, Stephen Bonner, Travis Dunlop, Flavian vasile, Alexandros Karatzoglou

Recommender Systems are becoming ubiquitous in many settings and take many forms, from product recommendation in e-commerce stores, to query suggestions in search engines, to friend recommendation in social networks.

Product Recommendation Recommendation Systems +2

Cannot find the paper you are looking for? You can Submit a new open access paper.