Search Results for author: Olivier Jeunen

Found 18 papers, 6 papers with code

Optimal Baseline Corrections for Off-Policy Contextual Bandits

no code implementations9 May 2024 Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, Maarten de Rijke

The foundation of our framework is the derivation of an equivalent baseline correction for all of the existing control variates.

Multi-Objective Recommendation via Multivariate Policy Learning

no code implementations3 May 2024 Olivier Jeunen, Jatin Mandav, Ivan Potapov, Nakul Agarwal, Sourabh Vaid, Wenzhe Shi, Aleksei Ustimenko

We frame this as a decision-making task, where the scalarisation weights are actions taken to maximise an overall North Star reward (e. g. long-term user retention or growth).

Decision Making Fairness +1

Learning Metrics that Maximise Power for Accelerated A/B-Tests

no code implementations6 Feb 2024 Olivier Jeunen, Aleksei Ustimenko

Online controlled experiments are a crucial tool to allow for confident decision-making in technology companies.

Decision Making

Learning-to-Rank with Nested Feedback

no code implementations8 Jan 2024 Hitesh Sagtani, Olivier Jeunen, Aleksei Ustimenko

Many platforms on the web present ranked lists of content to users, typically optimized for engagement-, satisfaction- or retention- driven metrics.

Learning-To-Rank

On Gradient Boosted Decision Trees and Neural Rankers: A Case-Study on Short-Video Recommendations at ShareChat

no code implementations4 Dec 2023 Olivier Jeunen, Hitesh Sagtani, Himanshu Doi, Rasul Karimov, Neeti Pokharna, Danish Kalim, Aleksei Ustimenko, Christopher Green, Wenzhe Shi, Rishabh Mehrotra

We highlight (1) neural networks' ability to handle large training data size, user- and item-embeddings allows for more accurate models than GBDTs in this setting, and (2) because GBDTs are less reliant on specialised hardware, they can provide an equally accurate model at a lower cost.

Ad-load Balancing via Off-policy Learning in a Content Marketplace

no code implementations19 Sep 2023 Hitesh Sagtani, Madan Jhawar, Rishabh Mehrotra, Olivier Jeunen

We start by presenting a motivating analysis of the ad-load balancing problem, highlighting the conflicting objectives between user satisfaction and ads revenue.

Offline Recommender System Evaluation under Unobserved Confounding

1 code implementation8 Sep 2023 Olivier Jeunen, Ben London

Because the data collection policy is typically under the practitioner's control, the unconfoundedness assumption is often left implicit, and its violations are rarely dealt with in the existing literature.

Decision Making Recommendation Systems

On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-$n$ Recommendation

1 code implementation27 Jul 2023 Olivier Jeunen, Ivan Potapov, Aleksei Ustimenko

Through a correlation analysis between off- and on-line experiments conducted on a large-scale recommendation platform, we show that our unbiased DCG estimates strongly correlate with online reward, even when some of the metric's inherent assumptions are violated.

Information Retrieval Off-policy evaluation

A Probabilistic Position Bias Model for Short-Video Recommendation Feeds

1 code implementation26 Jul 2023 Olivier Jeunen

Empirical insights from a large-scale social media platform show how our probabilistic position bias model more accurately captures empirical exposure than existing models, and paves the way for unbiased evaluation and learning-to-rank.

Learning-To-Rank Position

A Common Misassumption in Online Experiments with Machine Learning Models

1 code implementation21 Apr 2023 Olivier Jeunen

Online experiments such as Randomised Controlled Trials (RCTs) or A/B-tests are the bread and butter of modern platforms on the web.

Practical Bandits: An Industry Perspective

no code implementations2 Feb 2023 Bram van den Akker, Olivier Jeunen, Ying Li, Ben London, Zahra Nazari, Devesh Parekh

The research literature on these topics is broad and vast, but this can overwhelm practitioners, whose primary aim is to solve practical problems, and therefore need to decide on a specific instantiation or approach for each project.

Decision Making Decision Making Under Uncertainty

Disentangling Causal Effects from Sets of Interventions in the Presence of Unobserved Confounders

1 code implementation11 Oct 2022 Olivier Jeunen, Ciarán M. Gilligan-Lee, Rishabh Mehrotra, Mounia Lalmas

We address this challenge by aiming to learn the effect of a single-intervention from both observational data and sets of interventions.

Causal Inference

Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation

no code implementations18 Sep 2022 Imad Aouali, Amine Benhalloum, Martin Bompaire, Benjamin Heymann, Olivier Jeunen, David Rohde, Otmane Sakhi, Flavian vasile

Naturally, the reason for this is that we can directly measure utility metrics that rely on interventions, being the recommendations that are being shown to users.

counterfactual Recommendation Systems

Learning from Bandit Feedback: An Overview of the State-of-the-art

no code implementations18 Sep 2019 Olivier Jeunen, Dmytro Mykhaylov, David Rohde, Flavian vasile, Alexandre Gilotte, Martin Bompaire

In order to handle this "bandit-feedback" setting, several Counterfactual Risk Minimisation (CRM) methods have been proposed in recent years, that attempt to estimate the performance of different policies on historical data.

counterfactual Recommendation Systems

On the Value of Bandit Feedback for Offline Recommender System Evaluation

no code implementations26 Jul 2019 Olivier Jeunen, David Rohde, Flavian vasile

The procedure aims to give an answer to the question: "Given the natural sequence of user-item interactions up to time t, can we predict which item the user will interact with at time t+1?".

Recommendation Systems

Three Methods for Training on Bandit Feedback

no code implementations24 Apr 2019 Dmytro Mykhaylov, David Rohde, Flavian vasile, Martin Bompaire, Olivier Jeunen

There are three quite distinct ways to train a machine learning model on recommender system logs.

Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.