Search Results for author: Olivier Jeunen

Found 18 papers, 6 papers with code

Optimal Baseline Corrections for Off-Policy Contextual Bandits

no code implementations • 9 May 2024 • Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, Maarten de Rijke

The foundation of our framework is the derivation of an equivalent baseline correction for all of the existing control variates.

Paper
Add Code

Multi-Objective Recommendation via Multivariate Policy Learning

no code implementations • 3 May 2024 • Olivier Jeunen, Jatin Mandav, Ivan Potapov, Nakul Agarwal, Sourabh Vaid, Wenzhe Shi, Aleksei Ustimenko

We frame this as a decision-making task, where the scalarisation weights are actions taken to maximise an overall North Star reward (e. g. long-term user retention or growth).

Decision Making Fairness +1

Paper
Add Code

Learning Metrics that Maximise Power for Accelerated A/B-Tests

no code implementations • 6 Feb 2024 • Olivier Jeunen, Aleksei Ustimenko

Online controlled experiments are a crucial tool to allow for confident decision-making in technology companies.

Decision Making

Paper
Add Code

Variance Reduction in Ratio Metrics for Efficient Online Experiments

no code implementations • 8 Jan 2024 • Shubham Baweja, Neeti Pokharna, Aleksei Ustimenko, Olivier Jeunen

The main culprit for this inefficiency is the variance of the online metrics.

Paper
Add Code

Learning-to-Rank with Nested Feedback

no code implementations • 8 Jan 2024 • Hitesh Sagtani, Olivier Jeunen, Aleksei Ustimenko

Many platforms on the web present ranked lists of content to users, typically optimized for engagement-, satisfaction- or retention- driven metrics.

Learning-To-Rank

Paper
Add Code

On Gradient Boosted Decision Trees and Neural Rankers: A Case-Study on Short-Video Recommendations at ShareChat

no code implementations • 4 Dec 2023 • Olivier Jeunen, Hitesh Sagtani, Himanshu Doi, Rasul Karimov, Neeti Pokharna, Danish Kalim, Aleksei Ustimenko, Christopher Green, Wenzhe Shi, Rishabh Mehrotra

We highlight (1) neural networks' ability to handle large training data size, user- and item-embeddings allows for more accurate models than GBDTs in this setting, and (2) because GBDTs are less reliant on specialised hardware, they can provide an equally accurate model at a lower cost.

Paper
Add Code

Ad-load Balancing via Off-policy Learning in a Content Marketplace

no code implementations • 19 Sep 2023 • Hitesh Sagtani, Madan Jhawar, Rishabh Mehrotra, Olivier Jeunen

We start by presenting a motivating analysis of the ad-load balancing problem, highlighting the conflicting objectives between user satisfaction and ads revenue.

Paper
Add Code

Offline Recommender System Evaluation under Unobserved Confounding

1 code implementation • 8 Sep 2023 • Olivier Jeunen, Ben London

Because the data collection policy is typically under the practitioner's control, the unconfoundedness assumption is often left implicit, and its violations are rarely dealt with in the existing literature.

Decision Making Recommendation Systems

Paper
Code

On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-$n$ Recommendation

1 code implementation • 27 Jul 2023 • Olivier Jeunen, Ivan Potapov, Aleksei Ustimenko

Through a correlation analysis between off- and on-line experiments conducted on a large-scale recommendation platform, we show that our unbiased DCG estimates strongly correlate with online reward, even when some of the metric's inherent assumptions are violated.

Information Retrieval Off-policy evaluation

Paper
Code

A Probabilistic Position Bias Model for Short-Video Recommendation Feeds

1 code implementation • 26 Jul 2023 • Olivier Jeunen

Empirical insights from a large-scale social media platform show how our probabilistic position bias model more accurately captures empirical exposure than existing models, and paves the way for unbiased evaluation and learning-to-rank.

Learning-To-Rank Position

Paper
Code

RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation

1 code implementation • 15 Jun 2023 • Gabriel Bénédict, Olivier Jeunen, Samuele Papa, Samarth Bhargav, Daan Odijk, Maarten de Rijke

In this paper we propose RecFusion, which comprise a set of diffusion models for recommendation.

Recommendation Systems

Paper
Code

A Common Misassumption in Online Experiments with Machine Learning Models

1 code implementation • 21 Apr 2023 • Olivier Jeunen

Online experiments such as Randomised Controlled Trials (RCTs) or A/B-tests are the bread and butter of modern platforms on the web.

Paper
Code

Practical Bandits: An Industry Perspective

no code implementations • 2 Feb 2023 • Bram van den Akker, Olivier Jeunen, Ying Li, Ben London, Zahra Nazari, Devesh Parekh

The research literature on these topics is broad and vast, but this can overwhelm practitioners, whose primary aim is to solve practical problems, and therefore need to decide on a specific instantiation or approach for each project.

Decision Making Decision Making Under Uncertainty

Paper
Add Code

Disentangling Causal Effects from Sets of Interventions in the Presence of Unobserved Confounders

1 code implementation • 11 Oct 2022 • Olivier Jeunen, Ciarán M. Gilligan-Lee, Rishabh Mehrotra, Mounia Lalmas

We address this challenge by aiming to learn the effect of a single-intervention from both observational data and sets of interventions.

Causal Inference

Paper
Code

Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation

no code implementations • 18 Sep 2022 • Imad Aouali, Amine Benhalloum, Martin Bompaire, Benjamin Heymann, Olivier Jeunen, David Rohde, Otmane Sakhi, Flavian vasile

Naturally, the reason for this is that we can directly measure utility metrics that rely on interventions, being the recommendations that are being shown to users.

counterfactual Recommendation Systems

Paper
Add Code

Learning from Bandit Feedback: An Overview of the State-of-the-art

no code implementations • 18 Sep 2019 • Olivier Jeunen, Dmytro Mykhaylov, David Rohde, Flavian vasile, Alexandre Gilotte, Martin Bompaire

In order to handle this "bandit-feedback" setting, several Counterfactual Risk Minimisation (CRM) methods have been proposed in recent years, that attempt to estimate the performance of different policies on historical data.

counterfactual Recommendation Systems

Paper
Add Code

On the Value of Bandit Feedback for Offline Recommender System Evaluation

no code implementations • 26 Jul 2019 • Olivier Jeunen, David Rohde, Flavian vasile

The procedure aims to give an answer to the question: "Given the natural sequence of user-item interactions up to time t, can we predict which item the user will interact with at time t+1?".

Recommendation Systems

Paper
Add Code

Three Methods for Training on Bandit Feedback

no code implementations • 24 Apr 2019 • Dmytro Mykhaylov, David Rohde, Flavian vasile, Martin Bompaire, Olivier Jeunen

There are three quite distinct ways to train a machine learning model on recommender system logs.

Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.