Search Results for author: Olivier Cappé

Found 16 papers, 3 papers with code

A/B/n Testing with Control in the Presence of Subpopulations

no code implementations NeurIPS 2021 Yoan Russac, Christina Katsimerou, Dennis Bohle, Olivier Cappé, Aurélien Garivier, Wouter Koolen

At every time step, a subpopulation is sampled and an arm is chosen: the resulting observation is an independent draw from the arm conditioned on the subpopulation.

Fast Rate Learning in Stochastic First Price Bidding

no code implementations5 Jul 2021 Juliette Achddou, Olivier Cappé, Aurélien Garivier

First-price auctions have largely replaced traditional bidding approaches based on Vickrey auctions in programmatic advertising.

On Limited-Memory Subsampling Strategies for Bandits

1 code implementation21 Jun 2021 Dorian Baudry, Yoan Russac, Olivier Cappé

There has been a recent surge of interest in nonparametric bandit algorithms based on subsampling.

Efficient Algorithms for Stochastic Repeated Second-price Auctions

no code implementations10 Nov 2020 Juliette Achddou, Olivier Cappé, Aurélien Garivier

We further provide the first parametric lower bound for this problem that applies to generic UCB-like strategies.

Marketing

Self-Concordant Analysis of Generalized Linear Bandits with Forgetting

no code implementations2 Nov 2020 Yoan Russac, Louis Faury, Olivier Cappé, Aurélien Garivier

Contextual sequential decision problems with categorical or numerical observations are ubiquitous and Generalized Linear Bandits (GLB) offer a solid theoretical framework to address them.

A Comparative Study of Gamma Markov Chains for Temporal Non-Negative Matrix Factorization

1 code implementation23 Jun 2020 Louis Filstroff, Olivier Gouvert, Cédric Févotte, Olivier Cappé

Non-negative matrix factorization (NMF) has become a well-established class of methods for the analysis of non-negative data.

Time Series Time Series Analysis

Algorithms for Non-Stationary Generalized Linear Bandits

no code implementations23 Mar 2020 Yoan Russac, Olivier Cappé, Aurélien Garivier

The statistical framework of Generalized Linear Models (GLM) can be applied to sequential problems involving categorical or ordinal rewards associated, for instance, with clicks, likes or ratings.

Weighted Linear Bandits for Non-Stationary Environments

1 code implementation NeurIPS 2019 Yoan Russac, Claire Vernade, Olivier Cappé

To address this problem, we propose D-LinUCB, a novel optimistic algorithm based on discounted linear regression, where exponential weights are used to smoothly forget the past.

regression

Stochastic Bandit Models for Delayed Conversions

no code implementations28 Jun 2017 Claire Vernade, Olivier Cappé, Vianney Perchet

We assume that the probability of conversion associated with each action is unknown while the distribution of the conversion delay is known, distinguishing between the (idealized) case where the conversion events may be observed whatever their delay and the more realistic setting in which late conversions are censored.

Product Recommendation

Multiple-Play Bandits in the Position-Based Model

no code implementations NeurIPS 2016 Paul Lagrée, Claire Vernade, Olivier Cappé

Sequentially learning to place items in multi-position displays or lists is a task that can be cast into the multiple-play semi-bandit setting.

Position

Sequential ranking under random semi-bandit feedback

no code implementations4 Mar 2016 Hossein Vahabi, Paul Lagrée, Claire Vernade, Olivier Cappé

In many web applications, a recommendation is not a single item suggested to a user but a list of possibly interesting contents that may be ranked in some contexts.

On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

no code implementations16 Jul 2014 Emilie Kaufmann, Olivier Cappé, Aurélien Garivier

The stochastic multi-armed bandit model is a simple abstraction that has proven useful in many different contexts in statistics and machine learning.

LEMMA

On the Complexity of A/B Testing

no code implementations13 May 2014 Emilie Kaufmann, Olivier Cappé, Aurélien Garivier

A/B testing refers to the task of determining the best option among two alternatives that yield random outcomes.

The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond

no code implementations12 Feb 2011 Aurélien Garivier, Olivier Cappé

This paper presents a finite-time analysis of the KL-UCB algorithm, an online, horizon-free index policy for stochastic bandit problems.

Online EM Algorithm for Latent Data Models

no code implementations27 Dec 2007 Olivier Cappé, Eric Moulines

The resulting algorithm is usually simpler and is shown to achieve convergence to the stationary points of the Kullback-Leibler divergence between the marginal distribution of the observation and the model distribution at the optimal rate, i. e., that of the maximum likelihood estimator.

Cannot find the paper you are looking for? You can Submit a new open access paper.