Search Results for author: Giulia Clerici

Found 2 papers, 1 papers with code

Linear Bandits with Memory: from Rotting to Rising

no code implementations16 Feb 2023 Giulia Clerici, Pierre Laforgue, Nicolò Cesa-Bianchi

By choosing the cycle length so as to trade-off approximation and estimation errors, we then prove a bound of order $\sqrt{d}\,(m+1)^{\frac{1}{2}+\max\{\gamma, 0\}}\, T^{3/4}$ (ignoring log factors) on the regret against the optimal sequence of actions, where $T$ is the horizon and $d$ is the dimension of the linear action space.

Decision Making Model Selection

A Last Switch Dependent Analysis of Satiation and Seasonality in Bandits

1 code implementation22 Oct 2021 Pierre Laforgue, Giulia Clerici, Nicolò Cesa-Bianchi, Ran Gilad-Bachrach

Motivated by the fact that humans like some level of unpredictability or novelty, and might therefore get quickly bored when interacting with a stationary policy, we introduce a novel non-stationary bandit problem, where the expected reward of an arm is fully determined by the time elapsed since the arm last took part in a switch of actions.

Cannot find the paper you are looking for? You can Submit a new open access paper.