no code implementations • 18 Oct 2023 • Nataša Bolić, Tommaso Cesari, Roberto Colomboni
If the distribution admits a density bounded by some constant $M$, then, for any time horizon $T$: $\bullet$ If the agents' valuations are revealed after each interaction, we provide an algorithm achieving regret $M \log T$ and show this rate is optimal, up to constant factors.
no code implementations • 14 Jul 2023 • Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, Stefano Leonardi
We study the problem of regret minimization for a single bidder in a sequence of first-price auctions where the bidder discovers the item's value only if the auction is won.
no code implementations • 21 Feb 2023 • Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, Stefano Leonardi
We provide a complete characterization of the regret regimes for fixed-price mechanisms under different feedback models in the two cases where the learner can post either the same or different prices to buyers and sellers.
no code implementations • 2 Sep 2022 • François Bachoc, Tommaso Cesari, Roberto Colomboni, Andrea Paudice
We analyze the cumulative regret of the Dyadic Search algorithm of Bachoc et al. [2022].
no code implementations • 13 Aug 2022 • François Bachoc, Tommaso Cesari, Roberto Colomboni, Andrea Paudice
This paper studies a natural generalization of the problem of minimizing a univariate convex function $f$ by querying its values sequentially.
no code implementations • 8 Jul 2022 • Nicolò Cesa-Bianchi, Tommaso Cesari, Takayuki Osogami, Marco Scarsini, Segev Wasserkrug
We study a repeated game between a supplier and a retailer who want to maximize their respective profits without full knowledge of the problem parameters.
no code implementations • 14 Feb 2022 • Tommaso Cesari, Jonathan Pergoli, Michele Maestrini, Pierluigi Di Lizia
In this paper, we present a real-world application of online learning with expert advice to the field of Space Operations, testing our theory on real-life data coming from the Copernicus Sentinel-6 satellite.
no code implementations • 6 Dec 2021 • Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Claudio Gentile, Yishay Mansour
We investigate a nonstochastic bandit setting in which the loss of an action is not immediately charged to the player, but rather spread over the subsequent rounds in an adversarial way.
no code implementations • 8 Sep 2021 • Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, Stefano Leonardi
In this paper, we cast the bilateral trade problem in a regret minimization framework over $T$ rounds of seller/buyer interactions, with no prior knowledge on their private valuations.
no code implementations • 16 Feb 2021 • Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, Stefano Leonardi
Despite the simplicity of this problem, a classical result by Myerson and Satterthwaite (1983) affirms the impossibility of designing a mechanism which is simultaneously efficient, incentive compatible, individually rational, and budget balanced.
no code implementations • 26 Oct 2020 • François Bachoc, Tommaso Cesari, Sébastien Gerchinovitz
We study the problem of approximating the level set of an unknown function by sequentially querying its values.
no code implementations • 5 Oct 2020 • Riccardo Della Vecchia, Tommaso Cesari
Furthermore, we prove that this is only $\sqrt$ k log k-away from the best achievable rate and that Coop-FTPL has a state-of-the-art T 3/2 worst-case computational complexity.
no code implementations • 8 Jul 2020 • Tommaso Cesari, Roberto Colomboni
The property of almost every point being a Lebesgue point has proven to be crucial for the consistency of several classification algorithms based on nearest neighbors.
no code implementations • 6 Feb 2020 • Clément Bouttier, Tommaso Cesari, Mélanie Ducoffe, Sébastien Gerchinovitz
We consider the problem of maximizing a non-concave Lipschitz multivariate function over a compact domain by sequentially querying its (possibly perturbed) values.
no code implementations • NeurIPS 2021 • Nicolò Cesa-Bianchi, Tommaso Cesari, Yishay Mansour, Vianney Perchet
We introduce a novel theoretical framework for Return On Investment (ROI) maximization in repeated decision-making.
no code implementations • 9 Jul 2018 • Nicolò Cesa-Bianchi, Tommaso Cesari, Vianney Perchet
When $K=2$ in the distribution-dependent case, the hardness of our setting reduces to that of a stochastic $2$-armed bandit: we prove that an upper bound of order $(\log T)/\Delta$ (up to $\log\log$ factors) on the regret can be achieved with no information on the demand curve.