no code implementations • 15 Sep 2022 • Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Giulia Romano, Nicola Gatti
We present the first best-of-both-world type algorithm for this general class of problems, with no-regret guarantees both in the case in which rewards and constraints are selected according to an unknown stochastic model, and in the case in which they are selected at each round by an adversary.
no code implementations • 1 Jun 2022 • Giulia Romano, Andrea Agostini, Francesco Trovò, Nicola Gatti, Marcello Restelli
We provide two algorithms to address TP-MAB problems, namely, TP-UCB-FR and TP-UCB-EW, which exploit the partial information disclosed by the reward collected over time.
no code implementations • 18 Jan 2022 • Matteo Castiglioni, Alessandro Nuara, Giulia Romano, Giorgio Spadaro, Francesco Trovò, Nicola Gatti
More interestingly, we provide an algorithm, namely GCB_{safe}(\psi,\phi), guaranteeing both sublinear pseudo-regret and safety w. h. p.
no code implementations • 10 Dec 2020 • Giulia Romano, Gianluca Tartaglia, Alberto Marchesi, Nicola Gatti
We evaluate our mechanisms in terms of competitive ratio, measuring the worst-case ratio between their revenue and that of an optimal mechanism that knows the distribution of valuations.
Computer Science and Game Theory