no code implementations • 1 Jun 2022 • Giulia Romano, Andrea Agostini, Francesco Trovò, Nicola Gatti, Marcello Restelli
We provide two algorithms to address TP-MAB problems, namely, TP-UCB-FR and TP-UCB-EW, which exploit the partial information disclosed by the reward collected over time.