no code implementations • 13 Jul 2023 • Roberto Colomboni, Emmanuel Esposito, Andrea Paudice
The fat-shattering dimension characterizes the uniform convergence property of real-valued functions.
no code implementations • 30 May 2023 • Emmanuel Esposito, Saeed Masoudian, Hao Qiu, Dirk van der Hoeven, Nicolò Cesa-Bianchi, Yevgeny Seldin
However, if the mapping of states to losses is stochastic, we show that the regret grows at a rate of $\sqrt{\big(K+\min\{|\mathcal{S}|, d\}\big)T}$ (within log factors), implying that if the number $|\mathcal{S}|$ of states is smaller than the delay, then intermediate observations help.
no code implementations • 9 Oct 2022 • Emmanuel Esposito, Federico Fusco, Dirk van der Hoeven, Nicolò Cesa-Bianchi
The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback.