Search Results for author: Vrettos Moulos

Found 3 papers, 0 papers with code

Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards

no code implementations NeurIPS 2020 Vrettos Moulos

For rewards generated from a one-parameter exponential family of Markov chains, we provide a finite-time upper bound for the regret incurred from this adaptive allocation rule, which reveals the logarithmic dependence of the regret on the time horizon, and which is asymptotically optimal.

A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits

no code implementations5 Jan 2020 Vrettos Moulos

This paper develops a Hoeffding inequality for the partial sums $\sum_{k=1}^n f (X_k)$, where $\{X_k\}_{k \in \mathbb{Z}_{> 0}}$ is an irreducible Markov chain on a finite state space $S$, and $f : S \to [a, b]$ is a real-valued function.

Optimal Best Markovian Arm Identification with Fixed Confidence

no code implementations NeurIPS 2019 Vrettos Moulos

We give a complete characterization of the sampling complexity of best Markovian arm identification in one-parameter Markovian bandit models.

Cannot find the paper you are looking for? You can Submit a new open access paper.