no code implementations • 8 Dec 2017 • Xue Lu, Niall Adams, Nikolas Kantas
In this paper, we overcome the shortcoming of slow response to change by deploying adaptive estimation in the standard methods and propose a new family of algorithms, which are adaptive versions of $\epsilon$-Greedy, UCB, and Thompson sampling.