Search Results for author: Tanut Treetanthiploet

Found 5 papers, 0 papers with code

Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

no code implementations8 Aug 2022 Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

This work uses the entropy-regularised relaxed stochastic control perspective as a principled framework for designing reinforcement learning (RL) algorithms.

reinforcement-learning Reinforcement Learning (RL) +1

Generalised correlated batched bandits via the ARC algorithm with application to dynamic pricing

no code implementations8 Feb 2021 samuel cohen, Tanut Treetanthiploet

The Asymptotic Randomised Control (ARC) algorithm provides a rigorous approximation to the optimal strategy for a wide class of Bayesian bandits, while retaining low computational complexity.

Asymptotic Randomised Control with applications to bandits

no code implementations14 Oct 2020 Samuel N. Cohen, Tanut Treetanthiploet

We consider a general multi-armed bandit problem with correlated (and simple contextual and restless) elements, as a relaxed control problem.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.