17 Aug 2016 • K J Prabuchandran, Tejas Bodas, Theja Tulabandhula
A recent goal in the Reinforcement Learning (RL) framework is to choose a sequence of actions, or a policy, that maximizes the reward collected or minimizes the regret incurred over a finite time horizon.