Search Results for author: Kishan Panaganti

Found 2 papers, 0 papers with code

Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees

no code implementations20 Jun 2020 Kishan Panaganti, Dileep Kalathil

We first propose the Robust Least Squares Policy Evaluation algorithm, which is a multi-step online model-free learning algorithm for policy evaluation.

OpenAI Gym

Bounded Regret for Finitely Parameterized Multi-Armed Bandits

no code implementations3 Mar 2020 Kishan Panaganti, Dileep Kalathil

We propose an algorithm that is simple and easy to implement, which we call Finitely Parameterized Upper Confidence Bound (FP-UCB) algorithm, which uses the information about the underlying parameter set for faster learning.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.