1 code implementation • 27 Oct 2023 • Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh
The goal of an offline reinforcement learning (RL) algorithm is to learn optimal polices using historical (offline) data, without access to the environment for online exploration.
1 code implementation • 5 Mar 2023 • Zaiyan Xu, Kishan Panaganti, Dileep Kalathil
We formulate this as a distributionally robust reinforcement learning (DR-RL) problem where the objective is to learn the policy which maximizes the value function against the worst possible stochastic model of the environment in an uncertainty set.
1 code implementation • 10 Aug 2022 • Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh
The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the uncertainty in model parameters.