Search Results for author: Elita A. Lobo

Found 2 papers, 1 papers with code

Percentile Criterion Optimization in Offline Reinforcement Learning

2 code implementations NeurIPS 2023 Elita A. Lobo, Cyrus Cousins, Yair Zick, Marek Petrik

The percentile criterion is approximately solved by constructing an \emph{ambiguity set} that contains the true model with high probability and optimizing the policy for the worst model in the set.

Decision Making reinforcement-learning

Soft-Robust Algorithms for Batch Reinforcement Learning

no code implementations30 Nov 2020 Elita A. Lobo, Mohammad Ghavamzadeh, Marek Petrik

In reinforcement learning, robust policies for high-stakes decision-making problems with limited data are usually computed by optimizing the percentile criterion, which minimizes the probability of a catastrophic failure.

Decision Making reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.