Search Results for author: Adrian Müller

Found 2 papers, 0 papers with code

Truly No-Regret Learning in Constrained MDPs

no code implementations24 Feb 2024 Adrian Müller, Pragnya Alatur, Volkan Cevher, Giorgia Ramponi, Niao He

As Efroni et al. (2020) pointed out, it is an open question whether primal-dual algorithms can provably achieve sublinear regret if we do not allow error cancellations.

Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes

no code implementations12 Jun 2023 Adrian Müller, Pragnya Alatur, Giorgia Ramponi, Niao He

Unlike existing Lagrangian approaches, our algorithm achieves this regret without the need for the cancellation of errors.

Safe Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.