no code implementations • 12 Jun 2023 • Moritz A. Zanger, Wendelin Böhmer, Matthijs T. J. Spaan
In contrast to classical reinforcement learning, distributional reinforcement learning algorithms aim to learn the distribution of returns rather than their expected value.
1 code implementation • 14 Apr 2021 • Moritz A. Zanger, Karam Daaboul, J. Marius Zöllner
Further, we provide theoretical and empirical analyses regarding the implications of model-usage on constrained policy optimization problems and introduce a practical algorithm that accelerates policy search with model-generated data.