no code implementations • 3 Aug 2023 • Joao Carvalho, An T. Le, Mark Baierl, Dorothea Koert, Jan Peters
Learning priors on trajectory distributions can help accelerate robot motion planning optimization.
1 code implementation • 7 Mar 2023 • Daniel Palenicek, Michael Lutter, Joao Carvalho, Jan Peters
Therefore, we conclude that the limitation of model-based value expansion methods is not the model accuracy of the learned models.
no code implementations • 8 Mar 2022 • Joao Carvalho, Jan Peters
This estimator is unbiased, has low variance, and can be used with differentiable and non-differentiable function approximators.
1 code implementation • 8 Jan 2020 • Samuele Tosatto, Joao Carvalho, Hany Abdulsamad, Jan Peters
Reinforcement learning (RL) algorithms still suffer from high sample complexity despite outstanding recent successes.