1 code implementation • 3 Apr 2024 • Marko Zaric, Jakob Hollenstein, Justus Piater, Erwan Renaudo
Learning actions that are relevant to decision-making and can be executed effectively is a key problem in autonomous robotics.
no code implementations • 18 Dec 2023 • Jakob Hollenstein, Georg Martius, Justus Piater
Proximal Policy Optimization (PPO), a popular on-policy deep reinforcement learning method, employs a stochastic policy for exploration.
no code implementations • 8 Jun 2022 • Jakob Hollenstein, Sayantan Auddy, Matteo Saveriano, Erwan Renaudo, Justus Piater
Many Deep Reinforcement Learning (D-RL) algorithms rely on simple forms of exploration such as the additive action noise often used in continuous control domains.
1 code implementation • 14 Feb 2022 • Sayantan Auddy, Jakob Hollenstein, Matteo Saveriano, Antonio Rodríguez-Sánchez, Justus Piater
We empirically demonstrate the effectiveness of this approach in remembering long sequences of trajectory learning tasks without the need to store any data from past demonstrations.