no code implementations • 29 Sep 2021 • Jan Wöhlke, Felix Schmitt, Herke van Hoof
Combining the benefits of planning and learning values, we propose the Value Refinement Network (VRN), an architecture that locally refines a plan in a (simpler) state space abstraction, represented by a pre-computed value function, with respect to the full agent state.
no code implementations • 23 Sep 2021 • Jan Wöhlke, Felix Schmitt, Herke van Hoof
In simulated robotic navigation tasks, VI-RL results in consistent strong improvement over vanilla RL, is on par with vanilla hierarchal RL on single layouts but more broadly applicable to multiple layouts, and is on par with trainable HL path planning baselines except for a parking task with difficult non-holonomic dynamics where it shows marked improvements.
no code implementations • 28 Apr 2021 • W. Bradley Knox, Alessandro Allievi, Holger Banzhaf, Felix Schmitt, Peter Stone
This article considers the problem of diagnosing certain common errors in reward design.
2 code implementations • 1 Jun 2017 • Axel Huebl, Rene Widera, Felix Schmitt, Alexander Matthes, Norbert Podhorszki, Jong Youl Choi, Scott Klasky, Michael Bussmann
We implement and benchmark parallel I/O methods for the fully-manycore driven particle-in-cell code PIConGPU.
Performance Computational Physics D.4.8; B.4.3; I.6.6
no code implementations • 13 Apr 2016 • Michael Herman, Tobias Gindele, Jörg Wagner, Felix Schmitt, Wolfram Burgard
Inverse Reinforcement Learning (IRL) describes the problem of learning an unknown reward function of a Markov Decision Process (MDP) from observed behavior of an agent.
1 code implementation • 4 Apr 2014 • Axel Huebl, David Pugmire, Felix Schmitt, Richard Pausch, Michael Bussmann
Emerging new technologies in plasma simulations allow tracking billions of particles while computing their radiative spectra.
Plasma Physics Computational Physics 85-08