no code implementations • 15 Sep 2018 • Philipp Ennen, Pia Bresenitz, Rene Vossen, Frank Hees
However, due to the small number of real-world trajectory samples in Guided Policy Search, the resulting neural networks are only robust in the neighbourhood of the trajectory distribution explored by real-world interactions.