no code implementations • 17 Sep 2020 • Roy Eliya, J. Michael Herrmann
We propose a new method for training an agent via an evolutionary strategy (ES), in which we iteratively improve a set of samples to imitate: Starting with a random set, in every iteration we replace a subset of the samples with samples from the best trajectories discovered so far.