no code implementations • 5 Nov 2018 • Junjie Zeng, Long Qin, Yue Hu, Cong Hu, Quanjun Yin
The first advantage of the proposed method is that SSG can solve the limitations of sparse reward and local minima trap for RL agents; thus, LSPI can be used to generate paths in complex environments.