no code implementations • 8 Sep 2019 • Qi Zhang, Tao Du, Changzheng Tian
To improve the generalization ability for the driving behavior, the reinforcement learning method requires extrinsic reward from the real environment, which may damage the car.