no code implementations • 19 Jan 2020 • Baicen Xiao, Qifan Lu, Bhaskar Ramasubramanian, Andrew Clark, Linda Bushnell, Radha Poovendran
The output of the feedback neural network is converted to a shaping reward that is augmented to the reward provided by the environment.