no code implementations • 18 Jan 2020 • Juan Vargas, Lazar Andjelic, Amir Barati Farimani
Actor critic methods with sparse rewards in model-based deep reinforcement learning typically require a deterministic binary reward function that reflects only two possible outcomes: if, for each step, the goal has been achieved or not.