Reward Estimation for Variance Reduction in Deep Reinforcement Learning

9 May 2018Joshua RomoffPeter HendersonAlexandre PichéVincent Francois-LavetJoelle Pineau

Reinforcement Learning (RL) agents require the specification of a reward signal for learning behaviours. However, introduction of corrupt or stochastic rewards can yield high variance in learning... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.