TIDBD: Adapting Temporal-difference Step-sizes Through Stochastic Meta-descent

10 Apr 2018Alex KearneyVivek VeeriahJaden B. TravnikRichard S. SuttonPatrick M. Pilarski

In this paper, we introduce a method for adapting the step-sizes of temporal difference (TD) learning. The performance of TD methods often depends on well chosen step-sizes, yet few algorithms have been developed for setting the step-size automatically for TD learning... (read more)

PDF Abstract


No code implementations yet. Submit your code now

Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.