Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

ICLR 2018 Su Young LeeSungik ChoiSae-Young Chung

We propose Episodic Backward Update (EBU) - a novel deep reinforcement learning algorithm with a direct value propagation. In contrast to the conventional use of the experience replay with uniform random sampling, our agent samples a whole episode and successively propagates the value of a state to its previous states... (read more)

PDF Abstract ICLR 2018 PDF ICLR 2018 Abstract

Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper