no code implementations • 22 Dec 2016 • Wendelin Böhmer, Rong Guo, Klaus Obermayer
This paper investigates a type of instability that is linked to the greedy policy improvement in approximated reinforcement learning.
reinforcement-learning Reinforcement Learning (RL)