Search Results for author: Rong Guo

Found 1 papers, 0 papers with code

Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning

no code implementations • 22 Dec 2016 • Wendelin Böhmer, Rong Guo, Klaus Obermayer

This paper investigates a type of instability that is linked to the greedy policy improvement in approximated reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.