Search Results for author: Rong Guo

Found 1 papers, 0 papers with code

Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning

no code implementations22 Dec 2016 Wendelin Böhmer, Rong Guo, Klaus Obermayer

This paper investigates a type of instability that is linked to the greedy policy improvement in approximated reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.