NeurIPS 2013 • Philip S. Thomas, William C. Dabney, Stephen Giguere, Sridhar Mahadevan
Natural actor-critics are a popular class of policy search algorithms for finding locally optimal policies for Markov decision processes.