no code implementations • 29 Jul 2015 • Thomas Furmston, Guy Lever
In this work we investigate approximate Newton methods for policy optimization in Markov Decision Processes (MDPs).
no code implementations • NeurIPS 2012 • Thomas Furmston, David Barber
This analysis leads naturally to the consideration of this approximate Newton method as an alternative gradient-based method for Markov Decision Processes.