no code implementations • 20 Apr 2015 • Reinaldo Uribe Muriel, Fernando Lozando, Charles Anderson
This paper describes a novel method to solve average-reward semi-Markov decision processes, by reducing them to a minimal sequence of cumulative reward problems.