no code implementations • NeurIPS 2018 • Mathieu Fehr, Olivier Buffet, Vincent Thomas, Jilles Dibangoye
In this paper, we focus on POMDPs and ρ-POMDPs with λ ρ -Lipschitz reward function, and demonstrate that, for finite horizons, the optimal value function is Lipschitz-continuous.