Search Results for author: Richard Klima

Found 1 papers, 0 papers with code

Robust Temporal Difference Learning for Critical Domains

no code implementations23 Jan 2019 Richard Klima, Daan Bloembergen, Michael Kaisers, Karl Tuyls

We prove convergence of the operator to the optimal robust Q-function with respect to the model using the theory of Generalized Markov Decision Processes.

Cannot find the paper you are looking for? You can Submit a new open access paper.