Search Results for author: Richard Klima

Found 1 paper, 0 papers with code

Robust Temporal Difference Learning for Critical Domains

no code implementations • 23 Jan 2019 • Richard Klima, Daan Bloembergen, Michael Kaisers, Karl Tuyls

We prove convergence of the operator to the optimal robust Q-function with respect to the model using the theory of Generalized Markov Decision Processes.

