Search Results for author: Jesús A. De Loera

Found 2 papers, 0 papers with code

Geometric Policy Iteration for Markov Decision Processes

no code implementations • 12 Jun 2022 • Yue Wu, Jesús A. De Loera

GPI updates the policy of a single state by switching to an action that is mapped to the boundary of the value function polytope, followed by an immediate update of the value function.
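The update pattern described above (change the action at one state, then re-evaluate the value function right away) can be illustrated with a minimal sketch. This is not the authors' GPI: the geometric rule that picks an action mapped to the boundary of the value function polytope is not reproduced here, and a plain greedy one-step lookahead stands in for it. The function names, the array layout (`P[a, s, s']` for transitions, `R[s, a]` for rewards), and the defaults are all assumptions made for illustration.

```python
import numpy as np


def evaluate(P, R, policy, gamma):
    """Exact value of a deterministic policy: solve (I - gamma * P_pi) v = r_pi."""
    n_states = P.shape[1]
    idx = np.arange(n_states)
    P_pi = P[policy, idx]   # (S, S): row s is P[policy[s], s, :]
    r_pi = R[idx, policy]   # (S,):   reward of the chosen action in each state
    return np.linalg.solve(np.eye(n_states) - gamma * P_pi, r_pi)


def single_state_policy_iteration(P, R, gamma=0.9, max_sweeps=100, tol=1e-10):
    """Update one state at a time and re-evaluate immediately after each switch.

    Assumption: the new action is chosen greedily from a one-step lookahead,
    which replaces the paper's geometric selection rule in this sketch.
    """
    n_states = P.shape[1]
    policy = np.zeros(n_states, dtype=int)
    v = evaluate(P, R, policy, gamma)
    for _ in range(max_sweeps):
        changed = False
        for s in range(n_states):
            # Action values at state s under the current value function.
            q = R[s] + gamma * P[:, s, :] @ v
            best = int(np.argmax(q))
            if q[best] > q[policy[s]] + tol:
                policy[s] = best
                # Immediate value update, before moving to the next state.
                v = evaluate(P, R, policy, gamma)
                changed = True
        if not changed:
            break
    return policy, v
```

Re-evaluating after every single-state switch means later states in the same sweep already see the improved values, at the cost of one linear solve per switch in this naive version.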

Computational Efficiency
