no code implementations • 29 Sep 2021 • Elioth Sanabria, David Yao, Henry Lam
In this paper, we show that even for problems with large state space, when the solution policy of the MDP can be represented by a tree-like structure, our proposed algorithm retrieves a tree of the solution policy of the MDP in computationally tractable time.