no code implementations • 16 Sep 2021 • John Mern, Sidhart Krishnan, Anil Yildiz, Kyle Hatch, Mykel J. Kochenderfer
In this work, we propose a method to build predictable policy trees as surrogates for policies such as neural networks.
1 code implementation • 7 Oct 2020 • John Mern, Anil Yildiz, Larry Bush, Tapan Mukerji, Mykel J. Kochenderfer
Online solvers for partially observable Markov decision processes have difficulty scaling to problems with large action spaces.
1 code implementation • 7 Oct 2020 • John Mern, Anil Yildiz, Zachary Sunberg, Tapan Mukerji, Mykel J. Kochenderfer
Monte Carlo tree search with progressive widening attempts to improve scaling by sampling from the action space to construct a policy search tree.