no code implementations • 3 Oct 2023 • Oded Blumenthal, Guy Shani
POMCP develops an action-observation tree, and at the leaves, uses a rollout policy to provide a value estimate for the leaf.
Decision Making