no code implementations • 1 Jan 2021 • Thomas Pierrot, Valentin Macé, Jean-Baptiste Sevestre, Louis Monier, Alexandre Laterre, Nicolas Perrin, Karim Beguir, Olivier Sigaud
Very large action spaces constitute a critical challenge for deep Reinforcement Learning (RL) algorithms.
no code implementations • 10 Jan 2021 • Tristan Cazenave, Jean-Baptiste Sevestre, Matthieu Toulemont
Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games.