1 code implementation • 23 Sep 2021 • Gautam Salhotra, Christopher E. Denniston, David A. Caron, Gaurav S. Sukhatme
We find that, given knowledge of the number of rollouts allocated, the agent can choose which actions to explore more effectively.