1 code implementation • 15 Aug 2023 • Rowan Hodson, Bruce Bassett, Charel van Hoof, Benjamin Rosman, Mark Solms, Jonathan P. Shock, Ryan Smith
First, we compare the performance of SI to Bayesian reinforcement learning (RL) schemes designed to solve similar problems.
1 code implementation • 4 Jun 2021 • Alejandro Daniel Noel, Charel van Hoof, Beren Millidge
Our model solves sparse-reward problems with very high sample efficiency because its objective function encourages directed exploration of uncertain states.