no code implementations • 30 Aug 2022 • Rushiv Arora, Bruno Castro da Silva, Eliot Moss
We found that an optimal policy trained on the discovered dynamics of the underlying system can generalize well.
Model-based Reinforcement Learning reinforcement-learning +1