no code implementations • 24 Jun 2022 • James Macglashan, Evan Archer, Alisa Devlic, Takuma Seno, Craig Sherstan, Peter R. Wurman, Peter Stone
These value estimates provide insight into an agent's learning and decision-making process and enable new training methods to mitigate common problems.