Search Results for author: Oliver Hayman

Found 1 paper, 0 papers with code

Goodhart's Law in Reinforcement Learning

no code implementations • 13 Oct 2023 • Jacek Karwowski, Oliver Hayman, Xingjian Bai, Klaus Kiendlhofer, Charlie Griffin, Joar Skalse

First, we propose a way to quantify the magnitude of this effect and show empirically that optimising an imperfect proxy reward often leads to the behaviour predicted by Goodhart's law for a wide range of environments and reward functions.

reinforcement-learning

