Uncovering Surprising Behaviors in Reinforcement Learning via Worst-case Analysis

ICLR 2019 Avraham RudermanRichard EverettBristy SikderHubert SoyerJonathan UesatoAnanya KumarCharlie BeattiePushmeet Kohli

Reinforcement learning agents are typically trained and evaluated according to their performance averaged over some distribution of environment settings. But does the distribution over environment settings contain important biases, and do these lead to agents that fail in certain cases despite high average-case performance?.. (read more)

PDF Abstract


No code implementations yet. Submit your code now


Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.