no code implementations • 27 Nov 2021 • Julian Stastny, Maxime Riché, Alexander Lyzhov, Johannes Treutlein, Allan Dafoe, Jesse Clifton
However, the mixed-motive environments typically studied have a single cooperative outcome on which all agents can agree.
no code implementations • 13 Jul 2019 • Jesse Clifton, Lili Wu, Eric Laber
We introduce Parameterized Exploration (PE), a simple family of methods for model-based tuning of the exploration schedule in sequential decision problems.
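As a rough illustration of what model-based tuning of an exploration schedule could look like, the sketch below tunes one parameter of an epsilon-greedy schedule by simulating episodes from an estimated model and keeping the value with the highest average simulated return. It is not the authors' method: the geometric schedule, the Bernoulli bandit model, the grid search, and the names `epsilon`, `simulate_return`, and `tune_theta` are all assumptions made for this example.

```python
import random


def epsilon(t, theta):
    """Hypothetical schedule: exploration rate decays geometrically in t."""
    return theta ** (t + 1)


def simulate_return(theta, arm_means, horizon, rng):
    """Run one epsilon-greedy episode on a Bernoulli bandit drawn from the model."""
    n_arms = len(arm_means)
    counts = [0] * n_arms
    sums = [0.0] * n_arms
    total = 0.0
    for t in range(horizon):
        # Explore with probability epsilon_t, or while any arm is still untried.
        if rng.random() < epsilon(t, theta) or 0 in counts:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: sums[a] / counts[a])
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total += reward
    return total


def tune_theta(arm_means_estimate, horizon, candidates, n_sims=200, seed=0):
    """Pick the schedule parameter with the best mean return under the model."""
    rng = random.Random(seed)
    scores = {}
    for theta in candidates:
        returns = [
            simulate_return(theta, arm_means_estimate, horizon, rng)
            for _ in range(n_sims)
        ]
        scores[theta] = sum(returns) / n_sims
    best = max(scores, key=scores.get)
    return best, scores


# Assumed arm-mean estimates; in practice these would come from a fitted model.
best, scores = tune_theta([0.3, 0.6], horizon=50, candidates=[0.0, 0.5, 0.9])
```

In a sequential setting the model estimate would presumably be refit as data arrive and the tuning step repeated, so the chosen schedule can change over time.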