no code implementations • 16 Apr 2024 • Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mossé, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, William S. Zwicker
Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, so that, for example, they refuse to comply with requests for help with committing crimes or with producing racist text.
no code implementations • 28 May 2023 • Emanuel Tewolde, Caspar Oesterheld, Vincent Conitzer, Paul W. Goldberg
For such games, two natural equilibrium concepts have been proposed as alternative solution concepts to ex-ante optimality.