Search Results for author: Caspar Oesterheld

Found 15 papers, 4 papers with code

Computing Game Symmetries and Equilibria That Respect Them

no code implementations15 Jan 2025 Emanuel Tewolde, Brian Hu Zhang, Caspar Oesterheld, Tuomas Sandholm, Vincent Conitzer

Strategic interactions can be represented more concisely, and analyzed and solved more efficiently, if we are aware of the symmetries within the multiagent system.

Observation Interference in Partially Observable Assistance Games

no code implementations23 Dec 2024 Scott Emmons, Caspar Oesterheld, Vincent Conitzer, Stuart Russell

We show that this incentive for interference goes away if the human is playing optimally, or if we introduce a communication channel for the human to communicate their preferences to the assistant.

Characterising Simulation-Based Program Equilibria

no code implementations19 Dec 2024 Emery Cooper, Caspar Oesterheld, Vincent Conitzer

Finally, we explore the limits of simulation-based program equilibrium, showing that the Tennenholtz folk theorem cannot be attained by simulation-based programs without access to shared randomness.

A dataset of questions on decision-theoretic reasoning in Newcomb-like problems

1 code implementation15 Nov 2024 Caspar Oesterheld, Emery Cooper, Miles Kodama, Linh Chi Nguyen, Ethan Perez

We introduce a dataset of natural-language questions in the decision theory of so-called Newcomb-like problems.

Can CDT rationalise the ex ante optimal policy via modified anthropics?

no code implementations7 Nov 2024 Emery Cooper, Caspar Oesterheld, Vincent Conitzer

If so, then causal decision theory might recommend one-boxing in order to cause the predictor to fill the opaque box.

Imperfect-Recall Games: Equilibrium Concepts and Their Complexity

no code implementations23 Jun 2024 Emanuel Tewolde, Brian Hu Zhang, Caspar Oesterheld, Manolis Zampetakis, Tuomas Sandholm, Paul W. Goldberg, Vincent Conitzer

We investigate optimal decision making under imperfect recall, that is, when an agent forgets information it once held before.

Decision Making

Recursive Joint Simulation in Games

no code implementations12 Feb 2024 Vojtech Kovarik, Caspar Oesterheld, Vincent Conitzer

In this paper, we study an interaction between AI agents where the agents run a recursive joint simulation.

AI Agent

A Theory of Bounded Inductive Rationality

no code implementations11 Jul 2023 Caspar Oesterheld, Abram Demski, Vincent Conitzer

In this paper, we develop a theory of rational decision making that does not assume logical omniscience.

Decision Making

Incentivizing honest performative predictions with proper scoring rules

1 code implementation28 May 2023 Caspar Oesterheld, Johannes Treutlein, Emery Cooper, Rubi Hudson

We show that, for binary predictions, if the influence of the expert's prediction on outcomes is bounded, it is possible to define scoring rules under which optimal reports are arbitrarily close to fixed points.

Prediction

The Computational Complexity of Single-Player Imperfect-Recall Games

no code implementations28 May 2023 Emanuel Tewolde, Caspar Oesterheld, Vincent Conitzer, Paul W. Goldberg

For such games, two natural equilibrium concepts have been proposed as alternative solution concepts to ex-ante optimality.

For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria

1 code implementation7 Jul 2022 Scott Emmons, Caspar Oesterheld, Andrew Critch, Vincent Conitzer, Stuart Russell

In this work, we show that any locally optimal symmetric strategy profile is also a (global) Nash equilibrium.

Reinforcement Learning in Newcomblike Environments

no code implementations NeurIPS 2021 James Bell, Linda Linsefors, Caspar Oesterheld, Joar Skalse

This gives us a powerful tool for reasoning about the limit behaviour of agents -- for example, it lets us show that there are Newcomblike environments in which a reinforcement learning agent cannot converge to any optimal policy.

reinforcement-learning Reinforcement Learning +1

A New Formalism, Method and Open Issues for Zero-Shot Coordination

1 code implementation11 Jun 2021 Johannes Treutlein, Michael Dennis, Caspar Oesterheld, Jakob Foerster

We introduce an extension of the algorithm, other-play with tie-breaking, and prove that it is optimal in the LFC problem and an equilibrium in the LFC game.

Multi-agent Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.