Search Results for author: Jakob Nicolaus Foerster

Found 9 papers, 5 papers with code

Discovering Temporally-Aware Reinforcement Learning Algorithms

1 code implementation8 Feb 2024 Matthew Thomas Jackson, Chris Lu, Louis Kirsch, Robert Tjarko Lange, Shimon Whiteson, Jakob Nicolaus Foerster

We propose a simple augmentation to two existing objective discovery approaches that allows the discovered algorithm to dynamically update its objective function throughout the agent's training procedure, resulting in expressive schedules and increased generalization across different training horizons.

Meta-Learning reinforcement-learning

ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

1 code implementation2 Jun 2023 Andrew Jesson, Chris Lu, Gunshi Gupta, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal

We show that the additive term is bounded proportional to the Lipschitz constant of the value function, which offers theoretical grounding for spectral normalization of critic weights.

Bayesian Inference Continuous Control +3

Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning

no code implementations19 Mar 2023 Yat Long Lo, Christian Schroeder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson

By enabling agents to communicate, recent cooperative multi-agent reinforcement learning (MARL) methods have demonstrated better task performance and more coordinated behavior.

Multi-agent Reinforcement Learning reinforcement-learning +1

Proximal Learning With Opponent-Learning Awareness

1 code implementation18 Oct 2022 Stephen Zhao, Chris Lu, Roger Baker Grosse, Jakob Nicolaus Foerster

This problem is especially pronounced in the opponent modeling setting, where the opponent's policy is unknown and must be inferred from observations; in such settings, LOLA is ill-specified because behaviorally equivalent opponent policies can result in non-equivalent updates.

Multi-agent Reinforcement Learning

A Fine-Tuning Approach to Belief State Modeling

no code implementations ICLR 2022 Samuel Sokota, Hengyuan Hu, David J Wu, J Zico Kolter, Jakob Nicolaus Foerster, Noam Brown

Furthermore, because this specialization occurs after the action or policy has already been decided, BFT does not require the belief model to process it as input.

Zero-Shot Coordination via Semantic Relationships Between Actions and Observations

no code implementations29 Sep 2021 Mingwei Ma, Jizhou Liu, Samuel Sokota, Max Kleiman-Weiner, Jakob Nicolaus Foerster

An unaddressed challenge in zero-shot coordination is to take advantage of the semantic relationship between the features of an action and the features of observations.

Inductive Bias

Cannot find the paper you are looking for? You can Submit a new open access paper.