no code implementations • 22 Dec 2023 • Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang
This paper presents a general framework model for integrating and learning structured reasoning into AI agents' policies.
no code implementations • 27 Oct 2023 • Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Henry Mguni, Jun Wang
To that purpose, we offer a new leader-follower bilevel framework that is capable of learning to ask relevant questions (prompts) and subsequently undertaking reasoning to guide the learning of actions.
no code implementations • 28 Sep 2023 • Eleftherios Triantafyllidis, Filippos Christianos, Zhibin Li
We evaluate our framework and related intrinsic learning methods in an environment challenged with exploration, and a complex robotic manipulation task challenged by both exploration and long-horizons.
1 code implementation • 9 May 2023 • Adam Michalski, Filippos Christianos, Stefano V. Albrecht
The Starcraft Multi-Agent Challenge (SMAC) has been widely used in MARL research, but is built on top of a heavy, closed-source computer game, StarCraft II.
1 code implementation • 23 Feb 2023 • Callum Rhys Tilbury, Filippos Christianos, Stefano V. Albrecht
This method, however, is statistically biased, and a recent MARL benchmarking paper suggests that this bias makes MADDPG perform poorly in grid-world situations, where the action space is discrete.
no code implementations • 26 Oct 2022 • Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone
Reasoning with occluded traffic agents is a significant open challenge for planning for autonomous vehicles.
1 code implementation • 28 Sep 2022 • Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
This work focuses on equilibrium selection in no-conflict multi-agent games, where we specifically study the problem of selecting a Pareto-optimal Nash equilibrium among several existing equilibria.
Multi-agent Reinforcement Learning reinforcement-learning +1
3 code implementations • 2 Aug 2022 • Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht
The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning.
1 code implementation • 5 Jul 2022 • Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht
We show that a team of agents is able to adapt to novel tasks when provided with task embeddings.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • ICML Workshop URL 2021 • Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht
Intrinsic rewards can improve exploration in reinforcement learning, but the exploration process may suffer from instability caused by non-stationary reward shaping and strong dependency on hyperparameters.
1 code implementation • 15 Feb 2021 • Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht
Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 28 Sep 2020 • Georgios Papoudakis, Filippos Christianos, Stefano V Albrecht
Modelling the behaviours of other agents (opponents) is essential for understanding how agents interact and making effective decisions.
1 code implementation • 18 Jun 2020 • Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht
Ad hoc teamwork is the challenging problem of designing an autonomous agent which can adapt quickly to collaborate with teammates without prior coordination mechanisms, including joint training.
1 code implementation • NeurIPS 2021 • Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht
Existing methods for agent modelling commonly assume knowledge of the local observations and chosen actions of the modelled agents during execution.
8 code implementations • 14 Jun 2020 • Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht
Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult.
3 code implementations • NeurIPS 2020 • Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht
Exploration in multi-agent reinforcement learning is a challenging problem, especially in environments with sparse rewards.
no code implementations • 11 Jun 2019 • Georgios Papoudakis, Filippos Christianos, Arrasy Rahman, Stefano V. Albrecht
Recent developments in deep reinforcement learning are concerned with creating decision-making agents which can perform well in various complex domains.