no code implementations • 9 Feb 2023 • Elliot Fosong, Arrasy Rahman, Ignacio Carlucho, Stefano V. Albrecht
Training a team to complete a complex task via multi-agent reinforcement learning can be difficult due to challenges such as policy search in a large joint policy space, and non-stationarity caused by mutually adapting agents.
Multi-agent Reinforcement Learning reinforcement-learning +1
3 code implementations • 2 Aug 2022 • Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht
The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning.
no code implementations • 28 Jul 2022 • Arrasy Rahman, Elliot Fosong, Ignacio Carlucho, Stefano V. Albrecht
Early approaches address the AHT challenge by training the learner with a diverse set of handcrafted teammate policies, usually designed based on an expert's domain knowledge about the policies the learner may encounter.
no code implementations • 19 Jul 2022 • Elliot Fosong, Arrasy Rahman, Ignacio Carlucho, Stefano V. Albrecht
We propose the novel few-shot teamwork (FST) problem, where skilled agents trained in a team to complete one task are combined with skilled agents from different tasks, and together must learn to adapt to an unseen but related task.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 16 Feb 2022 • Reuth Mirsky, Ignacio Carlucho, Arrasy Rahman, Elliot Fosong, William Macke, Mohan Sridharan, Peter Stone, Stefano V. Albrecht
Ad hoc teamwork is the research problem of designing agents that can collaborate with new teammates without prior coordination.
1 code implementation • 18 Jul 2020 • Ibrahim H. Ahmed, Josiah P. Hanna, Elliot Fosong, Stefano V. Albrecht
Authentication and key agreement are decided based on the agents' observed behaviors during the interaction.