no code implementations • 29 Jul 2024 • Steven Morad, Ajay Shankar, Jan Blumenkamp, Amanda Prorok
We present a method for developing navigation policies for multi-robot teams that interpret and follow natural language instructions.
no code implementations • 19 Jun 2024 • Edan Toledo, Amanda Prorok
Sample efficiency is a critical challenge in reinforcement learning.
1 code implementation • 23 May 2024 • Matteo Bettini, Ryan Kortvelesy, Amanda Prorok
To address this, we introduce Diversity Control (DiCo), a method able to control diversity to an exact value of a given metric by representing policies as the sum of a parameter-shared component and dynamically scaled per-agent components.
no code implementations • 21 Mar 2024 • Zhan Gao, Guang Yang, Amanda Prorok
By introducing two sub-objectives of multi-agent navigation and environment optimization, we propose an agent-environment co-optimization problem and develop a coordinated algorithm that alternates between these sub-objectives to search for an optimal synthesis of agent actions and obstacle configurations in the environment, ultimately improving navigation performance.
1 code implementation • 11 Mar 2024 • Dulhan Jayalath, Steven Morad, Amanda Prorok
Our objective is to learn a fixed-size latent Markov state from a variable number of agent observations.
1 code implementation • 15 Feb 2024 • Steven Morad, Chris Lu, Ryan Kortvelesy, Stephan Liwicki, Jakob Foerster, Amanda Prorok
Memory models such as Recurrent Neural Networks (RNNs) and Transformers address Partially Observable Markov Decision Processes (POMDPs) by mapping trajectories to latent Markov states.
no code implementations • 4 Dec 2023 • Zhan Gao, Amanda Prorok, Elvin Isufi
Analyzing the stability of graph neural networks (GNNs) under topological perturbations is key to understanding their transferability and the role of each architecture component.
1 code implementation • 3 Dec 2023 • Matteo Bettini, Amanda Prorok, Vincent Moens
The field of Multi-Agent Reinforcement Learning (MARL) is currently facing a reproducibility crisis.
no code implementations • 23 Nov 2023 • Ajay Shankar, Heedo Woo, Amanda Prorok
However, certain missions require two multirotors to approach within 1-2 body-lengths of each other and hold formation. We consider one such practical instance: vertically docking two multirotors in the air.
1 code implementation • 24 Jun 2023 • Jasmine Bayrooti, Zhan Gao, Amanda Prorok
Furthermore, we show that it is possible to learn a model achieving high accuracies, within 3% of DP-SGD on MNIST under (1, 10^-5)-differential privacy and within 6% of DP-SGD on CIFAR-100 under (10, 10^-5)-differential privacy, without ever sharing raw data with other agents.
no code implementations • 18 May 2023 • Zhan Gao, Amanda Prorok
The goal of this paper is to consider the environment as a decision variable in a system-level optimization problem, where both agent performance and environment cost are incorporated.
1 code implementation • 3 May 2023 • Matteo Bettini, Ajay Shankar, Amanda Prorok
In this paper, we introduce System Neural Diversity (SND): a measure of behavioral heterogeneity in multi-agent systems.
no code implementations • 8 Mar 2023 • Zhan Gao, Guang Yang, Amanda Prorok
Control barrier functions (CBFs) enable guaranteed safe multi-agent navigation in the continuous domain.
3 code implementations • 3 Mar 2023 • Steven Morad, Ryan Kortvelesy, Matteo Bettini, Stephan Liwicki, Amanda Prorok
Real world applications of Reinforcement Learning (RL) are often partially observable, thus requiring memory.
2 code implementations • 24 Feb 2023 • Ryan Kortvelesy, Steven Morad, Amanda Prorok
The problem of permutation-invariant learning over set representations is particularly relevant in the field of multi-agent systems -- a few potential applications include unsupervised training of aggregation functions in graph neural networks (GNNs), neural cellular automata on graphs, and prediction of scenes with multiple objects.
no code implementations • 20 Jan 2023 • Chenning Yu, QingBiao Li, Sicun Gao, Amanda Prorok
Though it is complete and optimal, it does not scale well.
2 code implementations • 17 Jan 2023 • Matteo Bettini, Ajay Shankar, Amanda Prorok
Through simulations and real-world experiments, we show that: (i) when homogeneous methods fail due to strong heterogeneous requirements, HetGPPO succeeds, and (ii) when homogeneous methods are able to learn apparently heterogeneous behaviors, HetGPPO achieves higher resilience to both training and deployment noise.
no code implementations • 30 Oct 2022 • Zhan Gao, Yulin Shao, Deniz Gunduz, Amanda Prorok
Wireless local area networks (WLANs) manage multiple access points (APs) and assign scarce radio frequency resources to APs to satisfy the traffic demands of associated user devices.
no code implementations • 22 Sep 2022 • Zhan Gao, Amanda Prorok
Traditional approaches to the design of multi-agent navigation algorithms consider the environment as a fixed constraint, despite the obvious influence of spatial constraints on agents' performance.
no code implementations • 1 Aug 2022 • Jan Blumenkamp, QingBiao Li, Binyu Wang, Zhe Liu, Amanda Prorok
We consider the problem of navigating a mobile robot towards a target in an unknown environment that is endowed with visual sensors, where neither the robot nor the sensors have access to global positioning information and only use first-person-view images.
1 code implementation • 7 Jul 2022 • Matteo Bettini, Ryan Kortvelesy, Jan Blumenkamp, Amanda Prorok
VMAS's scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms.
1 code implementation • 25 May 2022 • Ryan Kortvelesy, Amanda Prorok
In multi-agent reinforcement learning, the use of a global objective is a powerful tool for incentivising cooperation.
2 code implementations • 2 Nov 2021 • Jan Blumenkamp, Steven Morad, Jennifer Gielis, QingBiao Li, Amanda Prorok
We demonstrate our framework on a case study that requires tight coordination between robots, and present first-of-a-kind results that show successful real-world deployment of GNN-based policies on a decentralized multi-robot system relying on ad-hoc communication.
1 code implementation • ICLR 2022 • Benjamin Hudson, QingBiao Li, Matthew Malencia, Amanda Prorok
To close this gap, we present a hybrid data-driven approach for solving the TSP based on Graph Neural Networks (GNNs) and Guided Local Search (GLS).
1 code implementation • 29 Sep 2021 • Francesco Sovrano, Alex Raymond, Amanda Prorok
In this paper, we propose a method for organising experience by means of partitioning the experience buffer into clusters labelled on a per-explanation basis.
no code implementations • 25 Sep 2021 • Amanda Prorok, Matthew Malencia, Luca Carlone, Gaurav S. Sukhatme, Brian M. Sadler, Vijay Kumar
In this survey article, we analyze how resilience is achieved in networks of agents and multi-robot systems that are able to overcome adversity by leveraging system-wide complementarity, diversity, and redundancy, often involving a reconfiguration of robotic capabilities to provide some key ability that was not present in the system a priori.
no code implementations • 26 Jul 2021 • Amanda Prorok, Jan Blumenkamp, QingBiao Li, Ryan Kortvelesy, Zhe Liu, Ethan Stump
Many multi-robot planning problems are burdened by the curse of dimensionality, which compounds the difficulty of applying solutions to large-scale problem instances.
1 code implementation • 27 Jun 2021 • Steven D. Morad, Stephan Liwicki, Ryan Kortvelesy, Roberto Mecca, Amanda Prorok
Solving partially-observable Markov decision processes (POMDPs) is critical when applying reinforcement learning to real-world problems, where agents have an incomplete view of the world.
1 code implementation • 18 May 2021 • Lifeng Zhou, Vishnu D. Sharma, QingBiao Li, Amanda Prorok, Alejandro Ribeiro, Pratap Tokekar, Vijay Kumar
We demonstrate the performance of our GNN-based learning approach in a scenario of active target tracking with large networks of robots.
1 code implementation • 24 Mar 2021 • Ryan Kortvelesy, Amanda Prorok
Recent work in the multi-agent domain has shown the promise of Graph Neural Networks (GNNs) to learn complex coordination strategies.
2 code implementations • 3 Mar 2021 • Jacopo Panerati, Hehui Zheng, SiQi Zhou, James Xu, Amanda Prorok, Angela P. Schoellig
Robotic simulators are crucial for academic research and education as well as the development of safety-critical applications.
no code implementations • 29 Dec 2020 • Fernando Gama, QingBiao Li, Ekaterina Tolstaya, Amanda Prorok, Alejandro Ribeiro
Dynamical systems consisting of a set of autonomous agents face the challenge of having to accomplish a global task, relying only on local information.
no code implementations • 1 Dec 2020 • Rupert Mitchell, Jan Blumenkamp, Amanda Prorok
In this paper, we consider the problem of providing robustness to adversarial communication in multi-agent systems.
1 code implementation • 26 Nov 2020 • QingBiao Li, Weizhe Lin, Zhe Liu, Amanda Prorok
Our Message-Aware Graph Attention neTwork (MAGAT) is based on a key-query-like mechanism that determines the relative importance of features in the messages received from various neighboring robots.
1 code implementation • 6 Aug 2020 • Jan Blumenkamp, Amanda Prorok
Such a design choice, however, precludes the existence of a single, differentiable communication channel, and consequently prohibits the learning of inter-agent communication strategies.
1 code implementation • 11 May 2020 • Binyu Wang, Zhe Liu, Qing-Biao Li, Amanda Prorok
Path planning for mobile robots in large dynamic environments is a challenging problem, as the robots are required to efficiently reach their given goals while simultaneously avoiding potential conflicts with other robots or dynamic objects.
1 code implementation • 12 Dec 2019 • Qing-Biao Li, Fernando Gama, Alejandro Ribeiro, Amanda Prorok
We train the model to imitate an expert algorithm, and use the resulting model online in decentralized planning involving only local communication and local observations.
no code implementations • 26 Nov 2019 • Rupert Mitchell, Jenny Fletcher, Jacopo Panerati, Amanda Prorok
In this article, we present a safe and efficient framework that enables the learning of driving policies for autonomous vehicles operating in a shared workspace, where the absence of collisions cannot be guaranteed.
2 code implementations • 23 Sep 2019 • Hehui Zheng, Jacopo Panerati, Giovanni Beltrame, Amanda Prorok
We present a method that generates private flocking controllers that hide the identity of the leader robot.
no code implementations • 9 May 2018 • Amanda Prorok
This paper shows how Graph Neural Networks can be used for learning distributed coordination mechanisms in connected teams of robots.