Multi-agent Reinforcement Learning
387 papers with code • 3 benchmarks • 9 datasets
The goal of Multi-agent Reinforcement Learning is to solve complex problems by coordinating multiple agents that focus on different sub-tasks. In general, multi-agent systems fall into two types: independent and cooperative systems.
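As a minimal illustration of the independent setting, the sketch below runs two tabular Q-learners in a toy shared environment; each agent updates its own Q-table from its own reward and treats the other agent as part of the environment. The environment dynamics, reward, and hyperparameters are all illustrative assumptions, not from any paper listed here.

```python
import random

random.seed(0)

# Toy setup: 2 agents, 2 states, 2 actions. Reward 1 for each agent only when
# both choose action 1 (a simple coordination problem). All values illustrative.
N_STATES, N_ACTIONS = 2, 2
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.2

def step(state, actions):
    reward = 1.0 if all(a == 1 for a in actions) else 0.0
    return (state + 1) % N_STATES, [reward, reward]

# One independent Q-table per agent: q[state][action]
q_tables = [[[0.0] * N_ACTIONS for _ in range(N_STATES)] for _ in range(2)]

state = 0
for _ in range(10000):
    actions = []
    for q in q_tables:
        if random.random() < EPS:                                   # explore
            actions.append(random.randrange(N_ACTIONS))
        else:                                                       # exploit
            actions.append(max(range(N_ACTIONS), key=lambda a: q[state][a]))
    next_state, rewards = step(state, actions)
    for q, a, r in zip(q_tables, actions, rewards):
        # Each agent updates independently from its own reward signal.
        q[state][a] += ALPHA * (r + GAMMA * max(q[next_state]) - q[state][a])
    state = next_state

best = [max(range(N_ACTIONS), key=lambda a: q[0][a]) for q in q_tables]
```

With enough exploration both independent learners discover the cooperative joint action, even though neither ever observes the other's action.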
Source: Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports
Libraries
Use these libraries to find Multi-agent Reinforcement Learning models and implementations.
Latest papers
Laser Learning Environment: A new environment for coordination-critical multi-agent tasks
We introduce the Laser Learning Environment (LLE), a collaborative multi-agent reinforcement learning environment in which coordination is central.
GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems
For multi-agent reinforcement learning systems (MARLS), problem formulation generally requires substantial reward-engineering effort specific to each problem.
Inferring Latent Temporal Sparse Coordination Graph for Multi-Agent Reinforcement Learning
LTS-CG leverages agents' historical observations to compute an agent-pair probability matrix, from which a sparse graph is sampled and used for knowledge exchange between agents, simultaneously capturing agent dependencies and relation uncertainty.
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
We first propose a selective communication block to gather richer information for better agent coordination within multi-agent environments and train the model with a Q-learning-based algorithm.
Generalising Multi-Agent Cooperation through Task-Agnostic Communication
Our objective is to learn a fixed-size latent Markov state from a variable number of agent observations.
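A standard way to obtain a fixed-size representation from a variable number of agent observations is a shared per-agent projection followed by permutation-invariant pooling. The sketch below uses mean pooling with random placeholder weights; it is a generic illustration of the idea, not the paper's architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Shared projection applied to every agent's observation, then mean-pooled.
# W stands in for learned weights; dimensions are illustrative.
OBS_DIM, LATENT_DIM = 4, 8
W = rng.normal(size=(OBS_DIM, LATENT_DIM))

def encode(observations):
    # observations: (n_agents, OBS_DIM) -> fixed-size (LATENT_DIM,) latent,
    # regardless of how many agents are present.
    projected = np.tanh(observations @ W)  # same weights for every agent
    return projected.mean(axis=0)          # pooling removes the agent axis

z3 = encode(rng.normal(size=(3, OBS_DIM)))  # 3 agents
z7 = encode(rng.normal(size=(7, OBS_DIM)))  # 7 agents, same latent size
```

Because the pooling is a mean, the latent is also invariant to the ordering of the agents.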
PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement Learning
Agents share Q-value network periodically during the training process.
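The periodic-sharing idea can be sketched as follows: agents train independently, and every `SHARE_EVERY` steps their parameters are synchronized. The element-wise averaging rule here is an illustrative choice, not necessarily the paper's exact scheme.

```python
import copy

SHARE_EVERY = 10  # synchronize every 10 training steps (illustrative)

def average_params(agent_params):
    # Replace each agent's parameters with the element-wise mean across agents.
    n = len(agent_params)
    mean = {k: sum(p[k] for p in agent_params) / n for k in agent_params[0]}
    return [copy.deepcopy(mean) for _ in agent_params]

# Toy "parameters" for two agents; real code would hold network weights.
params = [{"w": 1.0}, {"w": 3.0}]
for step_i in range(1, 21):
    # ... each agent would perform an independent gradient update here ...
    if step_i % SHARE_EVERY == 0:
        params = average_params(params)
```

Periodic (rather than constant) sharing lets agents specialize between synchronization points while still pooling experience.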
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
To address this, we introduce Efficient episodic Memory Utilization (EMU) for MARL, with two primary objectives: (a) accelerating reinforcement learning by leveraging semantically coherent memory from an episodic buffer and (b) selectively promoting desirable transitions to prevent local convergence.
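The core of an episodic buffer can be illustrated with a dictionary that keeps, per state, the best return ever observed from it; this is a generic sketch of episodic memory, not EMU's actual design.

```python
# Episodic memory sketch: remember the best return seen from each state key,
# so desirable transitions can later serve as optimistic learning targets.
memory = {}

def update_memory(state_key, observed_return):
    prev = memory.get(state_key)
    if prev is None or observed_return > prev:
        memory[state_key] = observed_return

update_memory("s0", 1.0)
update_memory("s0", 0.5)  # worse return is ignored
update_memory("s1", 2.0)
```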
Independent Learning in Constrained Markov Potential Games
We propose an independent policy gradient algorithm for learning approximate constrained Nash equilibria: Each agent observes their own actions and rewards, along with a shared state.
Modelling crypto markets by multi-agent reinforcement learning
Building on earlier foundational work (Lussange et al. 2020), this study introduces a multi-agent reinforcement learning (MARL) model of crypto markets, calibrated to Binance's daily closing prices of 153 cryptocurrencies that were continuously traded between 2018 and 2022.
Risk-Sensitive Multi-Agent Reinforcement Learning in Network Aggregative Markov Games
Under a set of assumptions, we prove the convergence of the algorithm to a subjective notion of Markov perfect Nash equilibrium in NAMGs.