Browse > Methodology > Multi-agent Reinforcement Learning

Multi-agent Reinforcement Learning

16 papers with code · Methodology

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

2 Dec 2017geek-ai/MAgent

Unlike previous research platforms on single or multi-agent reinforcement learning, MAgent focuses on supporting the tasks and the applications that require hundreds to millions of agents. MAgent is highly scalable and can host up to one million agents on a single GPU server.

MULTI-AGENT REINFORCEMENT LEARNING

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

NeurIPS 2016 iassael/learning-to-communicate

We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks.

MULTI-AGENT REINFORCEMENT LEARNING Q-LEARNING

The StarCraft Multi-Agent Challenge

11 Feb 2019oxwhirl/pymarl

A particularly challenging class of problems in this area is partially observable, cooperative, multi-agent learning, in which teams of agents must learn to coordinate their behaviour while conditioning only on their private observations. In this paper, we propose the StarCraft Multi-Agent Challenge (SMAC) as a benchmark problem to fill this gap.

MULTI-AGENT REINFORCEMENT LEARNING STARCRAFT STARCRAFT II

Mean Field Multi-Agent Reinforcement Learning

ICML 2018 mlii/mfrl

Existing multi-agent reinforcement learning methods are limited typically to a small number of agents. When the agent number increases largely, the learning becomes intractable due to the curse of the dimensionality and the exponential growth of agent interactions.

MULTI-AGENT REINFORCEMENT LEARNING Q-LEARNING

Learning with Opponent-Learning Awareness

13 Sep 2017alshedivat/lola

The LOLA learning rule includes a term that accounts for the impact of one agent's policy on the anticipated parameter update of the other agents. We also show that the LOLA update rule can be efficiently calculated using an extension of the policy gradient estimator, making the method suitable for model-free RL.

MULTI-AGENT REINFORCEMENT LEARNING

Deep Multi-Agent Reinforcement Learning with Relevance Graphs

30 Nov 2018tegg89/mann

Over recent years, deep reinforcement learning has shown strong successes in complex single-agent tasks, and more recently this approach has also been applied to multi-agent domains. In this paper, we propose a novel approach, called MAGnet, to multi-agent reinforcement learning (MARL) that utilizes a relevance graph representation of the environment obtained by a self-attention mechanism, and a message-generation technique inspired by the NerveNet architecture.

MULTI-AGENT REINFORCEMENT LEARNING

Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning

ICLR 2019 eugenevinitsky/sequential_social_dilemma_games

We propose a unified mechanism for achieving coordination and communication in Multi-Agent Reinforcement Learning (MARL), through rewarding agents for having causal influence over other agents' actions. Causal influence is assessed using counterfactual reasoning.

MULTI-AGENT REINFORCEMENT LEARNING

Inequity aversion improves cooperation in intertemporal social dilemmas

NeurIPS 2018 eugenevinitsky/sequential_social_dilemma_games

Groups of humans are often able to find ways to cooperate with one another in complex, temporally extended social dilemmas. This promotes a particular resolution of the matrix game social dilemma wherein inequity-averse individuals are personally pro-social and punish defectors.

MULTI-AGENT REINFORCEMENT LEARNING

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

10 Feb 2017eugenevinitsky/sequential_social_dilemma_games

We introduce sequential social dilemmas that share the mixed incentive structure of matrix game social dilemmas but also require agents to learn policies that implement their strategic intentions. We analyze the dynamics of policies learned by multiple self-interested independent learning agents, each using its own deep Q-network, on two Markov games we introduce here: 1. a fruit Gathering game and 2. a Wolfpack hunting game.

MULTI-AGENT REINFORCEMENT LEARNING

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

18 Feb 2018illidanlab/Simulator

Large-scale online ride-sharing platforms have substantially transformed our lives by reallocating transportation resources to alleviate traffic congestion and promote transportation efficiency. An efficient fleet management strategy not only can significantly improve the utilization of transportation resources but also increase the revenue and customer satisfaction.

MULTI-AGENT REINFORCEMENT LEARNING Q-LEARNING