SMAC
38 papers with code • 11 benchmarks • 1 dataset
The StarCraft Multi-Agent Challenge (SMAC) is a benchmark that provides elements of partial observability, challenging dynamics, and high-dimensional observation spaces. SMAC is built using the StarCraft II game engine, creating a testbed for research in cooperative MARL where each game unit is an independent RL agent.
Libraries
Use these libraries to find SMAC models and implementations.

Latest papers
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
In this work, we conduct a new analysis demonstrating that SMAC lacks the stochasticity and partial observability needed to require complex *closed-loop* policies.
Effects of Spectral Normalization in Multi-agent Reinforcement Learning
A reliable critic is central to on-policy actor-critic learning.
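The paper above applies spectral normalization to the critic. The operation itself is standard: divide a weight matrix by an estimate of its largest singular value, typically obtained with power iteration. A minimal NumPy sketch (illustrative only, not the paper's code):

```python
import numpy as np

def spectral_normalize(w, n_iters=30):
    """Scale weight matrix `w` so its spectral norm (largest singular
    value) is approximately 1, using power iteration to estimate it."""
    u = np.random.default_rng(0).normal(size=w.shape[0])
    for _ in range(n_iters):
        v = w.T @ u
        v /= np.linalg.norm(v)
        u = w @ v
        u /= np.linalg.norm(u)
    sigma = u @ w @ v  # estimated largest singular value
    return w / sigma
```

Constraining the critic's Lipschitz constant this way is one common route to the "reliable critic" the abstract refers to.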
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
In the learning phase, each agent minimizes a TD error that depends on how the subsequent agents react to its chosen action.
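The action-dependency idea can be pictured as sequential decision-making: agents choose actions one at a time, each conditioning on the actions already fixed by its predecessors. The sketch below is a hypothetical illustration of that forward pass (the per-agent value functions and their signature are assumptions, not ACE's actual interface):

```python
def sequential_greedy(q_fns, state, n_actions):
    """Pick actions one agent at a time. `q_fns[i](state, prev_actions, a)`
    is a hypothetical per-agent action-value conditioned on the actions
    of agents 0..i-1; in ACE, agent i's TD target additionally reflects
    how agents i+1..n respond (the "bidirectional" dependency)."""
    chosen = []
    for q in q_fns:
        best = max(range(n_actions), key=lambda a: q(state, tuple(chosen), a))
        chosen.append(best)
    return chosen
```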
Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
Value Decomposition (VD) aims to deduce the contributions of agents for decentralized policies in the presence of only global rewards, and has recently emerged as a powerful credit assignment paradigm for tackling cooperative Multi-Agent Reinforcement Learning (MARL) problems.
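The simplest instance of this paradigm is VDN-style additive factorisation, where the joint action-value is the sum of per-agent utilities. A minimal sketch of that baseline (not this paper's method):

```python
import numpy as np

def vdn_joint_q(per_agent_q, actions):
    """Additive value decomposition: the joint action-value is the sum
    of per-agent utilities, trained against the single global reward."""
    return sum(q[a] for q, a in zip(per_agent_q, actions))

def greedy_joint_action(per_agent_q):
    # Because the factorisation is a sum, each agent acting greedily on
    # its own utility also maximises the joint value (the IGM property).
    return [int(np.argmax(q)) for q in per_agent_q]
```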
Latent State Marginalization as a Low-cost Approach for Improving Exploration
While the maximum entropy (MaxEnt) reinforcement learning (RL) framework -- often touted for its exploration and robustness capabilities -- is usually motivated from a probabilistic perspective, the use of deep probabilistic models has not gained much traction in practice due to their inherent complexity.
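For a single step, the MaxEnt-optimal policy is a Boltzmann distribution over action values, pi(a|s) proportional to exp(Q(s,a)/alpha), where the temperature alpha trades off return against entropy. A small sketch of that policy (generic MaxEnt RL, not this paper's latent-variable model):

```python
import numpy as np

def maxent_policy(q_values, alpha=1.0):
    """Boltzmann policy pi(a|s) ∝ exp(Q(s,a)/alpha): maximiser of
    expected value plus alpha times policy entropy for one step."""
    logits = np.asarray(q_values, dtype=float) / alpha
    logits -= logits.max()       # shift for numerical stability
    p = np.exp(logits)
    return p / p.sum()
```

As alpha grows, the policy approaches uniform (pure exploration); as alpha shrinks, it approaches the greedy argmax.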
Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft
The StarCraft II Multi-Agent Challenge (SMAC) was created to be a challenging benchmark problem for cooperative multi-agent reinforcement learning (MARL).
Scalable Multi-Agent Model-Based Reinforcement Learning
While in mixed environments full autonomy of the agents can be a desirable outcome, cooperative environments allow agents to share information to facilitate coordination.
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
HGCN-MIX models agents as well as their relationships as a hypergraph, where agents are nodes and hyperedges among nodes indicate that the corresponding agents can coordinate to achieve larger rewards.
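A hypergraph over agents is conveniently stored as an incidence matrix H, with H[i, e] = 1 when agent i belongs to hyperedge e. A standard degree-normalised hypergraph propagation step (a generic operator in the spirit of hypergraph convolution, not necessarily HGCN-MIX's exact layer) looks like:

```python
import numpy as np

def hypergraph_conv(H, X):
    """One propagation step X' = D_v^{-1} H D_e^{-1} H^T X:
    features are averaged within each hyperedge, then each agent
    averages over the hyperedges it belongs to."""
    H = np.asarray(H, dtype=float)
    X = np.asarray(X, dtype=float)
    d_v = H.sum(axis=1)          # agent (vertex) degrees
    d_e = H.sum(axis=0)          # hyperedge degrees
    edge_means = (H.T @ X) / d_e[:, None]
    return (H @ edge_means) / d_v[:, None]
```

Agents sharing a hyperedge thus mix features, which is how coordinating groups can be expressed beyond pairwise graph edges.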
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
In this paper, we facilitate the research by providing large-scale datasets, and use them to examine the usage of the Decision Transformer in the context of MARL.
SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning
This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory.
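The Shapley value underlying SHAQ assigns each agent its marginal contribution averaged over all orders in which agents could join the coalition. A brute-force sketch for tiny games (enumeration is exponential, so this is for illustration only; SHAQ itself learns an approximation):

```python
from itertools import permutations

def shapley_values(agents, v):
    """Exact Shapley values by enumerating arrival orders:
    phi_i = average over orders of v(pred ∪ {i}) - v(pred),
    where `v` maps a frozenset coalition to its value."""
    phi = {i: 0.0 for i in agents}
    orders = list(permutations(agents))
    for order in orders:
        coalition = frozenset()
        for i in order:
            phi[i] += v(coalition | {i}) - v(coalition)
            coalition = coalition | {i}
    return {i: phi[i] / len(orders) for i in agents}
```

The resulting values sum to the grand coalition's value, which is the efficiency property that makes Shapley theory attractive for credit assignment.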