TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
SMAC+	Def_Armored_parallel	IQL	Median Win Rate	0.0	# 6
SMAC+	Def_Armored_sequential	IQL	Median Win Rate	9.4	# 8
SMAC+	Def_Infantry_parallel	IQL	Median Win Rate	40.0	# 8
SMAC+	Def_Infantry_sequential	IQL	Median Win Rate	93.8	# 7
SMAC+	Def_Outnumbered_parallel	IQL	Median Win Rate	0.0	# 4
SMAC+	Def_Outnumbered_sequential	IQL	Median Win Rate	0.0	# 5
SMAC+	Off_Hard_parallel	IQL	Median Win Rate	0.0	# 3
SMAC+	Off_Superhard_parallel	IQL	Median Win Rate	0.0	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-off-superhard-parallel)](https://paperswithcode.com/sota/smac-on-smac-off-superhard-parallel?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-off-hard-parallel)](https://paperswithcode.com/sota/smac-on-smac-off-hard-parallel?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-def-outnumbered-parallel)](https://paperswithcode.com/sota/smac-on-smac-def-outnumbered-parallel?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-def-outnumbered-sequential)](https://paperswithcode.com/sota/smac-on-smac-def-outnumbered-sequential?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-def-armored-parallel)](https://paperswithcode.com/sota/smac-on-smac-def-armored-parallel?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-def-infantry-sequential)](https://paperswithcode.com/sota/smac-on-smac-def-infantry-sequential?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-def-armored-sequential)](https://paperswithcode.com/sota/smac-on-smac-def-armored-sequential?p=the-starcraft-multi-agent-challenges-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-starcraft-multi-agent-challenges-learning/smac-on-smac-def-infantry-parallel)](https://paperswithcode.com/sota/smac-on-smac-def-infantry-parallel?p=the-starcraft-multi-agent-challenges-learning)`

The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks and Environmental Factors without Precise Reward Functions

5 Jul 2022 · Mingyu Kim, Jihwan Oh, Yongsik Lee, Joonkee Kim, SeongHwan Kim, Song Chong, Se-Young Yun ·

In this paper, we propose a novel benchmark called the StarCraft Multi-Agent Challenges+, where agents learn to perform multi-stage tasks and to use environmental factors without precise reward functions. The previous challenges (SMAC) recognized as a standard benchmark of Multi-Agent Reinforcement Learning are mainly concerned with ensuring that all agents cooperatively eliminate approaching adversaries only through fine manipulation with obvious reward functions. This challenge, on the other hand, is interested in the exploration capability of MARL algorithms to efficiently learn implicit multi-stage tasks and environmental factors as well as micro-control. This study covers both offensive and defensive scenarios. In the offensive scenarios, agents must learn to first find opponents and then eliminate them. The defensive scenarios require agents to use topographic features. For example, agents need to position themselves behind protective structures to make it harder for enemies to attack. We investigate MARL algorithms under SMAC+ and observe that recent approaches work well in similar settings to the previous challenges, but misbehave in offensive scenarios. Additionally, we observe that an enhanced exploration approach has a positive effect on performance but is not able to completely solve all scenarios. This study proposes new directions for future research.

PDF Abstract

Code

Add Remove Mark official

osilab-kaist/smac_exp official

Tasks

Add Remove

Multi-agent Reinforcement Learning

SMAC+

Datasets

Introduced in the Paper:

Def_Outnumbered_sequential

Def_Armored_sequential

Def_Infantry_sequential

Off_Hard_sequential

Off_Near_sequential

Off_Distant_sequential

Off_Complicated_sequential

Off_Superhard_sequential

Used in the Paper:

SMAC-Exp

Def_Infantry_parallel

Off_Hard_parallel

Off_Superhard_parallel

Def_Outnumbered_parallel

Def_Armored_parallel

Results from the Paper

Edit

Ranked #1 on SMAC+ on Off_Superhard_parallel

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
SMAC+	Def_Armored_parallel	IQL	Median Win Rate	0.0	# 6	Compare
SMAC+	Def_Armored_sequential	IQL	Median Win Rate	9.4	# 8	Compare
SMAC+	Def_Infantry_parallel	IQL	Median Win Rate	40.0	# 8	Compare
SMAC+	Def_Infantry_sequential	IQL	Median Win Rate	93.8	# 7	Compare
SMAC+	Def_Outnumbered_parallel	IQL	Median Win Rate	0.0	# 4	Compare
SMAC+	Def_Outnumbered_sequential	IQL	Median Win Rate	0.0	# 5	Compare
SMAC+	Off_Hard_parallel	IQL	Median Win Rate	0.0	# 3	Compare
SMAC+	Off_Superhard_parallel	IQL	Median Win Rate	0.0	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks and Environmental Factors without Precise Reward Functions

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove