TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Multi-agent Reinforcement Learning	ParticleEnvs Cooperative Communication	MATD3	final agent reward	-14	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/reducing-overestimation-bias-in-multi-agent/multi-agent-reinforcement-learning-on)](https://paperswithcode.com/sota/multi-agent-reinforcement-learning-on?p=reducing-overestimation-bias-in-multi-agent)`

Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

3 Oct 2019 · Johannes Ackermann, Volker Gabler, Takayuki Osa, Masashi Sugiyama ·

Many real world tasks require multiple agents to work together. Multi-agent reinforcement learning (RL) methods have been proposed in recent years to solve these tasks, but current methods often fail to efficiently learn policies. We thus investigate the presence of a common weakness in single-agent RL, namely value function overestimation bias, in the multi-agent setting. Based on our findings, we propose an approach that reduces this bias by using double centralized critics. We evaluate it on six mixed cooperative-competitive tasks, showing a significant advantage over current methods. Finally, we investigate the application of multi-agent methods to high-dimensional robotic tasks and show that our approach can be used to learn decentralized policies in this domain.

PDF Abstract