Neural Replicator Dynamics

1 Jun 2019

Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Remi Munos, Julien Perolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar Duenez-Guzman, Karl Tuyls

Policy gradient and actor-critic algorithms form the basis of many commonly used training techniques in deep reinforcement learning. Using these algorithms in multiagent environments poses problems such as nonstationarity and instability...




