no code implementations • 23 Mar 2024 • FNU Hairi, Zifan Zhang, Jia Liu
This leads to an interesting open question: Can the local TD-update approach entail low sample and communication complexities?
no code implementations • ICLR 2022 • FNU Hairi, Jia Liu, Songtao Lu
In this paper, we establish the first finite-time convergence result of the actor-critic algorithm for fully decentralized multi-agent reinforcement learning (MARL) problems with average reward.
Multi-agent Reinforcement Learning • reinforcement-learning