no code implementations • 21 Nov 2023 • Keshav P. Keval, Vivek S. Borkar
In this paper, we propose a reinforcement learning algorithm to solve a multi-agent Markov decision process (MMDP).
Q-Learning