Search Results for author: Lakshmi Mandal

Found 2 papers, 1 papers with code

Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes

no code implementations20 Nov 2023 Lakshmi Mandal, Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

In this work, we consider a `cooperative' multi-agent Markov decision process (MDP) involving m greater than 1 agents, where all agents are aware of the system model.

n-Step Temporal Difference Learning with Optimal n

1 code implementation13 Mar 2023 Lakshmi Mandal, Shalabh Bhatnagar

We consider the problem of finding the optimal value of n in the n-step temporal difference (TD) learning algorithm.

Cannot find the paper you are looking for? You can Submit a new open access paper.