Search Results for author: Lakshmi Mandal

Found 2 papers, 1 papers with code

Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes

no code implementations • 20 Nov 2023 • Lakshmi Mandal, Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

In this work, we consider a `cooperative' multi-agent Markov decision process (MDP) involving m greater than 1 agents, where all agents are aware of the system model.

Paper
Add Code

n-Step Temporal Difference Learning with Optimal n

1 code implementation • 13 Mar 2023 • Lakshmi Mandal, Shalabh Bhatnagar

We consider the problem of finding the optimal value of n in the n-step temporal difference (TD) learning algorithm.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.