Search Results for author: H. L. Prasad

Found 3 papers, 0 papers with code

A constrained optimization perspective on actor critic algorithms and application to network routing

no code implementations28 Jul 2015 Prashanth L. A., H. L. Prasad, Shalabh Bhatnagar, Prakash Chandra

We propose a novel actor-critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process.

A Study of Gradient Descent Schemes for General-Sum Stochastic Games

no code implementations1 Jul 2015 H. L. Prasad, Shalabh Bhatnagar

However, the optimization problem there has a non-linear objective and non-linear constraints with special structure.

Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games

no code implementations8 Jan 2014 H. L. Prasad, L. A. Prashanth, Shalabh Bhatnagar

We then provide a characterization of solution points of these sub-problems that correspond to Nash equilibria of the underlying game and for this purpose, we derive a set of necessary and sufficient SG-SP (Stochastic Game - Sub-Problem) conditions.

Cannot find the paper you are looking for? You can Submit a new open access paper.