Search Results for author: H. L. Prasad

Found 3 papers, 0 papers with code

A constrained optimization perspective on actor critic algorithms and application to network routing

no code implementations • 28 Jul 2015 • Prashanth L. A., H. L. Prasad, Shalabh Bhatnagar, Prakash Chandra

We propose a novel actor-critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process.

A Study of Gradient Descent Schemes for General-Sum Stochastic Games

no code implementations • 1 Jul 2015 • H. L. Prasad, Shalabh Bhatnagar

However, the optimization problem there has a non-linear objective and non-linear constraints with special structure.

Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games

no code implementations • 8 Jan 2014 • H. L. Prasad, L. A. Prashanth, Shalabh Bhatnagar

We then provide a characterization of solution points of these sub-problems that correspond to Nash equilibria of the underlying game, and for this purpose we derive a set of necessary and sufficient SG-SP (Stochastic Game - Sub-Problem) conditions.
