Search Results for author: Raghuram Bharadwaj Diddigi

Found 10 papers, 4 papers with code

Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm

no code implementations19 Oct 2021 Raghuram Bharadwaj Diddigi, Prateek Jain, Prabuchandran K. J., Shalabh Bhatnagar

Learning optimal behavior from existing data is one of the most important problems in Reinforcement Learning (RL).

Reinforcement Learning (RL)

Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning

2 code implementations7 Jan 2021 P. Parnika, Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Shalabh Bhatnagar

In this work, we consider the problem of computing optimal actions for Reinforcement Learning (RL) agents in a co-operative setting, where the objective is to optimize a common goal.

reinforcement-learning Reinforcement Learning (RL)

A Convergent Off-Policy Temporal Difference Algorithm

1 code implementation13 Nov 2019 Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

In this work, we propose a convergent on-line off-policy TD algorithm under linear function approximation.

Reinforcement Learning (RL)

Generalized Second Order Value Iteration in Markov Decision Processes

2 code implementations10 May 2019 Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

In this work, we propose a second order value iteration procedure that is obtained by applying the Newton-Raphson method to the successive relaxation value iteration scheme.

Successive Over Relaxation Q-Learning

no code implementations9 Mar 2019 Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

We first derive a modified fixed point iteration for SOR Q-values and utilize stochastic approximation to derive a learning algorithm to compute the optimal value function and an optimal policy.

Q-Learning Reinforcement Learning (RL)

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

no code implementations11 Feb 2019 Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Prabuchandran K. J., Shalabh Bhatnagar

In many of the practical applications, the analytical form of the density is not known and only the samples from the distribution are available.

Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids

no code implementations25 Aug 2017 Raghuram Bharadwaj Diddigi, D. Sai Koti Reddy, Shalabh Bhatnagar

Finally, we also consider a variant of this problem where the cost of power production at the main site is taken into consideration.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.