Search Results for author: Raghuram Bharadwaj Diddigi

Found 10 papers, 4 papers with code

Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm

no code implementations • 19 Oct 2021 • Raghuram Bharadwaj Diddigi, Prateek Jain, Prabuchandran K. J., Shalabh Bhatnagar

Learning optimal behavior from existing data is one of the most important problems in Reinforcement Learning (RL).

Reinforcement Learning (RL)

Paper
Add Code

Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning

2 code implementations • 7 Jan 2021 • P. Parnika, Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Shalabh Bhatnagar

In this work, we consider the problem of computing optimal actions for Reinforcement Learning (RL) agents in a co-operative setting, where the objective is to optimize a common goal.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

A Stochastic Game Framework for Efficient Energy Management in Microgrid Networks

1 code implementation • 6 Feb 2020 • Shravan Nayak, Chanakya Ajit Ekbote, Annanya Pratap Singh Chauhan, Raghuram Bharadwaj Diddigi, Prishita Ray, Abhinava Sikdar, Sai Koti Reddy Danda, Shalabh Bhatnagar

A microgrid is capable of generating a limited amount of energy from a renewable resource and is responsible for handling the demands of its dedicated customers.

energy trading Management +2

Paper
Code

A Convergent Off-Policy Temporal Difference Algorithm

1 code implementation • 13 Nov 2019 • Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

In this work, we propose a convergent on-line off-policy TD algorithm under linear function approximation.

Reinforcement Learning (RL)

Paper
Code

A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games

no code implementations • 16 Jun 2019 • Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

This problem is formulated as a min-max Markov game in the literature.

Q-Learning

Paper
Add Code

Generalized Second Order Value Iteration in Markov Decision Processes

2 code implementations • 10 May 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

In this work, we propose a second order value iteration procedure that is obtained by applying the Newton-Raphson method to the successive relaxation value iteration scheme.

Paper
Code

Successive Over Relaxation Q-Learning

no code implementations • 9 Mar 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

We first derive a modified fixed point iteration for SOR Q-values and utilize stochastic approximation to derive a learning algorithm to compute the optimal value function and an optimal policy.

Q-Learning Reinforcement Learning (RL)

Paper
Add Code

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

no code implementations • 11 Feb 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Prabuchandran K. J., Shalabh Bhatnagar

In many of the practical applications, the analytical form of the density is not known and only the samples from the distribution are available.

Paper
Add Code

Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks

no code implementations • 27 Aug 2017 • Raghuram Bharadwaj Diddigi, Prabuchandran K. J., Shalabh Bhatnagar

We consider the problem of tracking an intruder using a network of wireless sensors.

Intrusion Detection Reinforcement Learning (RL) +1

Paper
Add Code

Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids

no code implementations • 25 Aug 2017 • Raghuram Bharadwaj Diddigi, D. Sai Koti Reddy, Shalabh Bhatnagar

Finally, we also consider a variant of this problem where the cost of power production at the main site is taken into consideration.

Q-Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.