Search Results for author: Soumyajit Guin

Found 2 papers, 1 paper with code

A policy gradient approach for Finite Horizon Constrained Markov Decision Processes

1 code implementation | 10 Oct 2022 | Soumyajit Guin, Shalabh Bhatnagar

In many situations, finite horizon control problems are of interest and for such problems, the optimal policies are time-varying in general.

reinforcement-learning, Reinforcement Learning (RL)

Actor-Critic or Critic-Actor? A Tale of Two Time Scales

no code implementations | 10 Oct 2022 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin

We revisit the standard formulation of the tabular actor-critic algorithm as a two time-scale stochastic approximation, with the value function computed on a faster time-scale and the policy computed on a slower time-scale.

Vocal Bursts Valence Prediction
