Search Results for author: Subhojyoti Khastigir

Found 1 papers, 0 papers with code

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

no code implementations20 May 2023 Naman Saxena, Subhojyoti Khastigir, Shishir Kolathaya, Shalabh Bhatnagar

In this work, we present both on-policy and off-policy deterministic policy gradient theorems for the average reward performance criterion.

Cannot find the paper you are looking for? You can Submit a new open access paper.