Search Results for author: Naman Saxena

Found 3 papers, 0 papers with code

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

no code implementations • 20 May 2023 • Naman Saxena, Subhojyoti Khastigir, Shishir Kolathaya, Shalabh Bhatnagar

In this work, we present both on-policy and off-policy deterministic policy gradient theorems for the average reward performance criterion.

A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks

no code implementations • 20 May 2023 • Arunselvan Ramaswamy, Shalabh Bhatnagar, Naman Saxena

We show, in theory and through experiments, that our algorithm's updates have low variance and that the training loss decreases smoothly.

Q-Learning reinforcement-learning +1

Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning

no code implementations • 30 Nov 2022 • Naman Saxena, Gorantla Sandeep, Pushpak Jagtap

Signal Temporal Logic (STL) is a powerful framework for describing the complex temporal and logical behaviour of dynamical systems.

reinforcement-learning Reinforcement Learning (RL)
