Search Results for author: Prasenjit Karmakar

Found 7 papers, 0 papers with code

A Practical AoI Scheduler in IoT Networks with Relays

no code implementations8 Mar 2022 Biplav Choudhury, Prasenjit Karmakar, Vijay K. Shah, Jeffrey H. Reed

The proposed v-PPO based AoI scheduler adapts well to changing network conditions and accounts for unknown traffic generation patterns, making it practical for real-world IoT deployments.

Scheduling

Exploiting Multi-modal Contextual Sensing for City-bus's Stay Location Characterization: Towards Sub-60 Seconds Accurate Arrival Time Prediction

no code implementations24 May 2021 Ratna Mandal, Prasenjit Karmakar, Soumyajit Chatterjee, Debaleen Das Spandan, Shouvit Pradhan, Sujoy Saha, Sandip Chakraborty, Subrata Nandi

However, providing prior information for transportation systems like public buses in real-time is inherently challenging because of the diverse nature of different stay-locations that a public bus stops.

Stochastic Approximation with Markov Noise: Analysis and applications in reinforcement learning

no code implementations8 Apr 2020 Prasenjit Karmakar

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by "controlled" Markov noise.

reinforcement-learning Reinforcement Learning (RL)

On the function approximation error for risk-sensitive reinforcement learning

no code implementations22 Dec 2016 Prasenjit Karmakar, Shalabh Bhatnagar

The novelty of our approach is that we use the irreduciblity of Markov chain to get the new bounds whereas the earlier work by Basu et al. used spectral variation bound which is true for any matrix.

reinforcement-learning Reinforcement Learning (RL)

On a convergent off -policy temporal difference learning algorithm in on-line learning environment

no code implementations19 May 2016 Prasenjit Karmakar, Rajkumar Maity, Shalabh Bhatnagar

In this paper we provide a rigorous convergence analysis of a "off"-policy temporal difference learning algorithm with linear function approximation and per time-step linear computational complexity in "online" learning environment.

Two Timescale Stochastic Approximation with Controlled Markov noise and Off-policy temporal difference learning

no code implementations31 Mar 2015 Prasenjit Karmakar, Shalabh Bhatnagar

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by `controlled' Markov noise.

Cannot find the paper you are looking for? You can Submit a new open access paper.