Search Results for author: Prasenjit Karmakar

Found 7 papers, 0 papers with code

AQuaMoHo: Localized Low-Cost Outdoor Air Quality Sensing over a Thermo-Hygrometer

no code implementations • 25 Apr 2022 • Prithviraj Pramanik, Prasenjit Karmakar, Praveen Kumar Sharma, Soumyajit Chatterjee, Abhijit Roy, Santanu Mandal, Subrata Nandi, Sandip Chakraborty, Mousumi Saha, Sujoy Saha

Efficient air quality sensing serves as one of the essential services provided in any recent smart city.

Paper
Add Code

A Practical AoI Scheduler in IoT Networks with Relays

no code implementations • 8 Mar 2022 • Biplav Choudhury, Prasenjit Karmakar, Vijay K. Shah, Jeffrey H. Reed

The proposed v-PPO based AoI scheduler adapts well to changing network conditions and accounts for unknown traffic generation patterns, making it practical for real-world IoT deployments.

Scheduling

Paper
Add Code

Exploiting Multi-modal Contextual Sensing for City-bus's Stay Location Characterization: Towards Sub-60 Seconds Accurate Arrival Time Prediction

no code implementations • 24 May 2021 • Ratna Mandal, Prasenjit Karmakar, Soumyajit Chatterjee, Debaleen Das Spandan, Shouvit Pradhan, Sujoy Saha, Sandip Chakraborty, Subrata Nandi

However, providing prior information for transportation systems like public buses in real-time is inherently challenging because of the diverse nature of different stay-locations that a public bus stops.

Paper
Add Code

Stochastic Approximation with Markov Noise: Analysis and applications in reinforcement learning

no code implementations • 8 Apr 2020 • Prasenjit Karmakar

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by "controlled" Markov noise.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

On the function approximation error for risk-sensitive reinforcement learning

no code implementations • 22 Dec 2016 • Prasenjit Karmakar, Shalabh Bhatnagar

The novelty of our approach is that we use the irreduciblity of Markov chain to get the new bounds whereas the earlier work by Basu et al. used spectral variation bound which is true for any matrix.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

On a convergent off -policy temporal difference learning algorithm in on-line learning environment

no code implementations • 19 May 2016 • Prasenjit Karmakar, Rajkumar Maity, Shalabh Bhatnagar

In this paper we provide a rigorous convergence analysis of a "off"-policy temporal difference learning algorithm with linear function approximation and per time-step linear computational complexity in "online" learning environment.

Paper
Add Code

Two Timescale Stochastic Approximation with Controlled Markov noise and Off-policy temporal difference learning

no code implementations • 31 Mar 2015 • Prasenjit Karmakar, Shalabh Bhatnagar

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by `controlled' Markov noise.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.