Search Results for author: Santanu Rathod

Found 4 papers, 0 papers with code

Scheduling to Learn In An Unsupervised Online Streaming Model

no code implementations2 Dec 2021 R. Vaze, Santanu Rathod

The utility of a sample is a scalar multiple of its accuracy minus the response time (difference of the departure slot and the arrival slot), where the departure slot is also decided by the algorithm.

Scheduling

On reducing the order of arm-passes bandit streaming algorithms under memory bottleneck

no code implementations30 Nov 2021 Santanu Rathod

In this work we explore multi-arm bandit streaming model, especially in cases where the model faces resource bottleneck.

Global Convergence Using Policy Gradient Methods for Model-free Markovian Jump Linear Quadratic Control

no code implementations30 Nov 2021 Santanu Rathod, Manoj Bhadu, Abir De

Owing to the growth of interest in Reinforcement Learning in the last few years, gradient based policy control methods have been gaining popularity for Control problems as well.

Policy Gradient Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.