Search Results for author: Ajin George Joseph

Found 3 papers, 0 papers with code

An Online Prediction Algorithm for Reinforcement Learning with Linear Function Approximation using Cross Entropy Method

no code implementations15 Jun 2018 Ajin George Joseph, Shalabh Bhatnagar

In this paper, we provide two new stable online algorithms for the problem of prediction in reinforcement learning, \emph{i. e.}, estimating the value function of a model-free Markov reward process using the linear function approximation architecture and with memory and computation costs scaling quadratically in the size of the feature set.

Computational Efficiency Reinforcement Learning (RL)

A Cross Entropy based Optimization Algorithm with Global Convergence Guarantees

no code implementations31 Jan 2018 Ajin George Joseph, Shalabh Bhatnagar

The cross entropy (CE) method is a model based search method to solve optimization problems where the objective function has minimal structure.

An Incremental Off-policy Search in a Model-free Markov Decision Process Using a Single Sample Path

no code implementations31 Jan 2018 Ajin George Joseph, Shalabh Bhatnagar

In this paper, we consider a modified version of the control problem in a model free Markov decision process (MDP) setting with large state and action spaces.

Cannot find the paper you are looking for? You can Submit a new open access paper.