Search Results for author: Rajkumar Maity

Found 1 paper, 0 papers with code

On a convergent off-policy temporal difference learning algorithm in on-line learning environment

no code implementations • 19 May 2016 • Prasenjit Karmakar, Rajkumar Maity, Shalabh Bhatnagar

In this paper we provide a rigorous convergence analysis of an "off"-policy temporal difference learning algorithm with linear function approximation and per time-step linear computational complexity in an "online" learning environment.
