Search Results for author: R. Song

Found 1 paper, 1 paper with code

Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings

1 code implementation • 13 Jan 2020 • C. Shi, S. Zhang, W. Lu, R. Song

We propose to model the action-value state function (Q-function) associated with a policy using the series/sieve method in order to derive its confidence interval.

Decision Making, reinforcement-learning
