Search Results for author: Sungee Hong

Found 1 papers, 0 papers with code

Distributional Off-policy Evaluation with Bellman Residual Minimization

no code implementations2 Feb 2024 Sungee Hong, Zhengling Qi, Raymond K. W. Wong

We consider the problem of distributional off-policy evaluation which serves as the foundation of many distributional reinforcement learning (DRL) algorithms.

Distributional Reinforcement Learning Off-policy evaluation

Cannot find the paper you are looking for? You can Submit a new open access paper.