Search Results for author: Thomas Moscibroda

Found 2 papers, 1 paper with code

Conservative State Value Estimation for Offline Reinforcement Learning

1 code implementation NeurIPS 2023 Liting Chen, Jie Yan, Zhengdao Shao, Lu Wang, Qingwei Lin, Saravan Rajmohan, Thomas Moscibroda, Dongmei Zhang

In this paper, we propose Conservative State Value Estimation (CSVE), a new approach that learns a conservative V-function by directly imposing a penalty on out-of-distribution (OOD) states.

D4RL reinforcement-learning
