no code implementations • 10 Mar 2024 • Chuning Zhu, Xinqi Wang, Tyler Han, Simon S. Du, Abhishek Gupta
In this work, we propose a novel class of models - generalized occupancy models (GOMs) - that retain the generality of model-based RL while avoiding compounding error.
no code implementations • 18 Aug 2023 • Pengbo Hu, Ji Qi, Xingyu Li, Hong Li, Xinqi Wang, Bing Quan, Ruiyu Wang, Yi Zhou
Our approach succeeds in performance while significantly saving inference steps.
no code implementations • 1 Jun 2022 • Xinqi Wang, Qiwen Cui, Simon S. Du
This paper presents a systematic study on gap-dependent sample complexity in offline reinforcement learning.