2 code implementations • ICLR 2022 • Chongchong Li, Yue Wang, Wei Chen, YuTing Liu, Zhi-Ming Ma, Tie-Yan Liu
Then we proposed a two-model-based learning method to control the prediction error and the gradient error.
Continuous Control Model-based Reinforcement Learning