1 code implementation • 1 Aug 2022 • Yongle Luo, Yuxin Wang, Kun Dong, Qiang Zhang, Erkang Cheng, Zhiyong Sun, Bo Song
To solve these tasks efficiently, we propose a novel self-guided continual RL framework, RelayHER (RHER).
no code implementations • 5 Mar 2020 • Yongle Luo, Kun Dong, Lili Zhao, Zhiyong Sun, Chao Zhou, Bo Song
The experimental results show that the Dense2Sparse method obtains a higher expected reward than methods using a dense or a sparse reward alone, and that it is also more tolerant of system uncertainty.