no code implementations • 30 Dec 2023 • GuoJian Wang, Faguo Wu, Xiao Zhang, Tianyuan Chen, Zhiming Zheng
The sparsity of reward feedback remains a challenging problem in online deep reinforcement learning (DRL).