no code implementations • 30 Sep 2021 • Yimin Shi
Deep Reinforcement Learning (DRL) sometimes needs a large amount of data to converge in the training procedure and in some cases, each action of the agent may produce regret.
no code implementations • 29 Apr 2020 • Yunlian Lv, Ning Xie, Yimin Shi, Zijiao Wang, Heng Tao Shen
On the other hand, TSE module is used to generate sub-targets which allow agent to learn from failures.