no code implementations • 9 Feb 2023 • Sheng Yue, Guanbo Wang, Wei Shao, Zhaofeng Zhang, Sen Lin, Ju Ren, Junshan Zhang
This work aims to tackle a major challenge in offline Inverse Reinforcement Learning (IRL), namely the reward extrapolation error, where the learned reward function may fail to explain the task correctly and misguide the agent in unseen environments due to the intrinsic covariate shift.
no code implementations • 14 Aug 2021 • Sheng Yue, Ju Ren, Jiang Xin, Deyu Zhang, Yaoxue Zhang, Weihua Zhuang
After that, we formulate a resource allocation problem integrating NUFM in multi-access wireless systems to jointly improve the convergence rate and minimize the wall-clock time along with energy cost.
no code implementations • 16 Dec 2020 • Sheng Yue, Ju Ren, Jiang Xin, Sen Lin, Junshan Zhang
To overcome these challenges, we explore continual edge learning capable of leveraging the knowledge transfer from previous tasks.