1 code implementation • 5 Sep 2023 • Junming Yang, Xingguo Chen, Shengyuan Wang, Bolei Zhang
Model-based offline reinforcement learning (RL), which builds a supervised transition model with logging dataset to avoid costly interactions with the online environment, has been a promising approach for offline policy optimization.
no code implementations • 22 Jan 2022 • Guang Yang, Xingguo Chen, Shangdong Yang, Huihui Wang, Shaokang Dong, Yang Gao
Moreover, in learning sparse representations, attention mechanisms are utilized to represent the degree of sparsification, and a smooth attentive function is introduced into the kernel-based VFA.
no code implementations • 25 Nov 2019 • Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao
Traditional methods attempt to use pre-defined rules to capture the interaction relationship between agents.