Search Results for author: Shangdong Yang

Found 3 papers, 1 paper with code

Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient

1 code implementation • 10 Oct 2022 • Wubing Chen, Wenbin Li, Xiao Liu, Shangdong Yang, Yang Gao

Empirically, we evaluate MAPPG on the well-known matrix game and differential game, and verify that MAPPG can converge to the global optimum for both discrete and continuous action spaces.

Multi-agent Reinforcement Learning, reinforcement-learning, +3

Keeping Minimal Experience to Achieve Efficient Interpretable Policy Distillation

no code implementations • 2 Mar 2022 • Xiao Liu, Shuyang Liu, Wenbin Li, Shangdong Yang, Yang Gao

Although deep reinforcement learning has become a universal solution for complex control tasks, its real-world applicability is still limited because its policies lack security guarantees.

Online Attentive Kernel-Based Temporal Difference Learning

no code implementations • 22 Jan 2022 • Guang Yang, Xingguo Chen, Shangdong Yang, Huihui Wang, Shaokang Dong, Yang Gao

Moreover, in learning sparse representations, attention mechanisms are utilized to represent the degree of sparsification, and a smooth attentive function is introduced into the kernel-based VFA.

Acrobot, Reinforcement Learning (RL)
