no code implementations • 14 Jan 2024 • Xiao Liu, Jie Zhao, Wubing Chen, Mao Tan, Yongxing Su
To address this issue, we propose a novel self-interpretable structure, named Backbone Extract Tree (BET), to better explain the agent's behavior by identifying the error-prone states.
no code implementations • 12 Sep 2023 • Xiao Liu, Wubing Chen, Mao Tan
We then design a fidelity-induced mechanism by integrating a fidelity measurement into the reinforcement learning feedback.
1 code implementation • 10 Oct 2022 • Wubing Chen, Wenbin Li, Xiao Liu, Shangdong Yang, Yang Gao
Empirically, we evaluate MAPPG on the well-known matrix game and differential game, and verify that MAPPG can converge to the global optimum for both discrete and continuous action spaces.
Multi-agent Reinforcement Learning reinforcement-learning +3