1 code implementation • 26 Sep 2023 • Chenyang Miao, Yunduan Cui, Huiyun Li, Xinyu Wu
It alleviates the inconsistency of multiple agents' policy updates by introducing the relative entropy regularization to the Centralized Training with Decentralized Execution (CTDE) framework with the Actor-Critic (AC) structure.