no code implementations • 24 Nov 2023 • Zhuoying Chen, Huiping Li, Zhaoxu Wang
Prioritized Experience Replay (PER) enables the model to learn more about relatively important samples by artificially changing their accessed frequencies.
no code implementations • 1 Nov 2023 • Rizhong Wang, Huiping Li, Di Cui, Demin Xu
Once a joint policy is obtained, it is critical to design a value function factorization method to extract optimal decentralized policies for the agents, which needs to satisfy the individual-global-max (IGM) principle.
no code implementations • 13 Sep 2023 • Zhuoying Chen, Huiping Li, Rizhong Wang
Prioritized Experience Replay (PER) is a technical means of deep reinforcement learning by selecting experience samples with more knowledge quantity to improve the training rate of neural network.
no code implementations • 10 Dec 2021 • Haojiao Liang, Huiping Li, Jian Gao, Rongxin Cui, Demin Xu
Energy efficiency and safety are two critical objectives for marine vehicles operating in environments with obstacles, and they generally conflict with each other.
no code implementations • 21 Apr 2021 • Fei Li, Huiping Li, Yuyao He
Secondly, by utilizing the probabilistic positively invariant set, the probabilistically resolvable constant tube-based stochastic model predictive control algorithm is developed by employing the constantly tightened constraints in the entire prediction horizons.