1 code implementation • 25 Aug 2022 • Hua Zheng, Wei Xie, M. Ben Feng
For reinforcement learning on complex stochastic systems where many factors dynamically impact the output trajectories, it is desirable to effectively leverage the information from historical samples collected in previous iterations to accelerate policy optimization.
1 code implementation • 17 Oct 2021 • Hua Zheng, Wei Xie, M. Ben Feng
For reinforcement learning on complex stochastic systems, it is desirable to effectively leverage the information from historical samples collected in previous iterations to accelerate policy optimization.