no code implementations • 5 Apr 2020 • Ala'eddin Masadeh, Zhengdao Wang, Ahmed E. Kamal
In AC, an actor optimizes the used policy, while a critic estimates a value function and evaluate the optimized policy by the actor.
reinforcement-learning Reinforcement Learning (RL)