23 Feb 2021 • Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko
Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures.
11 Dec 2020 • Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko
A critical problem with the practical utility of controllers trained with deep Reinforcement Learning (RL) is the notable lack of smoothness in the actions learned by the RL policies.