1 code implementation • 20 Feb 2021 • Mónika Farsang, Luca Szegletes
Proximal Policy Optimization (PPO) is among the most widely used algorithms in reinforcement learning, which achieves state-of-the-art performance in many challenging problems.
no code implementations • 20 Feb 2021 • Mónika Farsang, Luca Szegletes
An in-depth understanding of the particular environment is crucial in reinforcement learning (RL).