Search Results for author: José Arjona-Medina

Found 1 papers, 0 papers with code

Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER

no code implementations2 Dec 2020 Markus Holzleitner, Lukas Gruber, José Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter

We prove under commonly used assumptions the convergence of actor-critic reinforcement learning algorithms, which simultaneously learn a policy function, the actor, and a value function, the critic.

Reinforcement Learning (RL) valid

Cannot find the paper you are looking for? You can Submit a new open access paper.