Search Results for author: Kimon Protopapas

Found 1 papers, 0 papers with code

Policy Mirror Descent with Lookahead

no code implementations21 Mar 2024 Kimon Protopapas, Anas Barakat

In this work, we propose a new class of PMD algorithms called $h$-PMD which incorporates multi-step greedy policy improvement with lookahead depth $h$ to the PMD update rule.

Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.