Search Results for author: Bozhidar Vasilev

Found 1 paper, 0 papers with code

Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients

no code implementations • 27 Apr 2021 • Bozhidar Vasilev, Tarun Gupta, Bei Peng, Shimon Whiteson

Policy gradient methods are an attractive approach to multi-agent reinforcement learning problems due to their convergence properties and robustness in partially observable scenarios.

Policy Gradient Methods, Reinforcement Learning (RL), +2
