Search Results for author: Muhammad A. Masood

Found 2 papers, 0 papers with code

Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies

no code implementations31 May 2019 Muhammad A. Masood, Finale Doshi-Velez

Standard reinforcement learning methods aim to master one way of solving a task whereas there may exist multiple near-optimal policies.

Policy Gradient Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.