Search Results for author: Maryam Ziaei

Found 1 paper, 1 paper with code

Self Punishment and Reward Backfill for Deep Q-Learning

1 code implementation • 10 Apr 2020 • Mohammad Reza Bonyadi, Rui Wang, Maryam Ziaei

We prove that, under certain assumptions and regardless of the reinforcement learning algorithm used, these two strategies (self punishment and reward backfill) preserve the ordering of all possible policies by total reward and, by extension, preserve the optimal policy.

Atari Games, Q-Learning, +2
