Search Results for author: Ang A. Li

Found 1 paper, 1 paper with code

Revisiting Prioritized Experience Replay: A Value Perspective

2 code implementations • 5 Feb 2021 • Ang A. Li, Zongqing Lu, Chenglin Miao

Furthermore, we successfully extend our theoretical framework to maximum-entropy RL by deriving the lower and upper bounds of these value metrics for soft Q-learning, which turn out to be the product of $|\text{TD}|$ and "on-policyness" of the experiences.

Atari Games Q-Learning +1
