Search Results for author: Kenny Song

Found 2 papers, 0 papers with code

Efficient Entropy for Policy Gradient with Multidimensional Action Space

no code implementations • 2 Jun 2018 • Yiming Zhang, Quan Ho Vuong, Kenny Song, Xiao-Yue Gong, Keith W. Ross

We develop several novel unbiased estimators for the entropy bonus and its gradient.

Paper
Add Code

Policy Gradient For Multidimensional Action Spaces: Action Sampling and Entropy Bonus

no code implementations • ICLR 2018 • Vuong Ho Quan, Yiming Zhang, Kenny Song, Xiao-Yue Gong, Keith W. Ross

In the case of high-dimensional action spaces, calculating the entropy and the gradient of the entropy requires enumerating all the actions in the action space and running forward and backpropagation for each action, which may be computationally infeasible.

Atari Games reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.