Search Results for author: Zhao-Hua Li

Found 1 papers, 0 papers with code

Neural-to-Tree Policy Distillation with Policy Improvement Criterion

no code implementations16 Aug 2021 Zhao-Hua Li, Yang Yu, Yingfeng Chen, Ke Chen, Zhipeng Hu, Changjie Fan

The empirical results show that the proposed method can preserve a higher cumulative reward than behavior cloning and learn a more consistent policy to the original one.

Decision Making reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.