Search Results for author: Shihong Deng

Found 4 papers, 1 papers with code

Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play

no code implementations7 Mar 2023 Wei Xi, Yongxin Zhang, Changnan Xiao, Xuefeng Huang, Shihong Deng, Haowei Liang, Jie Chen, Peng Sun

Deep Reinforcement Learning combined with Fictitious Play shows impressive results on many benchmark games, most of which are, however, single-stage.

Decision Making

An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning

no code implementations1 Jun 2021 Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng

We find valued-based reinforcement learning methods with {\epsilon}-greedy mechanism are capable of enjoying three characteristics, Closed-form Diversity, Objective-invariant Exploration and Adaptive Trade-off, which help value-based methods avoid the policy collapse problem.

Atari Games reinforcement-learning +1

CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration

no code implementations9 May 2021 Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng, Haiyan Yin

We study the problem of model-free reinforcement learning, which is often solved following the principle of Generalized Policy Iteration (GPI).

Atari Games

Hierarchical Meta Reinforcement Learning for Multi-Task Environments

1 code implementation1 Jan 2021 Dongyang Zhao, Yue Huang, Changnan Xiao, Yue Li, Shihong Deng

To address the problem brought by the environment, we propose a Meta Soft Hierarchical reinforcement learning framework (MeSH), in which each low-level sub-policy focuses on a specific sub-task respectively and high-level policy automatically learns to utilize low-level sub-policies through meta-gradients.

Hierarchical Reinforcement Learning Meta Reinforcement Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.