Search Results for author: Yun Hua

Found 4 papers, 2 papers with code

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

1 code implementation • 6 Dec 2023 • Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks?

Benchmarking, Decision Making, +1

VMAgent: Scheduling Simulator for Reinforcement Learning

2 code implementations • 9 Dec 2021 • Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling.

Cloud Computing, reinforcement-learning, +2

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

no code implementations • 9 Feb 2021 • Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan Zha

In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named Rochico, based on reinforced organization control and hierarchical consensus learning.

Multi-agent Reinforcement Learning

HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem

no code implementations • 11 Feb 2020 • Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, Hongyuan Zha

In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward.

Meta-Learning, Meta Reinforcement Learning, +2
