Search Results for author: Wanpeng Zhang

Found 10 papers, 2 papers with code

Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

no code implementations5 Jun 2023 Wanpeng Zhang, Yilin Li, Boyu Yang, Zongqing Lu

COREP primarily employs a guided updating mechanism to learn a stable graph representation for states termed as causal-origin representation.

reinforcement-learning

Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control

no code implementations26 Aug 2021 Wanpeng Zhang, Xiaoyan Cao, Yao Yao, Zhicheng An, Xi Xiao, Dijun Luo

In this paper, we present a model-based robust RL framework for autonomous greenhouse control to meet the sample efficiency and safety challenges.

Decision Making Model-based Reinforcement Learning +2

Model-Based Opponent Modeling

no code implementations4 Aug 2021 Xiaopeng Yu, Jiechuan Jiang, Wanpeng Zhang, Haobin Jiang, Zongqing Lu

When one agent interacts with a multi-agent environment, it is challenging to deal with various opponents unseen before.

MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning

no code implementations3 Aug 2021 Wanpeng Zhang, Xi Xiao, Yao Yao, Mingzhe Chen, Dijun Luo

MBDP consists of two kinds of dropout mechanisms, where the rollout-dropout aims to improve the robustness with a small cost of sample efficiency, while the model-dropout is designed to compensate for the lost efficiency at a slight expense of robustness.

Model-based Reinforcement Learning

IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control

1 code implementation6 Jul 2021 Xiaoyan Cao, Yao Yao, Lanqing Li, Wanpeng Zhang, Zhicheng An, Zhong Zhang, Li Xiao, Shihui Guo, Xiaoyu Cao, Meihong Wu, Dijun Luo

However, the optimal control of autonomous greenhouses is challenging, requiring decision-making based on high-dimensional sensory data, and the scaling of production is limited by the scarcity of labor capable of handling this task.

Cloud Computing Decision Making

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation

1 code implementation5 Jul 2021 Yao Yao, Li Xiao, Zhicheng An, Wanpeng Zhang, Dijun Luo

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics.

Continuous Control reinforcement-learning +1

Deceptive Opponent Modeling with Proactive Network Interdiction for Stochastic Goal Recognition Control

no code implementations25 Sep 2019 Junren Luo, Wei Gao, Zhiyong Liao, Weilin Yuan, Wanpeng Zhang, Shaofei Chen

Goal recognition based on the observations of the behaviors collected online has been used to model some potential applications.

Cannot find the paper you are looking for? You can Submit a new open access paper.