Search Results for author: Yuanyang Zhu

Found 4 papers, 1 papers with code

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization

1 code implementation • 1 Aug 2023 • Junyi Wang, Yuanyang Zhu, Zhi Wang, Yan Zheng, Jianye Hao, Chunlin Chen

Evolutionary reinforcement learning (ERL) algorithms recently raise attention in tackling complex reinforcement learning (RL) problems due to high parallelism, while they are prone to insufficient exploration or model collapse without carefully tuning hyperparameters (aka meta-parameters).

Bilevel Optimization reinforcement-learning +1

Paper
Code

Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning

no code implementations • 12 May 2023 • Qingpeng Zhao, Yuanyang Zhu, Zichuan Liu, Zhi Wang, Chunlin Chen

In cooperative multi-agent reinforcement learning (MARL), the environmental stochasticity and uncertainties will increase exponentially when the number of agents increases, which puts hard pressure on how to come up with a compact latent representation from partial observation for boosting value decomposition.

Multi-agent Reinforcement Learning Starcraft +1

Paper
Add Code

MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees

no code implementations • 15 Sep 2022 • Zichuan Liu, Yuanyang Zhu, Zhi Wang, Yang Gao, Chunlin Chen

While achieving tremendous success in various fields, existing multi-agent reinforcement learning (MARL) with a black-box neural network architecture makes decisions in an opaque manner that hinders humans from understanding the learned knowledge and how input observations influence decisions.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Add Code

Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space Reduction

no code implementations • 15 Apr 2021 • Yuanyang Zhu, Zhi Wang, Chunlin Chen, Daoyi Dong

In this paper, we focus on efficient navigation with the RL technique and combine the advantages of these two kinds of methods into a rule-based RL (RuRL) algorithm for reducing the sample complexity and cost of time.

Navigate reinforcement-learning +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.