Search Results for author: Yiqin Yang

Found 8 papers, 4 papers with code

No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning

1 code implementation11 Dec 2023 Dianyu Zhong, Yiqin Yang, Qianchuan Zhao

The large action space is one fundamental obstacle to deploying Reinforcement Learning methods in the real world.

reinforcement-learning

Learning Diverse Risk Preferences in Population-based Self-play

1 code implementation19 May 2023 Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao

In this paper, we aim to introduce diversity from the perspective that agents could have diverse risk preferences in the face of uncertainty.

reinforcement-learning Reinforcement Learning (RL)

The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning

no code implementations27 Feb 2023 Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang

Self-supervised methods have become crucial for advancing deep learning by leveraging data itself to reduce the need for expensive annotations.

Offline RL reinforcement-learning +1

On the Role of Discount Factor in Offline Reinforcement Learning

no code implementations7 Jun 2022 Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang

The discount factor, $\gamma$, plays a vital role in improving online RL sample efficiency and estimation accuracy, but the role of the discount factor in offline RL is not well explored.

D4RL Offline RL +2

Offline Reinforcement Learning with Value-based Episodic Memory

1 code implementation ICLR 2022 Xiaoteng Ma, Yiqin Yang, Hao Hu, Qihan Liu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang

Offline reinforcement learning (RL) shows promise of applying RL to real-world problems by effectively utilizing previously collected data.

D4RL Offline RL +2

Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning

no code implementations10 Feb 2021 Xiaoteng Ma, Yiqin Yang, Chenghao Li, Yiwen Lu, Qianchuan Zhao, Yang Jun

Value-based methods of multi-agent reinforcement learning (MARL), especially the value decomposition methods, have been demonstrated on a range of challenging cooperative tasks.

Continuous Control Multi-agent Reinforcement Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.