Search Results for author: Hu Haifeng

Found 1 papers, 1 papers with code

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination

2 code implementations • 22 Dec 2021 • Rui Zhao, Jinming Song, Yufeng Yuan, Hu Haifeng, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei

We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using any human data.

Reinforcement Learning (RL)

26

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.