Search Results for author: Hu Haifeng

Found 1 paper, 1 paper with code

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination

2 code implementations · 22 Dec 2021 · Rui Zhao, Jinming Song, Yufeng Yuan, Hu Haifeng, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei

We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using any human data.

Reinforcement Learning (RL)
