Search Results for author: KaiChiu Wong

Found 1 papers, 1 papers with code

Weak Human Preference Supervision For Deep Reinforcement Learning

1 code implementation25 Jul 2020 Zehong Cao, KaiChiu Wong, Chin-Teng Lin

The current reward learning from human preferences could be used to resolve complex reinforcement learning (RL) tasks without access to a reward function by defining a single fixed preference between pairs of trajectory segments.

MuJoCo Games reinforcement-learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.