Search Results for author: Daan Wout

Found 2 papers, 1 papers with code

Learning Gaussian Policies from Corrective Human Feedback

no code implementations12 Mar 2019 Daan Wout, Jan Scholten, Carlos Celemin, Jens Kober

We demonstrate that the novel algorithm outperforms the current state-of-the-art in final performance, convergence rate and robustness to erroneous feedback in OpenAI Gym continuous control benchmarks, both for simulated and real human teachers.

Continuous Control Gaussian Processes +1

Deep Reinforcement Learning with Feedback-based Exploration

2 code implementations14 Mar 2019 Jan Scholten, Daan Wout, Carlos Celemin, Jens Kober

We employ binary corrective feedback as a general and intuitive manner to incorporate human intuition and domain knowledge in model-free machine learning.

Continuous Control OpenAI Gym +2

Cannot find the paper you are looking for? You can Submit a new open access paper.