Search Results for author: Daan Wout

Found 2 papers, 1 paper with code

Deep Reinforcement Learning with Feedback-based Exploration

2 code implementations • 14 Mar 2019 • Jan Scholten, Daan Wout, Carlos Celemin, Jens Kober

We employ binary corrective feedback as a general and intuitive manner to incorporate human intuition and domain knowledge in model-free machine learning.

Continuous Control • OpenAI Gym • +2

Learning Gaussian Policies from Corrective Human Feedback

No code implementations • 12 Mar 2019 • Daan Wout, Jan Scholten, Carlos Celemin, Jens Kober

We demonstrate that the novel algorithm outperforms the current state-of-the-art in final performance, convergence rate and robustness to erroneous feedback in OpenAI Gym continuous control benchmarks, both for simulated and real human teachers.

Continuous Control • Gaussian Processes • +1
