no code implementations • 7 Dec 2022 • Allan Zhou, Nicholas C. Landolfi, Daniel C. O'Neill
There is considerable interest in predicting the pathogenicity of protein variants in human genes.
no code implementations • 24 Jun 2020 • Erdem Biyik, Dylan P. Losey, Malayandi Palan, Nicholas C. Landolfi, Gleb Shevchuk, Dorsa Sadigh
As designing reward functions can be extremely challenging, a more promising approach is to directly learn reward functions from human teachers.
2 code implementations • 10 Oct 2019 • Erdem Biyik, Malayandi Palan, Nicholas C. Landolfi, Dylan P. Losey, Dorsa Sadigh
Robots can learn the right reward function by querying a human expert.
no code implementations • 11 Jul 2019 • Nicholas C. Landolfi, Garrett Thomas, Tengyu Ma
We then adapt the dynamical model with samples from this policy in the real environment.
1 code implementation • 21 Jun 2019 • Malayandi Palan, Nicholas C. Landolfi, Gleb Shevchuk, Dorsa Sadigh
In a user study, we compare our method to a standard IRL method; we find that users rated the robot trained with DemPref as being more successful at learning their desired behavior, and preferred to use the DemPref system (over IRL) to train the robot.