no code implementations • 27 Feb 2023 • Vivek Myers, Erdem Biyik, Dorsa Sadigh
Robot policies need to adapt to human preferences and/or new environments.
1 code implementation • 21 Oct 2021 • Vivek Myers, Nikhil Sardana
This problem setting can be extended to the Bayesian context, wherein rather than predicting a single label for each query data point, a model predicts a distribution of labels capturing its uncertainty.
no code implementations • 27 Sep 2021 • Vivek Myers, Erdem Biyik, Nima Anari, Dorsa Sadigh
However, expert feedback is often assumed to be drawn from an underlying unimodal reward function.
1 code implementation • 20 Jul 2020 • Vivek Myers, Peyton Greenside
As modern data sets grow, so does the need to scale active search to large data sets and batch sizes.