no code implementations • 7 Feb 2020 • Gilwoo Lee, Brian Hou, Sanjiban Choudhury, Siddhartha S. Srinivasa
We first obtain an ensemble of experts, one for each latent MDP, and fuse their advice to compute a baseline policy.
no code implementations • 24 Oct 2018 • Lerrel Pinto, Aditya Mandalika, Brian Hou, Siddhartha Srinivasa
This paper proposes a sample-efficient yet simple approach to learning closed-loop policies for nonprehensile manipulation.
no code implementations • 6 Oct 2018 • Gilwoo Lee, Sanjiban Choudhury, Brian Hou, Siddhartha S. Srinivasa
We present the first PAC optimal algorithm for Bayes-Adaptive Markov Decision Processes (BAMDPs) in continuous state and action spaces, to the best of our knowledge.
no code implementations • ICLR 2019 • Gilwoo Lee, Brian Hou, Aditya Mandalika, Jeongseok Lee, Sanjiban Choudhury, Siddhartha S. Srinivasa
Addressing uncertainty is critical for autonomous systems to robustly adapt to the real world.