1 code implementation • 24 Jan 2022 • Zhiwei Jia, Kaixiang Lin, Yizhou Zhao, Qiaozi Gao, Govind Thattai, Gaurav Sukhatme
Recent years have witnessed an emerging paradigm shift toward embodied artificial intelligence, in which an agent must learn to solve challenging tasks by interacting with its environment.
no code implementations • 28 Oct 2021 • Nicholas Roy, Ingmar Posner, Tim Barfoot, Philippe Beaudoin, Yoshua Bengio, Jeannette Bohg, Oliver Brock, Isabelle Depatie, Dieter Fox, Dan Koditschek, Tomas Lozano-Perez, Vikash Mansinghka, Christopher Pal, Blake Richards, Dorsa Sadigh, Stefan Schaal, Gaurav Sukhatme, Denis Therien, Marc Toussaint, Michiel Van de Panne
Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains.
1 code implementation • 10 Aug 2021 • Alessandro Suglia, Qiaozi Gao, Jesse Thomason, Govind Thattai, Gaurav Sukhatme
Language-guided robots performing home and office tasks must navigate in and interact with the world.
no code implementations • 24 Jun 2021 • K. R. Zentner, Ryan Julian, Ujjwal Puri, Yulun Zhang, Gaurav Sukhatme
We explore possible methods for multi-task transfer learning which seek to exploit the shared physical structure of robotics tasks.
1 code implementation • ICML 2020 • Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun
In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation.
no code implementations • 25 Sep 2019 • Sarah Bechtle, Artem Molchanov, Yevgen Chebotar, Edward Grefenstette, Ludovic Righetti, Gaurav Sukhatme, Franziska Meier
We present a meta-learning method for learning parametric loss functions that can generalize across different tasks and model architectures.
1 code implementation • 12 Jun 2019 • Sarah Bechtle, Artem Molchanov, Yevgen Chebotar, Edward Grefenstette, Ludovic Righetti, Gaurav Sukhatme, Franziska Meier
This information shapes the learned loss function such that the environment does not need to provide this information during meta-test time.
no code implementations • 4 Jan 2019 • Shoubhik Debnath, Gaurav Sukhatme, Lantao Liu
Then, we leverage this approximate model along with a notion of reachability using Mean First Passage Times to perform Model-Based reinforcement learning.
no code implementations • 4 Jan 2019 • Shoubhik Debnath, Lantao Liu, Gaurav Sukhatme
A new mechanism for efficiently solving the Markov decision processes (MDPs) is proposed in this paper.
no code implementations • 3 Jan 2019 • Shoubhik Debnath, Lantao Liu, Gaurav Sukhatme
The solution convergence of Markov Decision Processes (MDPs) can be accelerated by prioritized sweeping of states ranked by their potential impacts to other states.
1 code implementation • 4 Oct 2018 • Zhanpeng He, Ryan Julian, Eric Heiden, Hejia Zhang, Stefan Schaal, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman
We complete unseen tasks by choosing new sequences of skill latents to control the robot using MPC, where our MPC model is composed of the pre-trained skill policy executed in the simulation environment, run in parallel with the real robot.
1 code implementation • 26 Sep 2018 • Ryan Julian, Eric Heiden, Zhanpeng He, Hejia Zhang, Stefan Schaal, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman
In particular, we first use simulation to jointly learn a policy for a set of low-level skills, and a "skill embedding" parameterization which can be used to compose them.
no code implementations • 4 Jul 2018 • Artem Molchanov, Karol Hausman, Stan Birchfield, Gaurav Sukhatme
In this work, we introduce a method based on region-growing that allows learning in an environment with any pair of initial and goal states.
no code implementations • NeurIPS 2017 • Karol Hausman, Yevgen Chebotar, Stefan Schaal, Gaurav Sukhatme, Joseph Lim
Imitation learning has traditionally been applied to learn a single task from demonstrations thereof.
no code implementations • 13 Apr 2016 • Jeannette Bohg, Karol Hausman, Bharath Sankaran, Oliver Brock, Danica Kragic, Stefan Schaal, Gaurav Sukhatme
Recent approaches in robotics follow the insight that perception is facilitated by interaction with the environment.
Robotics
no code implementations • 9 Aug 2014 • Jie Chen, Kian Hsiang Low, Colin Keng-Yan Tan, Ali Oran, Patrick Jaillet, John Dolan, Gaurav Sukhatme
The problem of modeling and predicting spatiotemporal traffic phenomena over an urban road network is important to many traffic applications such as detecting and forecasting congestion hotspots.