1 code implementation • 1 Oct 2024 • Aaron Walsman, Muru Zhang, Adam Fishman, Ali Farhadi, Dieter Fox
By disassembling an unseen assembly and periodically saving images of it, the agent is able to create a set of instructions so that it has the information necessary to rebuild it.
no code implementations • 29 Sep 2024 • Soofiyan Atar, Yi Li, Markus Grotz, Michael Wolf, Dieter Fox, Joshua Smith
In warehouse environments, robots require robust picking capabilities to manage a wide variety of objects.
no code implementations • 29 Jun 2024 • Markus Grotz, Mohit Shridhar, Tamim Asfour, Dieter Fox
To kickstart the benchmark, we extended several state-of-the art methods to bimanual manipulation and also present a language-conditioned behavioral cloning agent -- PerAct2, which enables the learning and execution of bimanual 6-DoF manipulation tasks.
no code implementations • 27 Jun 2024 • Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna
Large-scale endeavors like and widespread community efforts such as Open-X-Embodiment have contributed to growing the scale of robot demonstration data.
no code implementations • 26 Jun 2024 • Shengyi Qian, Kaichun Mo, Valts Blukis, David F. Fouhey, Dieter Fox, Ankit Goyal
Our results suggest that 3D-aware pretraining is a promising approach to improve sample efficiency and generalization of vision-based robotic manipulation policies.
no code implementations • 15 Jun 2024 • Wentao Yuan, Jiafei Duan, Valts Blukis, Wilbert Pumacay, Ranjay Krishna, Adithyavairavan Murali, Arsalan Mousavian, Dieter Fox
In spite of the recent adoption of vision language models (VLMs) to control robot behavior, VLMs struggle to precisely articulate robot actions using language.
1 code implementation • 12 Jun 2024 • Ankit Goyal, Valts Blukis, Jie Xu, Yijie Guo, Yu-Wei Chao, Dieter Fox
In this work, we study how to build a robotic system that can solve multiple 3D manipulation tasks given language instructions.
Ranked #3 on Robot Manipulation Generalization on GEMBench
no code implementations • 19 May 2024 • Zoey Chen, Aaron Walsman, Marius Memmel, Kaichun Mo, Alex Fang, Karthikeya Vemuri, Alan Wu, Dieter Fox, Abhishek Gupta
We present an integrated end-to-end pipeline that generates simulation scenes complete with articulated kinematic and dynamic structures from real-world images and use these for training robotic control policies.
no code implementations • 2 May 2024 • Ryan Hoque, Ajay Mandlekar, Caelan Garrett, Ken Goldberg, Dieter Fox
Imitation learning is a promising paradigm for training robot control policies, but these policies can suffer from distribution shift, where the conditions at evaluation time differ from those in the training data.
no code implementations • 18 Apr 2024 • Marius Memmel, Andrew Wagenmaker, Chuning Zhu, Patrick Yin, Dieter Fox, Abhishek Gupta
In this work, we propose a learning system that can leverage a small amount of real-world data to autonomously refine a simulation model and then plan an accurate control strategy that can be deployed in the real world.
no code implementations • 11 Apr 2024 • Tongzhou Mu, Yijie Guo, Jie Xu, Ankit Goyal, Hao Su, Dieter Fox, Animesh Garg
Encouraged by the remarkable achievements of language and vision foundation models, developing generalist robotic agents through imitation learning, using large demonstration datasets, has become a prominent area of interest in robot learning.
1 code implementation • CVPR 2024 • Yijia Weng, Bowen Wen, Jonathan Tremblay, Valts Blukis, Dieter Fox, Leonidas Guibas, Stan Birchfield
We address the problem of building digital twins of unknown articulated objects from two RGBD scans of the object at different articulation states.
1 code implementation • 13 Feb 2024 • Wilbert Pumacay, Ishika Singh, Jiafei Duan, Ranjay Krishna, Jesse Thomason, Dieter Fox
To realize effective large-scale, real-world robotic applications, we must evaluate how well our robot policies adapt to changes in environmental conditions.
Ranked #1 on Robot Manipulation Generalization on The COLOSSEUM
no code implementations • 4 Nov 2023 • Yi Li, Muru Zhang, Markus Grotz, Kaichun Mo, Dieter Fox
Segmentation and tracking of unseen object instances in discrete frames pose a significant challenge in dynamic industrial robotic contexts, such as distribution warehouses.
no code implementations • 2 Nov 2023 • Wentao Yuan, Adithyavairavan Murali, Arsalan Mousavian, Dieter Fox
With the advent of large language models and large-scale robotic datasets, there has been tremendous progress in high-level decision-making for object manipulation.
2 code implementations • 26 Oct 2023 • Ajay Mandlekar, Soroush Nasiriany, Bowen Wen, Iretiayo Akinola, Yashraj Narang, Linxi Fan, Yuke Zhu, Dieter Fox
Imitation learning from a large set of human demonstrations has proved to be an effective paradigm for building capable robot agents.
no code implementations • 24 Oct 2023 • Ajay Mandlekar, Caelan Garrett, Danfei Xu, Dieter Fox
Finally, we collected 2. 1K demos with HITL-TAMP across 12 contact-rich, long-horizon tasks and show that the system often produces near-perfect agents.
no code implementations • 10 Oct 2023 • Yi Ru Wang, Jiafei Duan, Dieter Fox, Siddhartha Srinivasa
To address this gap, we introduce NEWTON, a repository and benchmark for evaluating the physics reasoning skills of LLMs.
no code implementations • 10 Jul 2023 • Yuzhe Qin, Wei Yang, Binghao Huang, Karl Van Wyk, Hao Su, Xiaolong Wang, Yu-Wei Chao, Dieter Fox
For real-world experiments, AnyTeleop can outperform a previous system that was designed for a specific robot hardware with a higher success rate, using the same robot.
no code implementations • 10 Jul 2023 • Anthony Simeonov, Ankit Goyal, Lucas Manuelli, Lin Yen-Chen, Alina Sarmiento, Alberto Rodriguez, Pulkit Agrawal, Dieter Fox
We propose a system for rearranging objects in a scene to achieve a desired object-scene placing relationship, such as a book inserted in an open slot of a bookshelf.
1 code implementation • 26 Jun 2023 • Ankit Goyal, Jie Xu, Yijie Guo, Valts Blukis, Yu-Wei Chao, Dieter Fox
In simulations, we find that a single RVT model works well across 18 RLBench tasks with 249 task variations, achieving 26% higher relative success than the existing state-of-the-art method (PerAct).
Ranked #6 on Robot Manipulation on RLBench
no code implementations • 23 Jun 2023 • Jiafei Duan, Yi Ru Wang, Mohit Shridhar, Dieter Fox, Ranjay Krishna
By contrast, we introduce AR2-D2: a system for collecting demonstrations which (1) does not require people with specialized training, (2) does not require any real robots during data collection, and therefore, (3) enables manipulation of diverse objects with a real robot.
no code implementations • 22 Jun 2023 • Xiaolin Fang, Caelan Reed Garrett, Clemens Eppner, Tomás Lozano-Pérez, Leslie Pack Kaelbling, Dieter Fox
Task and Motion Planning (TAMP) approaches are effective at planning long-horizon autonomous robot manipulation.
no code implementations • 25 May 2023 • Murtaza Dalal, Ajay Mandlekar, Caelan Garrett, Ankur Handa, Ruslan Salakhutdinov, Dieter Fox
In this work, we show that the combination of large-scale datasets generated by TAMP supervisors and flexible Transformer models to fit them is a powerful paradigm for robot manipulation.
no code implementations • 18 Apr 2023 • Adithyavairavan Murali, Arsalan Mousavian, Clemens Eppner, Adam Fishman, Dieter Fox
CabiNet is a collision model that accepts object and scene point clouds, captured from a single-view depth observation, and predicts collisions for SE(3) object poses in the scene.
no code implementations • 3 Apr 2023 • Fan-Yun Sun, Jonathan Tremblay, Valts Blukis, Kevin Lin, Danfei Xu, Boris Ivanovic, Peter Karkus, Stan Birchfield, Dieter Fox, Ruohan Zhang, Yunzhu Li, Jiajun Wu, Marco Pavone, Nick Haber
At inference, given one or more views of a novel real-world object, FINV first finds a set of latent codes for the object by inverting the generative model from multiple initial seeds.
no code implementations • CVPR 2023 • Sammy Christen, Wei Yang, Claudia Pérez-D'Arpino, Otmar Hilliges, Dieter Fox, Yu-Wei Chao
We propose the first framework to learn control policies for vision-based human-to-robot handovers, a critical task for human-robot interaction.
1 code implementation • CVPR 2023 • Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Muller, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield
We present a near real-time method for 6-DoF tracking of an unknown object from a monocular RGBD video sequence, while simultaneously performing neural 3D reconstruction of the object.
1 code implementation • 13 Dec 2022 • Yann Labbé, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox, Josef Sivic
Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner.
no code implementations • 3 Nov 2022 • Zhutian Yang, Caelan Reed Garrett, Tomás Lozano-Pérez, Leslie Kaelbling, Dieter Fox
The core of our algorithm is PIGINet, a novel Transformer-based learning method that takes in a task plan, the goal, and the initial state, and predicts the probability of finding motion trajectories associated with the task plan.
2 code implementations • 25 Oct 2022 • Ankur Handa, Arthur Allshire, Viktor Makoviychuk, Aleksei Petrenko, Ritvik Singh, Jingzhou Liu, Denys Makoviichuk, Karl Van Wyk, Alexander Zhurkevich, Balakumar Sundaralingam, Yashraj Narang, Jean-Francois Lafleche, Dieter Fox, Gavriel State
Our policies are trained to adapt to a wide range of conditions in simulation.
no code implementations • 21 Oct 2022 • Valts Blukis, Taeyeop Lee, Jonathan Tremblay, Bowen Wen, In So Kweon, Kuk-Jin Yoon, Dieter Fox, Stan Birchfield
At test-time, we build the representation from a single RGB input image observing the scene from only one viewpoint.
1 code implementation • 21 Oct 2022 • Adam Fishman, Adithyavairan Murali, Clemens Eppner, Bryan Peele, Byron Boots, Dieter Fox
Collision-free motion generation in unknown environments is a core building block for robot manipulation.
no code implementations • 28 Sep 2022 • Zoey Qiuyu Chen, Karl Van Wyk, Yu-Wei Chao, Wei Yang, Arsalan Mousavian, Abhishek Gupta, Dieter Fox
The policy learned from our dataset can generalize well on unseen object poses in both simulation and the real world
no code implementations • 22 Sep 2022 • Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg
To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information.
1 code implementation • 12 Sep 2022 • Mohit Shridhar, Lucas Manuelli, Dieter Fox
With this formulation, we train a single multi-task Transformer for 18 RLBench tasks (with 249 variations) and 7 real-world tasks (with 18 variations) from just a few demonstrations per task.
Ranked #7 on Robot Manipulation on RLBench
2 code implementations • 27 Jul 2022 • Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox
We pair this simulator with a new dataset of fan-made LEGO creations that have been uploaded to the internet in order to provide complex scenes containing over a thousand unique brick shapes.
no code implementations • 29 Jun 2022 • Yun-Chun Chen, Adithyavairavan Murali, Balakumar Sundaralingam, Wei Yang, Animesh Garg, Dieter Fox
The pipeline of current robotic pick-and-place methods typically consists of several stages: grasp pose detection, finding inverse kinematic solutions for the detected poses, planning a collision-free trajectory, and then executing the open-loop trajectory to the grasp pose with a low-level tracking controller.
no code implementations • 19 May 2022 • Yu-Wei Chao, Chris Paxton, Yu Xiang, Wei Yang, Balakumar Sundaralingam, Tao Chen, Adithyavairavan Murali, Maya Cakmak, Dieter Fox
We analyze the performance of a set of baselines and show a correlation with a real-world evaluation.
1 code implementation • 7 May 2022 • Yashraj Narang, Kier Storey, Iretiayo Akinola, Miles Macklin, Philipp Reist, Lukasz Wawrzyniak, Yunrong Guo, Adam Moravanszky, Gavriel State, Michelle Lu, Ankur Handa, Dieter Fox
We aim for Factory to open the doors to using simulation for robotic assembly, as well as many other contact-rich applications in robotics.
no code implementations • 11 Apr 2022 • Pratyusha Sharma, Balakumar Sundaralingam, Valts Blukis, Chris Paxton, Tucker Hermans, Antonio Torralba, Jacob Andreas, Dieter Fox
In this paper, we explore natural language as an expressive and flexible tool for robot correction.
no code implementations • 31 Mar 2022 • Wei Yang, Balakumar Sundaralingam, Chris Paxton, Iretiayo Akinola, Yu-Wei Chao, Maya Cakmak, Dieter Fox
However, how to responsively generate smooth motions to take an object from a human is still an open question.
no code implementations • 19 Mar 2022 • Eric Heiden, Miles Macklin, Yashraj Narang, Dieter Fox, Animesh Garg, Fabio Ramos
In this work, we present DiSECt: the first differentiable simulator for cutting soft materials.
no code implementations • CVPR 2022 • Ankit Goyal, Arsalan Mousavian, Chris Paxton, Yu-Wei Chao, Brian Okorn, Jia Deng, Dieter Fox
Accurate object rearrangement from vision is a crucial problem for a wide variety of real-world robotics applications in unstructured environments.
1 code implementation • 31 Dec 2021 • Xinke Deng, Junyi Geng, Timothy Bretl, Yu Xiang, Dieter Fox
The auto-encoder can be used in a particle filter framework to estimate and track 6D poses of objects in a category.
no code implementations • 9 Dec 2021 • Rika Antonova, Jingyun Yang, Priya Sundaresan, Dieter Fox, Fabio Ramos, Jeannette Bohg
Deformable object manipulation remains a challenging task in robotics research.
no code implementations • 9 Nov 2021 • Andreea Bobu, Chris Paxton, Wei Yang, Balakumar Sundaralingam, Yu-Wei Chao, Maya Cakmak, Dieter Fox
Second, we treat this low-dimensional concept as an automatic labeler to synthesize a large-scale high-dimensional data set with the simulator.
no code implementations • 28 Oct 2021 • Nicholas Roy, Ingmar Posner, Tim Barfoot, Philippe Beaudoin, Yoshua Bengio, Jeannette Bohg, Oliver Brock, Isabelle Depatie, Dieter Fox, Dan Koditschek, Tomas Lozano-Perez, Vikash Mansinghka, Christopher Pal, Blake Richards, Dorsa Sadigh, Stefan Schaal, Gaurav Sukhatme, Denis Therien, Marc Toussaint, Michiel Van de Panne
Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains.
no code implementations • 19 Oct 2021 • Weiyu Liu, Chris Paxton, Tucker Hermans, Dieter Fox
Geometric organization of objects into semantically meaningful arrangements pervades the built world.
1 code implementation • 5 Oct 2021 • Michael Lutter, Boris Belousov, Shie Mannor, Dieter Fox, Animesh Garg, Jan Peters
Especially for continuous control, solving this differential equation and its extension the Hamilton-Jacobi-Isaacs equation, is important as it yields the optimal policy that achieves the maximum reward on a give task.
no code implementations • 29 Sep 2021 • Melissa Mozifian, Dieter Fox, David Meger, Fabio Ramos, Animesh Garg
In this paper, we consider the problem of continuous control for various robot manipulation tasks with an explicit representation that promotes skill reuse while learning multiple tasks, related through the reward function.
1 code implementation • 24 Sep 2021 • Mohit Shridhar, Lucas Manuelli, Dieter Fox
We even learn one multi-task policy for 10 simulated and 9 real-world tasks that is better or comparable to single-task policies.
1 code implementation • 8 Sep 2021 • Wentao Yuan, Chris Paxton, Karthik Desingh, Dieter Fox
Sequential manipulation tasks require a robot to perceive the state of an environment and plan a sequence of actions leading to a desired goal state.
1 code implementation • 26 Aug 2021 • Chris Paxton, Chris Xie, Tucker Hermans, Dieter Fox
We further demonstrate the ability of our planner to generate and execute diverse manipulation plans through a set of real-world experiments with a variety of objects.
1 code implementation • 12 Jul 2021 • Valts Blukis, Chris Paxton, Dieter Fox, Animesh Garg, Yoav Artzi
Natural language provides an accessible and expressive interface to specify long-term tasks for robotic agents.
1 code implementation • 9 Jul 2021 • Rika Antonova, Fabio Ramos, Rafael Possas, Dieter Fox
This paper outlines BayesSimIG: a library that provides an implementation of BayesSim integrated with the recently released NVIDIA IsaacGym.
no code implementations • 7 Jul 2021 • Junha Roh, Karthik Desingh, Ali Farhadi, Dieter Fox
Specifically, given a reconstructed 3D scene in the form of point clouds with 3D bounding boxes of potential object candidates, and a language utterance referring to a target object in the scene, our model successfully identifies the target object from a set of potential candidates.
1 code implementation • 29 Jun 2021 • Christopher Xie, Arsalan Mousavian, Yu Xiang, Dieter Fox
We postulate that a network architecture that encodes relations between objects at a high-level can be beneficial.
no code implementations • 2 Jun 2021 • Ahmed H. Qureshi, Arsalan Mousavian, Chris Paxton, Michael C. Yip, Dieter Fox
We propose NeRP (Neural Rearrangement Planning), a deep learning based approach for multi-step neural object rearrangement planning which works with never-before-seen objects, that is trained on simulation data, and generalizes to the real world.
1 code implementation • 25 May 2021 • Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg
The adversarial perturbations encourage a optimal policy that is robust to changes in the dynamics.
1 code implementation • 10 May 2021 • Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg
This algorithm enables dynamic programming for continuous states and actions with a known dynamics model.
2 code implementations • CVPR 2021 • Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S. Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, Jan Kautz, Dieter Fox
We introduce DexYCB, a new dataset for capturing hand grasping of objects.
1 code implementation • CVPR 2021 • Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox
Key to our approach is a local implicit neural representation built on ray-voxel pairs that allows our method to generalize to unseen objects and achieve fast inference speed.
1 code implementation • 25 Mar 2021 • Martin Sundermeyer, Arsalan Mousavian, Rudolph Triebel, Dieter Fox
Our novel grasp representation treats 3D points of the recorded point cloud as potential grasp contacts.
no code implementations • 24 Dec 2020 • M. Asif Rana, Anqi Li, Dieter Fox, Sonia Chernova, Byron Boots, Nathan Ratliff
The policy structure provides the user an interface to 1) specifying the spaces that are directly relevant to the completion of the tasks, and 2) designing policies for certain tasks that do not need to be learned.
no code implementations • 7 Dec 2020 • Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White
This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and System" conference.
1 code implementation • 21 Nov 2020 • Michael Danielczuk, Arsalan Mousavian, Clemens Eppner, Dieter Fox
The learned model outperforms both traditional pipelines and learned ablations by 9. 8% in accuracy on a dataset of simulated collision queries and is 75x faster than the best-performing baseline.
2 code implementations • 18 Nov 2020 • Clemens Eppner, Arsalan Mousavian, Dieter Fox
We introduce ACRONYM, a dataset for robot grasp planning based on physics simulation.
no code implementations • 17 Nov 2020 • Shohin Mukherjee, Chris Paxton, Arsalan Mousavian, Adam Fishman, Maxim Likhachev, Dieter Fox
Zero-shot execution of unseen robotic tasks is important to allowing robots to perform a wide variety of tasks in human environments, but collecting the amounts of data necessary to train end-to-end policies in the real-world is often infeasible.
no code implementations • 17 Nov 2020 • Wei Yang, Chris Paxton, Arsalan Mousavian, Yu-Wei Chao, Maya Cakmak, Dieter Fox
We demonstrate the generalizability, usability, and robustness of our approach on a novel benchmark set of 26 diverse household objects, a user study with naive users (N=6) handing over a subset of 15 objects, and a systematic evaluation examining different ways of handing objects.
1 code implementation • 17 Nov 2020 • Bhairav Mehta, Ankur Handa, Dieter Fox, Fabio Ramos
Simulators are a critical component of modern robotics research.
no code implementations • 15 Nov 2020 • Alexander Lambert, Adam Fishman, Dieter Fox, Byron Boots, Fabio Ramos
By casting MPC as a Bayesian inference problem, we employ variational methods for posterior computation, naturally encoding the complexity and multi-modality of the decision making problem.
1 code implementation • 8 Nov 2020 • Junha Roh, Christoforos Mavrogiannis, Rishabh Madan, Dieter Fox, Siddhartha S. Srinivasa
Our key insight is that the geometric structure of the intersection and the incentive of agents to move efficiently and avoid collisions (rationality) reduces the space of likely behaviors, effectively relaxing the problem of trajectory prediction.
1 code implementation • 2 Oct 2020 • Lirui Wang, Yu Xiang, Wei Yang, Arsalan Mousavian, Dieter Fox
We demonstrate that our learned policy can be integrated into a tabletop 6D grasping system and a human-robot handover system to improve the grasping performance of unseen objects.
2 code implementations • ECCV 2020 • Wentao Yuan, Ben Eckart, Kihwan Kim, Varun Jampani, Dieter Fox, Jan Kautz
Point cloud registration is a fundamental problem in 3D computer vision, graphics and robotics.
1 code implementation • 30 Jul 2020 • Yu Xiang, Christopher Xie, Arsalan Mousavian, Dieter Fox
In this work, we propose a new method for unseen object instance segmentation by learning RGB-D feature embeddings from synthetic data.
1 code implementation • 16 Jul 2020 • Christopher Xie, Yu Xiang, Arsalan Mousavian, Dieter Fox
We also show that our method can segment unseen objects for robot grasping.
1 code implementation • NeurIPS 2020 • Yunzhu Li, Antonio Torralba, Animashree Anandkumar, Dieter Fox, Animesh Garg
We assume access to different configurations and environmental conditions, i. e., data from unknown interventions on the underlying system; thus, we can hope to discover the correct underlying causal graph without explicit interventions.
2 code implementations • L4DC 2020 • Muhammad Asif Rana, Anqi Li, Dieter Fox, Byron Boots, Fabio Ramos, Nathan Ratliff
The complex motions are encoded as rollouts of a stable dynamical system, which, under a change of coordinates defined by a diffeomorphism, is equivalent to a simple, hand-specified dynamical system.
no code implementations • 21 May 2020 • Michelle A. Lee, Carlos Florensa, Jonathan Tremblay, Nathan Ratliff, Animesh Garg, Fabio Ramos, Dieter Fox
Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state.
1 code implementation • 18 Mar 2020 • Daniel Gordon, Kiana Ehsani, Dieter Fox, Ali Farhadi
Recent single image unsupervised representation learning techniques show remarkable success on a variety of tasks.
no code implementations • 12 Mar 2020 • Wei Yang, Chris Paxton, Maya Cakmak, Dieter Fox
In this paper, we propose an approach for human-to-robot handovers in which the robot meets the human halfway, by classifying the human's grasp of the object and quickly planning a trajectory accordingly to take the object from the human's hand according to their intent.
no code implementations • 8 Mar 2020 • Kei Kase, Chris Paxton, Hammad Mazhar, Tetsuya OGATA, Dieter Fox
On the other hand, symbolic planning methods such as STRIPS have long been able to solve new problems given only a domain definition and a symbolic goal, but these approaches often struggle on the real world robotic tasks due to the challenges of grounding these symbols from sensor data in a partially-observable world.
no code implementations • 31 Dec 2019 • Mohak Bhardwaj, Ankur Handa, Dieter Fox, Byron Boots
Model-free Reinforcement Learning (RL) works well when experience can be collected cheaply and model-based RL is effective when system dynamics can be modeled accurately.
no code implementations • 11 Dec 2019 • Clemens Eppner, Arsalan Mousavian, Dieter Fox
With the increasing speed and quality of physics simulations, generating large-scale grasping data sets that feed learning algorithms is becoming more and more popular.
1 code implementation • 8 Dec 2019 • Adithyavairavan Murali, Arsalan Mousavian, Clemens Eppner, Chris Paxton, Dieter Fox
Grasping in cluttered environments is a fundamental but challenging robotic skill.
11 code implementations • CVPR 2020 • Mohit Shridhar, Jesse Thomason, Daniel Gordon, Yonatan Bisk, Winson Han, Roozbeh Mottaghi, Luke Zettlemoyer, Dieter Fox
We present ALFRED (Action Learning From Realistic Environments and Directives), a benchmark for learning a mapping from natural language instructions and egocentric vision to sequences of actions for household tasks.
1 code implementation • CVPR 2020 • Keunhong Park, Arsalan Mousavian, Yu Xiang, Dieter Fox
We evaluate the performance of our method for unseen object pose estimation on MOPED as well as the ModelNet and LINEMOD datasets.
1 code implementation • 22 Nov 2019 • Lirui Wang, Yu Xiang, Dieter Fox
In robot manipulation, planning the motion of a robot manipulator to grasp an object is a fundamental problem.
Robotics
no code implementations • 21 Nov 2019 • Visak Kumar, Tucker Herman, Dieter Fox, Stan Birchfield, Jonathan Tremblay
We propose a Grasping Objects Approach for Tactile (GOAT) robotic hands, learning to overcome the reality gap problem.
Robotics
2 code implementations • 21 Nov 2019 • Timothy E. Lee, Jonathan Tremblay, Thang To, Jia Cheng, Terry Mosier, Oliver Kroemer, Dieter Fox, Stan Birchfield
We show experimental results for three different camera sensors, demonstrating that our approach is able to achieve accuracy with a single frame that is better than that of classic off-line hand-eye calibration using multiple frames.
Robotics
no code implementations • 13 Nov 2019 • De-An Huang, Yu-Wei Chao, Chris Paxton, Xinke Deng, Li Fei-Fei, Juan Carlos Niebles, Animesh Garg, Dieter Fox
We further show that by using the automatically inferred goal from the video demonstration, our robot is able to reproduce the same task in a real kitchen environment.
no code implementations • 13 Nov 2019 • Ajay Mandlekar, Fabio Ramos, Byron Boots, Silvio Savarese, Li Fei-Fei, Animesh Garg, Dieter Fox
For simple short-horizon manipulation tasks with modest variation in task instances, offline learning from a small set of demonstrations can produce controllers that successfully solve the task.
1 code implementation • 11 Nov 2019 • Caelan Reed Garrett, Chris Paxton, Tomás Lozano-Pérez, Leslie Pack Kaelbling, Dieter Fox
To solve multi-step manipulation tasks in the real world, an autonomous robot must take actions to observe its environment and react to unexpected observations.
no code implementations • 30 Oct 2019 • Felix Leeb, Arunkumar Byravan, Dieter Fox
In this work, we bridge the gap between recent pose estimation and tracking work to develop a powerful method for robots to track objects in their surroundings.
no code implementations • 16 Oct 2019 • Junha Roh, Chris Paxton, Andrzej Pronobis, Ali Farhadi, Dieter Fox
Widespread adoption of self-driving cars will depend not only on their safety but largely on their ability to interact with human users.
no code implementations • 7 Oct 2019 • Mustafa Mukadam, Ching-An Cheng, Dieter Fox, Byron Boots, Nathan Ratliff
RMPfusion supplements RMPflow with weight functions that can hierarchically reshape the Lyapunov functions of the subtask RMPs according to the current configuration of the robot and environment.
no code implementations • 7 Oct 2019 • Ankur Handa, Karl Van Wyk, Wei Yang, Jacky Liang, Yu-Wei Chao, Qian Wan, Stan Birchfield, Nathan Ratliff, Dieter Fox
Teleoperation offers the possibility of imparting robotic systems with sophisticated reasoning skills, intuition, and creativity to perform tasks.
3 code implementations • 23 Sep 2019 • Xinke Deng, Yu Xiang, Arsalan Mousavian, Clemens Eppner, Timothy Bretl, Dieter Fox
In this way, our system is able to continuously collect data and improve its pose estimation modules.
Robotics
no code implementations • 5 Aug 2019 • Weilin Wan, Aaron Walsman, Dieter Fox
While recent work has shown direct estimation techniques can be quite powerful, geometric tracking methods using point clouds can provide a very high level of 3D accuracy which is necessary for many robotic applications.
no code implementations • 30 Jul 2019 • Christopher Xie, Yu Xiang, Arsalan Mousavian, Dieter Fox
We show that our method, trained on this dataset, can produce sharp and accurate masks, outperforming state-of-the-art methods on unseen object instance segmentation.
1 code implementation • 4 Jun 2019 • Fabio Ramos, Rafael Carvalhaes Possas, Dieter Fox
We introduce BayesSim, a framework for robotics simulations allowing a full Bayesian treatment for the parameters of the simulator.
2 code implementations • ICCV 2019 • Arsalan Mousavian, Clemens Eppner, Dieter Fox
We evaluate our approach in simulation and real-world robot experiments.
1 code implementation • 22 May 2019 • Xinke Deng, Arsalan Mousavian, Yu Xiang, Fei Xia, Timothy Bretl, Dieter Fox
In this work, we formulate the 6D object pose tracking problem in the Rao-Blackwellized particle filtering framework, where the 3D rotation and the 3D translation of an object are decoupled.
4 code implementations • 7 Apr 2019 • Samarth Brahmbhatt, Ankur Handa, James Hays, Dieter Fox
Using a dataset of contact demonstrations from humans grasping diverse household objects, we synthesize functional grasps for three hand models and two functional intents.
no code implementations • 20 Mar 2019 • Chris Paxton, Yonatan Bisk, Jesse Thomason, Arunkumar Byravan, Dieter Fox
High-level human instructions often correspond to behaviors with multiple implicit steps.
no code implementations • 6 Jan 2019 • Daniel Gordon, Dieter Fox, Ali Farhadi
In this work we propose Hierarchical Planning and Reinforcement Learning (HIP-RL), a method for merging the benefits and capabilities of Symbolic Planning with the learning abilities of Deep Reinforcement Learning.
no code implementations • CVPR 2019 • Christopher Xie, Yu Xiang, Zaid Harchaoui, Dieter Fox
We consider the problem of providing dense segmentation masks for object discovery in videos.
no code implementations • 21 Nov 2018 • Aaron Walsman, Yonatan Bisk, Saadia Gabriel, Dipendra Misra, Yoav Artzi, Yejin Choi, Dieter Fox
Building perceptual systems for robotics which perform well under tight computational budgets requires novel architectures which rethink the traditional computer vision pipeline.
1 code implementation • 16 Nov 2018 • Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff
We develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs).
Robotics Systems and Control
no code implementations • 12 Oct 2018 • Jacky Liang, Viktor Makoviychuk, Ankur Handa, Nuttapong Chentanez, Miles Macklin, Dieter Fox
Most Deep Reinforcement Learning (Deep RL) algorithms require a prohibitively large number of training samples for learning complex tasks.
Robotics
no code implementations • 12 Oct 2018 • Yevgen Chebotar, Ankur Handa, Viktor Makoviychuk, Miles Macklin, Jan Issac, Nathan Ratliff, Dieter Fox
In doing so, we are able to change the distribution of simulations to improve the policy transfer by matching the policy behavior in simulation and the real world.
8 code implementations • 27 Sep 2018 • Jonathan Tremblay, Thang To, Balakumar Sundaralingam, Yu Xiang, Dieter Fox, Stan Birchfield
Using synthetic data generated in this manner, we introduce a one-shot deep neural network that is able to perform competitively against a state-of-the-art network trained on a combination of real and synthetic data.
Robotics
1 code implementation • 15 Jun 2018 • Connor Schenck, Dieter Fox
In this paper we introduce Smooth Particle Networks (SPNets), a framework for integrating fluid dynamics with deep networks.
Robotics
no code implementations • 18 Apr 2018 • Niko Sünderhauf, Oliver Brock, Walter Scheirer, Raia Hadsell, Dieter Fox, Jürgen Leitner, Ben Upcroft, Pieter Abbeel, Wolfram Burgard, Michael Milford, Peter Corke
In this paper we discuss a number of robotics-specific learning, reasoning, and embodiment challenges for deep learning.
Robotics
2 code implementations • ECCV 2018 • Yi Li, Gu Wang, Xiangyang Ji, Yu Xiang, Dieter Fox
Estimating the 6D pose of objects from images is an important problem in various applications such as robot manipulation and virtual reality.
Ranked #1 on 6D Pose Estimation using RGB on YCB-Video
1 code implementation • CVPR 2018 • Daniel Gordon, Aniruddha Kembhavi, Mohammad Rastegari, Joseph Redmon, Dieter Fox, Ali Farhadi
Our experiments show that our proposed model outperforms popular single controller based methods on IQUAD V1.
no code implementations • 21 Nov 2017 • Aaron Walsman, Weilin Wan, Tanner Schmidt, Dieter Fox
The last several years have seen significant progress in using depth cameras for tracking articulated objects such as human bodies, hands, and robotic manipulators.
no code implementations • ICLR 2018 • Antoine Bosselut, Omer Levy, Ari Holtzman, Corin Ennis, Dieter Fox, Yejin Choi
Understanding procedural language requires anticipating the causal effects of actions, even when they are not explicitly stated.
12 code implementations • 1 Nov 2017 • Yu Xiang, Tanner Schmidt, Venkatraman Narayanan, Dieter Fox
We conduct extensive experiments on our YCB-Video dataset and the OccludedLINEMOD dataset to show that PoseCNN is highly robust to occlusions, can handle symmetric objects, and provide accurate pose estimation using only color images as input.
Ranked #3 on 6D Pose Estimation using RGB on YCB-Video
no code implementations • 2 Oct 2017 • Arunkumar Byravan, Felix Leeb, Franziska Meier, Dieter Fox
In this work, we present an approach to deep visuomotor control using structured deep dynamics models.
no code implementations • ICCV 2017 • Yuke Zhu, Daniel Gordon, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi
A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world.
11 code implementations • 17 May 2017 • Daniel Gordon, Ali Farhadi, Dieter Fox
Robust object tracking requires knowledge and understanding of the object being tracked: its appearance, its motion, and how it changes over time.
1 code implementation • 9 Mar 2017 • Yu Xiang, Dieter Fox
3D scene understanding is important for robots to interact with the 3D world in a meaningful way.
no code implementations • 5 Mar 2017 • Conor Schenck, Dieter Fox
That is, a robot asks the questions What in the visual data stream is liquid?
no code implementations • 5 Mar 2017 • Connor Schenck, Dieter Fox
In this paper, we show how to close the loop between liquid simulation and real-time perception.
no code implementations • ICCV 2017 • Roozbeh Mottaghi, Connor Schenck, Dieter Fox, Ali Farhadi
Doing so requires estimating the volume of the cup, approximating the amount of water in the pitcher, and predicting the behavior of water when we tilt the pitcher.
no code implementations • 9 Oct 2016 • Connor Schenck, Dieter Fox
We propose both a model-based and a model-free method utilizing deep learning for estimating the volume of liquid in a container.
no code implementations • 2 Aug 2016 • Connor Schenck, Dieter Fox
Recent advances in AI and robotics have claimed many incredible results with deep learning, yet no work to date has applied deep learning to the problem of liquid perception and reasoning.
no code implementations • 20 Jun 2016 • Connor Schenck, Dieter Fox
In this paper, we apply fully-convolutional deep neural networks to the tasks of detecting and tracking liquids.
no code implementations • 8 Jun 2016 • Arunkumar Byravan, Dieter Fox
We introduce SE3-Nets, which are deep neural networks designed to model and learn rigid body motion from raw point cloud data.
1 code implementation • CVPR 2015 • Richard A. Newcombe, Dieter Fox, Steven M. Seitz
We present the first dense SLAM system capable of reconstructing non-rigidly deforming scenes in real-time, by fusing together RGBD scans captured from commodity sensors.
no code implementations • 27 Apr 2015 • Yuyin Sun, Adish Singla, Dieter Fox, Andreas Krause
Hierarchies of concepts are useful in many applications from navigation to organization of objects.
1 code implementation • 10 Feb 2015 • Marc Peter Deisenroth, Dieter Fox, Carl Edward Rasmussen
Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required.
no code implementations • 2 Jul 2013 • Marc Peter Deisenroth, Peter Englert, Jan Peters, Dieter Fox
Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics.
no code implementations • CVPR 2013 • Liefeng Bo, Xiaofeng Ren, Dieter Fox
Complex real-world signals, such as images, contain discriminative structures that differ in many aspects including scale, invariance, and data channel.
no code implementations • 27 Jun 2012 • Cynthia Matuszek, Nicholas FitzGerald, Luke Zettlemoyer, Liefeng Bo, Dieter Fox
As robots become more ubiquitous and capable, it becomes ever more important to enable untrained users to easily interact with them.
no code implementations • NeurIPS 2010 • Liefeng Bo, Xiaofeng Ren, Dieter Fox
We highlight the kernel view of orientation histograms, and show that they are equivalent to a certain type of match kernels over image patches.