no code implementations • 26 Jun 2024 • Shengyi Qian, Kaichun Mo, Valts Blukis, David F. Fouhey, Dieter Fox, Ankit Goyal
Our results suggest that 3D-aware pretraining is a promising approach to improve sample efficiency and generalization of vision-based robotic manipulation policies.
1 code implementation • 12 Jun 2024 • Ankit Goyal, Valts Blukis, Jie Xu, Yijie Guo, Yu-Wei Chao, Dieter Fox
In this work, we study how to build a robotic system that can solve multiple 3D manipulation tasks given language instructions.
Ranked #3 on Robot Manipulation Generalization on GEMBench
no code implementations • 11 Apr 2024 • Tongzhou Mu, Yijie Guo, Jie Xu, Ankit Goyal, Hao Su, Dieter Fox, Animesh Garg
Encouraged by the remarkable achievements of language and vision foundation models, developing generalist robotic agents through imitation learning, using large demonstration datasets, has become a prominent area of interest in robot learning.
no code implementations • 10 Jul 2023 • Anthony Simeonov, Ankit Goyal, Lucas Manuelli, Lin Yen-Chen, Alina Sarmiento, Alberto Rodriguez, Pulkit Agrawal, Dieter Fox
We propose a system for rearranging objects in a scene to achieve a desired object-scene placing relationship, such as a book inserted in an open slot of a bookshelf.
1 code implementation • 26 Jun 2023 • Ankit Goyal, Jie Xu, Yijie Guo, Valts Blukis, Yu-Wei Chao, Dieter Fox
In simulations, we find that a single RVT model works well across 18 RLBench tasks with 249 task variations, achieving 26% higher relative success than the existing state-of-the-art method (PerAct).
Ranked #6 on Robot Manipulation on RLBench
1 code implementation • CVPR 2023 • Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang, Alejandro Newell, Hei Law, Ankit Goyal, Kaiyu Yang, Jia Deng
We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural world.
no code implementations • 22 Sep 2022 • Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg
To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information.
1 code implementation • CVPR 2022 • Lahav Lipson, Zachary Teed, Ankit Goyal, Jia Deng
We propose a new approach to 6D object pose estimation which consists of an end-to-end differentiable architecture that makes use of geometric knowledge.
no code implementations • CVPR 2022 • Ankit Goyal, Arsalan Mousavian, Chris Paxton, Yu-Wei Chao, Brian Okorn, Jia Deng, Dieter Fox
Accurate object rearrangement from vision is a crucial problem for a wide variety of real-world robotics applications in unstructured environments.
no code implementations • 2 Dec 2021 • Ipsita Mohanty, Ankit Goyal, Alex Dotterweich
Contrastive learning techniques have been widely used in the field of computer vision as a means of augmenting datasets.
4 code implementations • 14 Oct 2021 • Ankit Goyal, Alexey Bochkovskiy, Jia Deng, Vladlen Koltun
This begs the question: is it possible to build high-performing "non-deep" neural networks?
3 code implementations • 9 Jun 2021 • Ankit Goyal, Hei Law, Bowei Liu, Alejandro Newell, Jia Deng
It also outperforms state-of-the-art methods on ScanObjectNN, a real-world point cloud benchmark, and demonstrates better cross-dataset generalization.
Ranked #18 on Point Cloud Classification on PointCloud-C
2 code implementations • 1 Jan 2021 • Ankit Goyal, Hei Law, Bowei Liu, Alejandro Newell, Jia Deng
It also outperforms state-of-the-art methods on ScanObjectNN, a real-world point cloud benchmark, and demonstrates better cross-dataset generalization.
Ranked #12 on 3D Point Cloud Classification on ModelNet40-C
2 code implementations • NeurIPS 2020 • Ankit Goyal, Kaiyu Yang, Dawei Yang, Jia Deng
The 3D scenes in our dataset come in minimally contrastive pairs: two scenes in a pair are almost identical, but a spatial relation holds in one and fails in the other.
Ranked #1 on Spatial Relation Recognition on Rel3D
1 code implementation • ICML 2020 • Ankit Goyal, Jia Deng
The ability to jointly understand the geometry of objects and plan actions for manipulating them is crucial for intelligent agents.
Ranked #1 on Robot Task Planning on PackIt
no code implementations • 15 Jun 2019 • Rajhans Samdani, Pierre Rappolt, Ankit Goyal, Pratyus Patnaik
We present a system, Spoke, for creating and searching internal knowledge base (KB) articles for organizations.
1 code implementation • ACL 2018 • Ankit Goyal, Jian Wang, Jia Deng
In this paper, we study the problem of geometric reasoning in the context of question-answering.
1 code implementation • ICML 2017 • Chirag Gupta, Arun Sai Suggala, Ankit Goyal, Harsha Vardhan Simhadri, Bhargavi Paranjape, Ashish Kumar, Saurabh Goyal, Raghavendra Udupa, Manik Varma, Prateek Jain
Such applications demand prediction models with small storage and computational complexity that do not compromise significantly on accuracy.