no code implementations • 21 Nov 2017 • Aaron Walsman, Weilin Wan, Tanner Schmidt, Dieter Fox
The last several years have seen significant progress in using depth cameras for tracking articulated objects such as human bodies, hands, and robotic manipulators.
2 code implementations • 23 Jan 2018 • Claudia Yan, Dipendra Misra, Andrew Bennnett, Aaron Walsman, Yonatan Bisk, Yoav Artzi
We present CHALET, a 3D house simulator with support for navigation and manipulation.
no code implementations • 21 Nov 2018 • Aaron Walsman, Yonatan Bisk, Saadia Gabriel, Dipendra Misra, Yoav Artzi, Yejin Choi, Dieter Fox
Building perceptual systems for robotics which perform well under tight computational budgets requires novel architectures which rethink the traditional computer vision pipeline.
no code implementations • 5 Aug 2019 • Weilin Wan, Aaron Walsman, Dieter Fox
While recent work has shown direct estimation techniques can be quite powerful, geometric tracking methods using point clouds can provide a very high level of 3D accuracy which is necessary for many robotic applications.
2 code implementations • 6 Jul 2020 • Matthew Wallingford, Aditya Kusupati, Keivan Alizadeh-Vahid, Aaron Walsman, Aniruddha Kembhavi, Ali Farhadi
To foster research towards the goal of general ML methods, we introduce a new unified evaluation framework - FLUID (Flexible Sequential Data).
1 code implementation • 28 Sep 2020 • William Agnew, Christopher Xie, Aaron Walsman, Octavian Murad, Caelen Wang, Pedro Domingos, Siddhartha Srinivasa
By using these priors over the physical properties of objects, our system improves reconstruction quality not just by standard visual metrics, but also performance of model-based control on a variety of robotics manipulation tasks in challenging, cluttered environments.
2 code implementations • 27 Jul 2022 • Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox
We pair this simulator with a new dataset of fan-made LEGO creations that have been uploaded to the internet in order to provide complex scenes containing over a thousand unique brick shapes.
no code implementations • ICCV 2023 • Klemen Kotar, Aaron Walsman, Roozbeh Mottaghi
ENTL's generic architecture enables sharing of the spatio-temporal sequence encoder for multiple challenging embodied tasks.