Search Results for author: Nur Muhammad Mahi Shafiullah

Found 6 papers, 5 papers with code

Behavior Generation with Latent Actions

1 code implementation | 5 Mar 2024 | Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

Unlike language or image generation, decision making requires modeling actions - continuous-valued vectors that are multimodal in their distribution, potentially drawn from uncurated sources, where generation errors can compound in sequential prediction.

Autonomous Driving, Decision Making, +2

OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics

1 code implementation | 22 Jan 2024 | Peiqi Liu, Yaswanth Orru, Jay Vakil, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

The results demonstrate that OK-Robot achieves a 58.5% success rate in open-ended pick-and-drop tasks, representing a new state-of-the-art in Open Vocabulary Mobile Manipulation (OVMM) with nearly 1.8x the performance of prior work.

Object Detection

On Bringing Robots Home

1 code implementation | 27 Nov 2023 | Nur Muhammad Mahi Shafiullah, Anant Rai, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

We use the Stick to collect 13 hours of data in 22 homes of New York City, and train Home Pretrained Representations (HPR).

From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data

no code implementations | 18 Oct 2022 | Zichen Jeff Cui, Yibin Wang, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging.

Image Generation

CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

2 code implementations | 11 Oct 2022 | Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam

We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization.

Segmentation, Semantic Segmentation, +1

Behavior Transformers: Cloning $k$ modes with one stone

2 code implementations | 22 Jun 2022 | Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto

In this work, we present Behavior Transformer (BeT), a new technique to model unlabeled demonstration data with multiple modes.

Object Detection, Offline RL
