Search Results for author: Rogerio Bonatti

Found 19 papers, 6 papers with code

EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields

1 code implementation3 Oct 2023 Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta

We present EvDNeRF, a pipeline for generating event data and training an event-based dynamic NeRF, for the purpose of faithfully reconstructing eventstreams on scenes with rigid and non-rigid deformations that may be too fast to capture with a standard camera.

Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

1 code implementation ICCV 2023 Yao Wei, Yanchao Sun, Ruijie Zheng, Sai Vemprala, Rogerio Bonatti, Shuhang Chen, Ratnesh Madaan, Zhongjie Ba, Ashish Kapoor, Shuang Ma

We introduce DualMind, a generalist agent designed to tackle various decision-making tasks that addresses challenges posed by current methods, such as overfitting behaviors and dependence on task-specific fine-tuning.

Decision Making

ConBaT: Control Barrier Transformer for Safe Policy Learning

no code implementations7 Mar 2023 Yue Meng, Sai Vemprala, Rogerio Bonatti, Chuchu Fan, Ashish Kapoor

In this work, we propose Control Barrier Transformer (ConBaT), an approach that learns safe behaviors from demonstrations in a self-supervised fashion.

Imitation Learning Model Predictive Control

ChatGPT for Robotics: Design Principles and Model Abilities

1 code implementation20 Feb 2023 Sai Vemprala, Rogerio Bonatti, Arthur Bucker, Ashish Kapoor

This paper presents an experimental study regarding the use of OpenAI's ChatGPT for robotics applications.

Mathematical Reasoning Prompt Engineering

SMART: Self-supervised Multi-task pretrAining with contRol Transformers

no code implementations24 Jan 2023 Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor

Self-supervised pretraining has been extensively studied in language and vision domains, where a unified model can be easily adapted to various downstream tasks by pretraining representations without explicit labels.

Imitation Learning Reinforcement Learning (RL)

PACT: Perception-Action Causal Transformer for Autoregressive Robotics Pre-Training

no code implementations22 Sep 2022 Rogerio Bonatti, Sai Vemprala, Shuang Ma, Felipe Frujeri, Shuhang Chen, Ashish Kapoor

Robotics has long been a field riddled with complex systems architectures whose modules and connections, whether traditional or learning-based, require significant human expertise and prior knowledge.

LATTE: LAnguage Trajectory TransformEr

2 code implementations4 Aug 2022 Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Sai Vemprala, Rogerio Bonatti

Natural language is one of the most intuitive ways to express human intent.

Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers

no code implementations25 Mar 2022 Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Rogerio Bonatti

However, using language is seldom an easy task when humans need to express their intent towards robots, since most of the current language interfaces require rigid templates with a static set of action targets and commands.

Imitation Learning Text Generation

3D Human Reconstruction in the Wild with Collaborative Aerial Cameras

no code implementations9 Aug 2021 Cherie Ho, Andrew Jong, Harry Freeman, Rohan Rao, Rogerio Bonatti, Sebastian Scherer

Aerial vehicles are revolutionizing applications that require capturing the 3D structure of dynamic targets in the wild, such as sports, medicine, and entertainment.

3D Human Reconstruction

Batteries, camera, action! Learning a semantic control space for expressive robot cinematography

no code implementations19 Nov 2020 Rogerio Bonatti, Arthur Bucker, Sebastian Scherer, Mustafa Mukadam, Jessica Hodgins

First, we generate a database of video clips with a diverse range of shots in a photo-realistic simulator, and use hundreds of participants in a crowd-sourcing framework to obtain scores for a set of semantic descriptors for each clip.

Do You See What I See? Coordinating Multiple Aerial Cameras for Robot Cinematography

no code implementations10 Nov 2020 Arthur Bucker, Rogerio Bonatti, Sebastian Scherer

We validate our approach in multiple cluttered environments of a photo-realistic simulator, and deploy the system using two UAVs in real-world experiments.

Autonomous Aerial Cinematography In Unstructured Environments With Learned Artistic Decision-Making

no code implementations15 Oct 2019 Rogerio Bonatti, Wenshan Wang, Cherie Ho, Aayush Ahuja, Mirko Gschwindt, Efe Camci, Erdal Kayacan, Sanjiban Choudhury, Sebastian Scherer

In this work, we address the problem in its entirety and propose a complete system for real-time aerial cinematography that for the first time combines: (1) vision-based target estimation; (2) 3D signed-distance mapping for occlusion estimation; (3) efficient trajectory optimization for long time-horizon camera motion; and (4) learning-based artistic shot selection.

Decision Making Occlusion Estimation

Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations

2 code implementations16 Sep 2019 Rogerio Bonatti, Ratnesh Madaan, Vibhav Vineet, Sebastian Scherer, Ashish Kapoor

We analyze the rich latent spaces learned with our proposed representations, and show that the use of our cross-modal architecture significantly improves control policy performance as compared to end-to-end learning or purely unsupervised feature extractors.

Drone navigation Imitation Learning

Towards a Robust Aerial Cinematography Platform: Localizing and Tracking Moving Targets in Unstructured Environments

no code implementations4 Apr 2019 Rogerio Bonatti, Cherie Ho, Wenshan Wang, Sanjiban Choudhury, Sebastian Scherer

In this work, we overcome such limitations and propose a complete system for aerial cinematography that combines: (1) a vision-based algorithm for target localization; (2) a real-time incremental 3D signed-distance map algorithm for occlusion and safety computation; and (3) a real-time camera motion planner that optimizes smoothness, collisions, occlusions and artistic guidelines.

Pose Estimation

Improved Generalization of Heading Direction Estimation for Aerial Filming Using Semi-supervised Regression

no code implementations26 Mar 2019 Wenshan Wang, Aayush Ahuja, Yanfu Zhang, Rogerio Bonatti, Sebastian Scherer

We show that by leveraging unlabeled sequences, the amount of labeled data required can be significantly reduced.

regression

Integrating kinematics and environment context into deep inverse reinforcement learning for predicting off-road vehicle trajectories

1 code implementation16 Oct 2018 Yanfu Zhang, Wenshan Wang, Rogerio Bonatti, Daniel Maturana, Sebastian Scherer

The first-stage network learns feature representations of the environment using low-level LiDAR statistics and the second-stage network combines those learned features with kinematics data.

Autonomous Navigation motion prediction +1

Autonomous drone cinematographer: Using artistic principles to create smooth, safe, occlusion-free trajectories for aerial filming

no code implementations28 Aug 2018 Rogerio Bonatti, yanfu Zhang, Sanjiban Choudhury, Wenshan Wang, Sebastian Scherer

Autonomous aerial cinematography has the potential to enable automatic capture of aesthetically pleasing videos without requiring human intervention, empowering individuals with the capability of high-end film studios.

Cannot find the paper you are looking for? You can Submit a new open access paper.