Search Results for author: Boris Ivanovic

Found 49 papers, 23 papers with code

DreamDrive: Generative 4D Scene Modeling from Street View Images

no code implementations31 Dec 2024 Jiageng Mao, Boyi Li, Boris Ivanovic, Yuxiao Chen, Yan Wang, Yurong You, Chaowei Xiao, Danfei Xu, Marco Pavone, Yue Wang

In this paper, we present DreamDrive, a 4D spatial-temporal scene generation approach that combines the merits of generation and reconstruction, to synthesize generalizable 4D driving scenes and dynamic driving videos with 3D consistency.

Autonomous Driving Neural Rendering +2

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models

no code implementations10 Dec 2024 Ziqi Lu, Heng Yang, Danfei Xu, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang

Emerging 3D geometric foundation models, such as DUSt3R, offer a promising approach for in-the-wild 3D vision tasks.

3D Reconstruction Pose Estimation

Large Spatial Model: End-to-end Unposed Images to Semantic 3D

1 code implementation24 Oct 2024 Zhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang

To tackle the scarcity of labeled 3D semantic data and enable natural language-driven scene manipulation, we incorporate a pre-trained 2D language-based segmentation model into a 3D-consistent semantic feature field.

3D Reconstruction Attribute

Distributed NeRF Learning for Collaborative Multi-Robot Perception

no code implementations30 Sep 2024 Hongrui Zhao, Boris Ivanovic, Negar Mehr

We show the effectiveness of our method through an extensive set of experiments on datasets containing challenging real-world scenes, achieving performance comparable to centralized mapping of the environment where data is sent to a central server for processing.

OmniRe: Omni Urban Scene Reconstruction

no code implementations29 Aug 2024 Ziyu Chen, Jiawei Yang, Jiahui Huang, Riccardo de Lutio, Janick Martinez Esturo, Boris Ivanovic, Or Litany, Zan Gojcic, Sanja Fidler, Marco Pavone, Li Song, Yue Wang

To that end, we propose a comprehensive 3DGS framework for driving scenes, named OmniRe, that allows for accurate, full-length reconstruction of diverse dynamic objects in a driving log.

NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

1 code implementation21 Jun 2024 Daniel Dauner, Marcel Hallgarten, Tianyu Li, Xinshuo Weng, Zhiyu Huang, Zetong Yang, Hongyang Li, Igor Gilitschenski, Boris Ivanovic, Marco Pavone, Andreas Geiger, Kashyap Chitta

On a large set of challenging scenarios, we observe that simple methods with moderate compute requirements such as TransFuser can match recent large-scale end-to-end driving architectures such as UniAD.

Autonomous Driving Benchmarking

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

no code implementations17 Jun 2024 Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus

Our method is a generalizable feedforward model that predicts a rich neural scene representation from sparse, single-frame multi-view camera inputs with limited view overlap, and is trained self-supervised with differentiable rendering to reconstruct RGB, depth, or feature images.

3D geometry 3D Semantic Occupancy Prediction +5

Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses

no code implementations16 Jun 2024 Zhiwen Fan, Pu Wang, Yang Zhao, Yibo Zhao, Boris Ivanovic, Zhangyang Wang, Marco Pavone, Hao Frank Yang

Leveraging this rich dataset, we further formulate the crash event feature learning as a novel text reasoning problem and further fine-tune various large language models (LLMs) to predict detailed accident outcomes, such as crash types, severity and number of injuries, based on contextual and environmental factors.

Ensemble Learning

Language-Image Models with 3D Understanding

no code implementations6 May 2024 Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone

Our experiments on outdoor benchmarks demonstrate that Cube-LLM significantly outperforms existing baselines by 21. 3 points of AP-BEV on the Talk2Car dataset for 3D grounded reasoning and 17. 7 points on the DriveLM dataset for complex reasoning about driving scenarios, respectively.

Question Answering Visual Question Answering

InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

1 code implementation29 Mar 2024 Zhiwen Fan, Kairun Wen, Wenyan Cong, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang

InstantSplat adopts a self-supervised framework that bridges the gap between 2D images and 3D representations using Gaussian Bundle Adjustment (GauBA) and can be optimized in an end-to-end manner.

3D Reconstruction Novel View Synthesis +1

Producing and Leveraging Online Map Uncertainty in Trajectory Prediction

1 code implementation CVPR 2024 Xunjiang Gu, Guanyu Song, Igor Gilitschenski, Marco Pavone, Boris Ivanovic

High-definition (HD) maps have played an integral role in the development of modern autonomous vehicle (AV) stacks, albeit with high associated labeling and maintenance costs.

Autonomous Driving Trajectory Forecasting

Parallelized Spatiotemporal Binding

no code implementations26 Feb 2024 Gautam Singh, Yue Wang, Jiawei Yang, Boris Ivanovic, Sungjin Ahn, Marco Pavone, Tong Che

While modern best practices advocate for scalable architectures that support long-range interactions, object-centric models are yet to fully embrace these architectures.

Decoder Object

Driving Everywhere with Large Language Model Policy Adaptation

no code implementations CVPR 2024 Boyi Li, Yue Wang, Jiageng Mao, Boris Ivanovic, Sushant Veer, Karen Leung, Marco Pavone

Adapting driving behavior to new environments, customs, and laws is a long-standing problem in autonomous driving, precluding the widespread deployment of autonomous vehicles (AVs).

Autonomous Driving Language Modeling +3

PARA-Drive: Parallelized Architecture for Real-time Autonomous Driving

no code implementations CVPR 2024 Xinshuo Weng, Boris Ivanovic, Yan Wang, Yue Wang, Marco Pavone

Recent works have proposed end-to-end autonomous vehicle (AV) architectures comprised of differentiable modules achieving state-of-the-art driving performance.

Autonomous Driving

Reinforcement Learning with Human Feedback for Realistic Traffic Simulation

no code implementations1 Sep 2023 Yulong Cao, Boris Ivanovic, Chaowei Xiao, Marco Pavone

This works aims to address this by developing a framework that employs reinforcement learning with human preference (RLHF) to enhance the realism of existing traffic models.

Diversity reinforcement-learning +1

trajdata: A Unified Interface to Multiple Human Trajectory Datasets

3 code implementations NeurIPS 2023 Boris Ivanovic, Guanyu Song, Igor Gilitschenski, Marco Pavone

The field of trajectory forecasting has grown significantly in recent years, partially owing to the release of numerous large-scale, real-world human trajectory datasets for autonomous vehicles (AVs) and pedestrian motion tracking.

Autonomous Vehicles Motion Forecasting +1

Language Conditioned Traffic Generation

1 code implementation16 Jul 2023 Shuhan Tan, Boris Ivanovic, Xinshuo Weng, Marco Pavone, Philipp Kraehenbuehl

In this work, we turn to language as a source of supervision for dynamic traffic scene generation.

Decoder Language Modeling +3

Language-Guided Traffic Simulation via Scene-Level Diffusion

no code implementations10 Jun 2023 Ziyuan Zhong, Davis Rempe, Yuxiao Chen, Boris Ivanovic, Yulong Cao, Danfei Xu, Marco Pavone, Baishakhi Ray

Realistic and controllable traffic simulation is a core capability that is necessary to accelerate autonomous vehicle (AV) development.

Language Modeling Language Modelling +1

Partial-View Object View Synthesis via Filtered Inversion

no code implementations3 Apr 2023 Fan-Yun Sun, Jonathan Tremblay, Valts Blukis, Kevin Lin, Danfei Xu, Boris Ivanovic, Peter Karkus, Stan Birchfield, Dieter Fox, Ruohan Zhang, Yunzhu Li, Jiajun Wu, Marco Pavone, Nick Haber

At inference, given one or more views of a novel real-world object, FINV first finds a set of latent codes for the object by inverting the generative model from multiple initial seeds.

Object

DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

no code implementations13 Dec 2022 Peter Karkus, Boris Ivanovic, Shie Mannor, Marco Pavone

To enable the joint optimization of AV stacks while retaining modularity, we present DiffStack, a differentiable and modular stack for prediction, planning, and control.

Autonomous Vehicles

Expanding the Deployment Envelope of Behavior Prediction via Adaptive Meta-Learning

2 code implementations23 Sep 2022 Boris Ivanovic, James Harrison, Marco Pavone

Learning-based behavior prediction methods are increasingly being deployed in real-world autonomous systems, e. g., in fleets of self-driving vehicles, which are beginning to commercially operate in major cities across the world.

Meta-Learning regression

BITS: Bi-level Imitation for Traffic Simulation

1 code implementation26 Aug 2022 Danfei Xu, Yuxiao Chen, Boris Ivanovic, Marco Pavone

We empirically validate our method, named Bi-level Imitation for Traffic Simulation (BITS), with scenarios from two large-scale driving datasets and show that BITS achieves balanced traffic simulation performance in realism, diversity, and long-horizon stability.

Autonomous Vehicles Diversity

ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning

1 code implementation CVPR 2022 Yuxiao Chen, Boris Ivanovic, Marco Pavone

In this work, we present ScePT, a policy planning-based trajectory prediction model that generates accurate, scene-consistent trajectory predictions suitable for autonomous system motion planning.

Motion Planning Trajectory Prediction

MTP: Multi-Hypothesis Tracking and Prediction for Reduced Error Propagation

1 code implementation18 Oct 2021 Xinshuo Weng, Boris Ivanovic, Marco Pavone

Recently, there has been tremendous progress in developing each individual module of the standard perception-planning robot autonomy pipeline, including detection, tracking, prediction of other agents' trajectories, and ego-agent trajectory planning.

Trajectory Planning

Propagating State Uncertainty Through Trajectory Forecasting

1 code implementation7 Oct 2021 Boris Ivanovic, Yifeng Lin, Shubham Shrivastava, Punarjay Chakravarty, Marco Pavone

As a result, perceptual uncertainties are not propagated through forecasting and predictions are frequently overconfident.

Trajectory Forecasting

Injecting Planning-Awareness into Prediction and Detection Evaluation

1 code implementation7 Oct 2021 Boris Ivanovic, Marco Pavone

Detecting other agents and forecasting their behavior is an integral part of the modern robotic autonomy stack, especially in safety-critical scenarios entailing human-robot interaction such as autonomous driving.

Autonomous Driving Decision Making +1

Sample-Efficient Safety Assurances using Conformal Prediction

no code implementations28 Sep 2021 Rachel Luo, Shengjia Zhao, Jonathan Kuck, Boris Ivanovic, Silvio Savarese, Edward Schmerling, Marco Pavone

When deploying machine learning models in high-stakes robotics applications, the ability to detect unsafe situations is crucial.

Conformal Prediction Robotic Grasping

Rethinking Trajectory Forecasting Evaluation

no code implementations21 Jul 2021 Boris Ivanovic, Marco Pavone

Forecasting the behavior of other agents is an integral part of the modern robotic autonomy stack, especially in safety-critical scenarios with human-robot interaction, such as autonomous driving.

Autonomous Driving Decision Making +1

Leveraging Neural Network Gradients within Trajectory Optimization for Proactive Human-Robot Interactions

1 code implementation2 Dec 2020 Simon Schaefer, Karen Leung, Boris Ivanovic, Marco Pavone

To achieve seamless human-robot interactions, robots need to intimately reason about complex interaction dynamics and future human behaviors within their motion planning process.

Motion Planning Navigate +1

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

1 code implementation NeurIPS 2020 Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J. Kochenderfer, Marco Pavone

Discrete latent spaces in variational autoencoders have been shown to effectively capture the data distribution for many real-world problems such as natural language understanding, human intent prediction, and visual scene representation.

Image Generation Motion Planning +1

Risk-Sensitive Sequential Action Control with Multi-Modal Human Trajectory Forecasting for Safe Crowd-Robot Interaction

no code implementations12 Sep 2020 Haruki Nishimura, Boris Ivanovic, Adrien Gaidon, Marco Pavone, Mac Schwager

This paper presents a novel online framework for safe crowd-robot interaction based on risk-sensitive stochastic optimal control, wherein the risk is modeled by the entropic risk measure.

Model Predictive Control Trajectory Forecasting

Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach

no code implementations10 Aug 2020 Boris Ivanovic, Karen Leung, Edward Schmerling, Marco Pavone

Human behavior prediction models enable robots to anticipate how humans may react to their actions, and hence are instrumental to devising safe and proactive robot planning algorithms.

Trajectory Prediction

The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs

1 code implementation ICCV 2019 Boris Ivanovic, Marco Pavone

Developing safe human-robot interaction systems is a necessary step towards the widespread integration of autonomous agents in society.

Decision Making Motion Forecasting +2

BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning

1 code implementation16 Jun 2018 Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone

Our Backward Reachability Curriculum (BaRC) begins policy training from states that require a small number of actions to accomplish the task, and expands the initial state distribution backwards in a dynamically-consistent manner once the policy optimization algorithm demonstrates sufficient performance.

continuous-control Continuous Control +3

Generative Modeling of Multimodal Multi-Human Behavior

1 code implementation6 Mar 2018 Boris Ivanovic, Edward Schmerling, Karen Leung, Marco Pavone

This work presents a methodology for modeling and predicting human behavior in settings with N humans interacting in highly multimodal scenarios (i. e. where there are many possible highly-distinct futures).

Robotics Human-Computer Interaction

Cannot find the paper you are looking for? You can Submit a new open access paper.