no code implementations • 26 Jul 2024 • Boyi Li, Ligeng Zhu, Ran Tian, Shuhan Tan, Yuxiao Chen, Yao Lu, Yin Cui, Sushant Veer, Max Ehrlich, Jonah Philion, Xinshuo Weng, Fuzhao Xue, Andrew Tao, Ming-Yu Liu, Sanja Fidler, Boris Ivanovic, Trevor Darrell, Jitendra Malik, Song Han, Marco Pavone
Finally, we establish a benchmark for video captioning and introduce a leaderboard, aiming to accelerate advancements in video understanding, captioning, and data alignment.
1 code implementation • 7 Dec 2023 • Jonah Philion, Xue Bin Peng, Sanja Fidler
A longstanding challenge for self-driving development is simulating dynamic driving scenarios seeded from recorded driving logs.
no code implementations • ICCV 2023 • Tzofi Klinghoffer, Jonah Philion, Wenzheng Chen, Or Litany, Zan Gojcic, Jungseock Joo, Ramesh Raskar, Sanja Fidler, Jose M. Alvarez
We introduce a technique for novel view synthesis and use it to transform collected data to the viewpoint of target rigs, allowing us to train BEV segmentation models for diverse target rigs without any additional data collection or labeling cost.
no code implementations • ICCV 2023 • Yanwei Li, Zhiding Yu, Jonah Philion, Anima Anandkumar, Sanja Fidler, Jiaya Jia, Jose Alvarez
In this work, we present an end-to-end framework for camera-based 3D multi-object tracking, called DQTrack.
no code implementations • CVPR 2022 • Rafid Mahmood, James Lucas, David Acuna, Daiqing Li, Jonah Philion, Jose M. Alvarez, Zhiding Yu, Sanja Fidler, Marc T. Law
Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance?
no code implementations • 11 Apr 2022 • Enze Xie, Zhiding Yu, Daquan Zhou, Jonah Philion, Anima Anandkumar, Sanja Fidler, Ping Luo, Jose M. Alvarez
In this paper, we propose M$^2$BEV, a unified framework that jointly performs 3D object detection and map segmentation in the Birds Eye View~(BEV) space with multi-camera image inputs.
no code implementations • CVPR 2022 • Davis Rempe, Jonah Philion, Leonidas J. Guibas, Sanja Fidler, Or Litany
Scenario generation is formulated as an optimization in the latent space of this traffic model, perturbing an initial real-world scene to produce trajectories that collide with a given planner.
no code implementations • NeurIPS 2021 • David Acuna, Jonah Philion, Sanja Fidler
Alternative solutions seek to exploit driving simulators that can generate large amounts of labeled data with a plethora of content variations.
no code implementations • ICCV 2021 • Zian Wang, Jonah Philion, Sanja Fidler, Jan Kautz
In this paper, we propose a unified, learning-based inverse rendering framework that formulates 3D spatially-varying lighting.
no code implementations • CVPR 2021 • Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler
Realistic simulators are critical for training and verifying robotics systems.
1 code implementation • ICLR 2021 • Avik Pal, Jonah Philion, Yuan-Hong Liao, Sanja Fidler
For autonomous vehicles to safely share the road with human drivers, autonomous vehicles must abide by specific "road rules" that human drivers have agreed to follow.
1 code implementation • 19 Oct 2020 • Yiluan Guo, Holger Caesar, Oscar Beijbom, Jonah Philion, Sanja Fidler
A high-performing object detection system plays a crucial role in autonomous driving (AD).
1 code implementation • ECCV 2020 • Jonah Philion, Sanja Fidler
By training on the entire camera rig, we provide evidence that our model is able to learn not only how to represent images but how to fuse predictions from all cameras into a single cohesive representation of the scene while being robust to calibration error.
Ranked #6 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU ped - 224x480 - Vis filter. - 100x100 at 0.5 metric)
Autonomous Vehicles Bird's-Eye View Semantic Segmentation +1
no code implementations • CVPR 2020 • Seung Wook Kim, Yuhao Zhou, Jonah Philion, Antonio Torralba, Sanja Fidler
Simulation is a crucial component of any robotic system.
no code implementations • CVPR 2020 • Jonah Philion, Amlan Kar, Sanja Fidler
The downside of these metrics is that, at worst, they penalize all incorrect detections equally without conditioning on the task or scene, and at best, heuristics need to be chosen to ensure that different mistakes count differently.
no code implementations • CVPR 2019 • Jonah Philion
In this paper, we use lane detection to study modeling and training techniques that yield better performance on real world test drives.