FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras

Driving requires interacting with road agents and predicting their future behaviour in order to navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras. Our model predicts future instance segmentation and motion of dynamic agents that can be transformed into non-parametric future trajectories. Our approach combines the perception, sensor fusion and prediction components of a traditional autonomous driving stack by estimating bird's-eye-view prediction directly from surround RGB monocular camera inputs. FIERY learns to model the inherent stochastic nature of the future solely from camera driving data in an end-to-end manner, without relying on HD maps, and predicts multimodal future trajectories. We show that our model outperforms previous prediction baselines on the nuScenes and Lyft datasets. The code and trained models are available at https://github.com/wayveai/fiery.

ICCV 2021

Results from the Paper


 Ranked #1 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU veh - 224x480 - No vis filter - 100x50 at 0.25 metric)

Task                                   Dataset       Model           Metric Name                                         Metric Value  Global Rank
Bird's-Eye View Semantic Segmentation  Lyft Level 5  FIERY           IoU vehicle - 224x480 - Long                        36.7          #7
Bird's-Eye View Semantic Segmentation  Lyft Level 5  FIERY           IoU vehicle - 224x480 - Short                       59.4          #7
Bird's-Eye View Semantic Segmentation  nuScenes      FIERY (static)  IoU veh - 224x480 - No vis filter - 100x100 at 0.5  35.8          #6
Bird's-Eye View Semantic Segmentation  nuScenes      FIERY (static)  IoU veh - 224x480 - Vis filter. - 100x100 at 0.5    39.8          #5
Bird's-Eye View Semantic Segmentation  nuScenes      FIERY (static)  IoU ped - 224x480 - Vis filter. - 100x100 at 0.5    17.2          #4
Bird's-Eye View Semantic Segmentation  nuScenes      FIERY           IoU veh - 224x480 - No vis filter - 100x50 at 0.25  41.1          #1
Bird's-Eye View Semantic Segmentation  nuScenes      FIERY           IoU veh - 224x480 - No vis filter - 100x100 at 0.5  38.2          #3
Bird's-Eye View Semantic Segmentation  nuScenes      FIERY           IoU vehicle - Setting 3                             58.5          #1
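
Every metric listed above is an intersection-over-union (IoU) score on a bird's-eye-view grid. The following is a minimal sketch of how such a score can be computed for one binary class mask; it is not the benchmark's evaluation code. It assumes that a label such as "100x100 at 0.5" denotes a 100 m x 100 m area at 0.5 m per cell and that the reported values are percentages. The helper names `grid_shape` and `binary_iou` and the toy masks are hypothetical.

```python
def grid_shape(extent_m, resolution_m):
    """Cells per axis for a BEV grid, e.g. (100, 100) metres at 0.5 m per cell.

    Assumption: metric labels like "100x100 at 0.5" are read as
    extent in metres followed by cell size in metres.
    """
    return tuple(round(side / resolution_m) for side in extent_m)


def binary_iou(pred, target):
    """Intersection-over-union of two same-shaped binary grids (nested lists of 0/1)."""
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            intersection += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Convention assumed here: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Toy 4x4 masks (hypothetical data, not taken from any benchmark).
pred = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
target = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]

print(grid_shape((100, 100), 0.5))   # cells per axis under the assumed reading
print(f"IoU = {100 * binary_iou(pred, target):.1f}")
```

In the toy example the two masks share 2 occupied cells out of 6 occupied in either mask, so the IoU is one third. Under the assumed reading, different extents and resolutions produce grids of different sizes over different areas, which is one reason scores from different settings in the table are not directly comparable.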
