Fast Neural Scene Flow

Neural Scene Flow Prior (NSFP) is of significant interest to the vision community due to its inherent robustness to out-of-distribution (OOD) effects and its ability to deal with dense lidar point clouds. The approach uses a coordinate neural network to estimate scene flow at runtime, without any training. However, it is up to 100 times slower than current state-of-the-art learning methods. In other applications, such as image, video, and radiance function reconstruction, innovations in speeding up the runtime performance of coordinate networks have centered on architectural changes. In this paper, we demonstrate that scene flow is different: the dominant computational bottleneck stems from the loss function itself (i.e., the Chamfer distance). Further, we rediscover the distance transform (DT) as an efficient, correspondence-free loss function that dramatically speeds up the runtime optimization. Our fast neural scene flow (FNSF) approach reports, for the first time, real-time performance comparable to learning methods, without any training or OOD bias, on two of the largest open autonomous driving (AV) lidar datasets: Waymo Open and Argoverse.
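To make the bottleneck concrete, below is a minimal PyTorch sketch (not the authors' released implementation) contrasting a per-iteration one-way Chamfer loss with a distance-transform (DT) lookup loss: the DT volume is precomputed once per frame pair and then queried by trilinear interpolation, so each optimization step avoids an O(N×M) nearest-neighbour search. Function names, grid extents, and the toy MLP and optimization loop are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn.functional as F


def chamfer_one_way(pred, target):
    # Baseline loss: O(N*M) nearest-neighbour search from every warped source
    # point (N, 3) to the target cloud (M, 3), repeated at every iteration.
    return torch.cdist(pred, target).min(dim=1).values.mean()


def build_dt(target, grid_min, voxel, dims):
    # Precompute, once per frame pair, the distance from each voxel centre to
    # the nearest target point, stored as a (D, H, W) volume.
    grid_min = torch.as_tensor(grid_min, dtype=target.dtype)
    zs, ys, xs = torch.meshgrid(
        torch.arange(dims[0]), torch.arange(dims[1]), torch.arange(dims[2]),
        indexing="ij")
    centres = torch.stack([xs, ys, zs], dim=-1).to(target.dtype) * voxel + grid_min
    dists = torch.cdist(centres.view(-1, 3), target).min(dim=1).values
    return dists.view(*dims)


def dt_loss(points, dt, grid_min, voxel):
    # Fast loss: O(N) trilinear lookups into the precomputed distance field,
    # so no per-iteration nearest-neighbour search is needed.
    grid_min = torch.as_tensor(grid_min, dtype=points.dtype)
    extents = torch.tensor([dt.shape[2], dt.shape[1], dt.shape[0]],
                           dtype=points.dtype)                   # (W, H, D)
    norm = 2.0 * (points - grid_min) / (voxel * (extents - 1)) - 1.0  # map to [-1, 1]
    sampled = F.grid_sample(dt.view(1, 1, *dt.shape),            # (1, 1, D, H, W)
                            norm.view(1, 1, 1, -1, 3),           # (1, 1, 1, N, 3)
                            padding_mode="border", align_corners=True)
    return sampled.mean()


if __name__ == "__main__":
    # Toy runtime optimisation of a small coordinate MLP for one frame pair
    # (no training set); real inputs would be consecutive lidar sweeps.
    torch.manual_seed(0)
    source = torch.rand(2048, 3) * 10 - 5
    target = source + torch.tensor([0.5, 0.0, 0.0])              # synthetic shift
    mlp = torch.nn.Sequential(torch.nn.Linear(3, 128), torch.nn.ReLU(),
                              torch.nn.Linear(128, 3))
    opt = torch.optim.Adam(mlp.parameters(), lr=1e-3)
    dt = build_dt(target, grid_min=(-6.0, -6.0, -6.0), voxel=0.5, dims=(25, 25, 25))
    for _ in range(200):
        opt.zero_grad()
        flow = mlp(source)                                       # per-point 3D flow
        dt_loss(source + flow, dt, grid_min=(-6.0, -6.0, -6.0), voxel=0.5).backward()
        opt.step()
```

The design point is that the expensive nearest-neighbour search moves out of the optimization loop: the DT is built once per frame pair, and every gradient step only performs interpolated lookups, which is where the claimed speedup comes from.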

Published at ICCV 2023.
Task                                    Dataset      Model    Metric                  Metric Value  Global Rank
Self-supervised Scene Flow Estimation   Argoverse 2  FastNSF  EPE 3-Way               0.2384        # 4
Self-supervised Scene Flow Estimation   Argoverse 2  FastNSF  EPE Foreground Dynamic  0.3540        # 4
Self-supervised Scene Flow Estimation   Argoverse 2  FastNSF  EPE Foreground Static   0.0406        # 4
Self-supervised Scene Flow Estimation   Argoverse 2  FastNSF  Dynamic IoU             0.0924        # 1
Scene Flow Estimation                   Argoverse 2  FastNSF  EPE 3-Way               0.1657        # 5
Scene Flow Estimation                   Argoverse 2  FastNSF  EPE Foreground Dynamic  0.3540        # 5
Scene Flow Estimation                   Argoverse 2  FastNSF  EPE Foreground Static   0.0406        # 5
Scene Flow Estimation                   Argoverse 2  FastNSF  Dynamic IoU             0.0924        # 5
