Perspective transformation for accurate detection of 3D bounding boxes of vehicles in traffic surveillance

Detection and tracking of vehicles captured by traffic surveillance cameras is a key component of intelligent traffic systems. In this paper a novel method of detecting 3D bounding boxes of vehicles is presented. Using the known geometry of the surveilled scene, we propose an algorithm to construct a perspective transformation. The transformation enables us to simplify the problem of detecting 3D bounding boxes to detecting 2D bounding boxes with one additional parameter. We can therefore utilize modified 2D object detectors based on deep convolutional networks to detect 3D bounding boxes of vehicles. Known 3D bounding boxes of vehicles can be utilized to improve results on tasks such as fine-grained vehicle classification or vehicle re-identification. We test the accuracy of our detector by comparing the accuracy of speed measurement on the BrnoCompSpeed dataset with the existing state of the art method. Our method decreases the mean error in speed measurement by 22 % (1.10 km/h to 0.86 km/h) and the median error in speed measurement by 33 % (0.97 km/h to 0.65 km/h mean), while also increasing the recall (83.3 % to 89.3 %).

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Vehicle Speed Estimation BrnoCompSpeed Transform2D Mean Speed Measurement Error (km/h) 0.83 # 2
Median Speed Measurement Error (km/h) 0.60 # 2
95-th Percentile Speed Measurement Error (km/h) 2.04 # 2
Vehicle Speed Estimation BrnoCompSpeed Transform3D Mean Speed Measurement Error (km/h) 0.86 # 3
Median Speed Measurement Error (km/h) 0.65 # 3
95-th Percentile Speed Measurement Error (km/h) 2.17 # 3

Methods


No methods listed for this paper. Add relevant methods here