YOLOP: You Only Look Once for Panoptic Driving Perception

25 Aug 2021  ·  Dong Wu, Manwen Liao, Weitian Zhang, Xinggang Wang, Xiang Bai, Wenqing Cheng, Wenyu Liu ·

A panoptic driving perception system is an essential part of autonomous driving. A high-precision and real-time perception system can assist the vehicle in making the reasonable decision while driving. We present a panoptic driving perception network (YOLOP) to perform traffic object detection, drivable area segmentation and lane detection simultaneously. It is composed of one encoder for feature extraction and three decoders to handle the specific tasks. Our model performs extremely well on the challenging BDD100K dataset, achieving state-of-the-art on all three tasks in terms of accuracy and speed. Besides, we verify the effectiveness of our multi-task learning model for joint training via ablative studies. To our best knowledge, this is the first work that can process these three visual perception tasks simultaneously in real-time on an embedded device Jetson TX2(23 FPS) and maintain excellent accuracy. To facilitate further research, the source codes and pre-trained models are released at https://github.com/hustvl/YOLOP.

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Lane Detection BDD100K val YOLOP Accuracy 70.5 # 4
Params (M) 7.9 # 3
IoU (%) 26.2 # 6
Drivable Area Detection BDD100K val YOLOP mIoU 91.5 # 3
Params (M) 7.9 # 3
Traffic Object Detection BDD100K val Faster R-CNN Recall 77.2 # 9
mAP50 55.6 # 9
Speed 5.3 # 5
Traffic Object Detection BDD100K val YOLOv5s Recall 86.8 # 7
mAP50 77.2 # 5
Speed 82 # 1
Traffic Object Detection BDD100K val YOLOP Recall 89.2 # 5
mAP50 76.5 # 6
Speed 41 # 2
Traffic Object Detection BDD100K val DLT-Net Recall 89.4 # 4
mAP50 68.4 # 7
Speed 9.3 # 3
Traffic Object Detection BDD100K val MultiNet Recall 81.3 # 8
mAP50 60.2 # 8
Speed 8.6 # 4

Methods