YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 160 FPS, and has the highest accuracy, 56.8% AP, among all known real-time object detectors running at 30 FPS or higher on a V100 GPU. The YOLOv7-E6 object detector (56 FPS V100, 55.9% AP) outperforms the transformer-based detector SWIN-L Cascade-Mask R-CNN (9.2 FPS A100, 53.9% AP) by 509% in speed and 2% AP in accuracy, and the convolution-based detector ConvNeXt-XL Cascade-Mask R-CNN (8.6 FPS A100, 55.2% AP) by 551% in speed and 0.7% AP in accuracy. YOLOv7 also outperforms YOLOR, YOLOX, Scaled-YOLOv4, YOLOv5, DETR, Deformable DETR, DINO-5scale-R50, ViT-Adapter-B, and many other object detectors in speed and accuracy. Moreover, we train YOLOv7 only on the MS COCO dataset from scratch, without using any other datasets or pre-trained weights. Source code is released at https://github.com/WongKinYiu/yolov7.
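The speed and accuracy margins quoted in the abstract can be reproduced from the FPS and AP figures it gives. The following is a minimal sketch, not code from the paper: the helper name `relative_gain_percent` is hypothetical, and it assumes "by 509% in speed" means the relative increase in FPS over the baseline. Note that, as the abstract states, the YOLOv7-E6 figure was measured on a V100 while the two baselines were measured on an A100.

```python
def relative_gain_percent(new: float, baseline: float) -> float:
    """Percentage by which `new` exceeds `baseline` (hypothetical helper)."""
    return (new - baseline) / baseline * 100.0


# (FPS, AP) pairs exactly as quoted in the abstract.
yolov7_e6 = (56.0, 55.9)
baselines = {
    "SWIN-L Cascade-Mask R-CNN": (9.2, 53.9),
    "ConvNeXt-XL Cascade-Mask R-CNN": (8.6, 55.2),
}

comparisons = {}
for name, (fps, ap) in baselines.items():
    speed_gain = round(relative_gain_percent(yolov7_e6[0], fps))  # relative FPS gain, in percent
    ap_gain = round(yolov7_e6[1] - ap, 1)  # absolute AP difference, in points
    comparisons[name] = (speed_gain, ap_gain)
    print(f"{name}: +{speed_gain}% speed, +{ap_gain} AP")
```

Under that reading, the computed values match the 509% / 2% and 551% / 0.7% figures in the abstract.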

Venue: CVPR 2023

Results from the Paper


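| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| 2D Object Detection | CeyMo | YOLOv7 | mAP | 69.5 | # 2 |
| Object Detection | COCO-O | YOLOv7-E6E | Average mAP | 32.0 | # 15 |
| Object Detection | COCO-O | YOLOv7-E6E | Effective Robustness | 6.42 | # 19 |
| Object Detection | COCO test-dev | YOLOv7 (161 fps) | box mAP | 51.4 | # 75 |
| Object Detection | COCO test-dev | YOLOv7-D6 (44 fps) | box mAP | 56.6 | # 37 |
| Object Detection | COCO test-dev | YOLOv7-E6 (56 fps) | box mAP | 56 | # 42 |
| Object Detection | COCO test-dev | YOLOv7-W6 (84 fps) | box mAP | 54.9 | # 46 |
| Object Detection | COCO test-dev | YOLOv7-X (114 fps) | box mAP | 53.1 | # 63 |
| Real-Time Object Detection | MS COCO | YOLOv7-W6(1280) | FPS (V100, b=1) | 84 | # 11 |
| Real-Time Object Detection | MS COCO | YOLOv7-W6(1280) | box AP | 54.9 | # 8 |
| Real-Time Object Detection | MS COCO | YOLOv7-W6(1280) | FPS | 84 | # 9 |
| Real-Time Object Detection | MS COCO | YOLOv7 | FPS (V100, b=1) | 161 | # 4 |
| Real-Time Object Detection | MS COCO | YOLOv7 | box AP | 51.4 | # 24 |
| Real-Time Object Detection | MS COCO | YOLOv7-E6(1280) | FPS (V100, b=1) | 56 | # 16 |
| Real-Time Object Detection | MS COCO | YOLOv7-E6(1280) | box AP | 56 | # 6 |
| Real-Time Object Detection | MS COCO | YOLOv7-E6(1280) | FPS | 56 | # 13 |
| Real-Time Object Detection | MS COCO | YOLOv7-E6E(1280) | FPS (V100, b=1) | 36 | # 19 |
| Real-Time Object Detection | MS COCO | YOLOv7-E6E(1280) | box AP | 56.8 | # 3 |
| Real-Time Object Detection | MS COCO | YOLOv7-E6E(1280) | FPS | 36 | # 20 |
| Real-Time Object Detection | MS COCO | YOLOv7-D6(1280) | FPS (V100, b=1) | 44 | # 18 |
| Real-Time Object Detection | MS COCO | YOLOv7-D6(1280) | box AP | 56.6 | # 4 |
| Real-Time Object Detection | MS COCO | YOLOv7-D6(1280) | FPS | 44 | # 18 |
| Real-Time Object Detection | MS COCO | YOLOv7-X | FPS (V100, b=1) | 114 | # 8 |
| Real-Time Object Detection | MS COCO | YOLOv7-X | box AP | 53.1 | # 16 |
| Real-Time Object Detection | MS COCO | YOLOv7-X | FPS | 114 | # 6 |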
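The abstract's claim about the most accurate detector at 30 FPS or higher can be checked against the results listed above. This is an illustrative sketch only: the function name `most_accurate_at` is hypothetical, and the tuples are the V100 (batch size 1) FPS and box AP values copied from the listing, not an official export.

```python
# (model, FPS on V100 at batch size 1, box AP) copied from the results above.
results = [
    ("YOLOv7", 161, 51.4),
    ("YOLOv7-X", 114, 53.1),
    ("YOLOv7-W6", 84, 54.9),
    ("YOLOv7-E6", 56, 56.0),
    ("YOLOv7-D6", 44, 56.6),
    ("YOLOv7-E6E", 36, 56.8),
]


def most_accurate_at(min_fps, rows):
    """Return the highest-AP row running at `min_fps` or faster, or None."""
    eligible = [row for row in rows if row[1] >= min_fps]
    return max(eligible, key=lambda row: row[2]) if eligible else None


best_real_time = most_accurate_at(30, results)
print(best_real_time)
```

Raising the threshold (for example `most_accurate_at(100, results)`) shows the speed/accuracy trade-off across the model family: faster variants are selected at the cost of lower AP.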

Methods