Benchmark for Generic Product Detection: A Low Data Baseline for Dense Object Detection

19 Dec 2019  ·  Srikrishna Varadarajan, Sonaal Kant, Muktabh Mayank Srivastava ·

Object detection in densely packed scenes is a new area where standard object detectors fail to train well. Dense object detectors like RetinaNet trained on large and dense datasets show great performance. We train a standard object detector on a small, normally packed dataset with data augmentation techniques. This dataset is 265 times smaller than the standard dataset, in terms of number of annotations. This low data baseline achieves satisfactory results (mAP=0.56) at standard IoU of 0.5. We also create a varied benchmark for generic SKU product detection by providing full annotations for multiple public datasets. It can be accessed at We hope that this benchmark helps in building robust detectors that perform reliably across different settings in the wild.

PDF Abstract


Results from the Paper

Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Object Detection COCO 2017 retinanet Mean mAP 3153 # 1