Libra R-CNN

Last updated on Feb 23, 2021

Faster R-CNN Libra R-CNN (R-101-FPN, 1x, pytorch)

Memory (M) 6500.0
inference time (s/im) 0.06944
File Size 233.24 MB
Training Data MS COCO
Training Resources 8x NVIDIA V100 GPUs
Training Time

Architecture Softmax, RPN, Balanced L1 Loss, Balanced Feature Pyramid, Convolution, FPN, RoIPool, IoU-Balanced Sampling, ResNet
lr sched 1x
Memory (M) 6500.0
Backbone Layers 101
inference time (s/im) 0.06944
SHOW MORE
SHOW LESS
Faster R-CNN Libra R-CNN (R-50-FPN, 1x, pytorch)

Memory (M) 4600.0
inference time (s/im) 0.05263
File Size 160.54 MB
Training Data MS COCO
Training Resources 8x NVIDIA V100 GPUs
Training Time

Architecture Softmax, RPN, Balanced L1 Loss, Balanced Feature Pyramid, Convolution, FPN, RoIPool, IoU-Balanced Sampling, ResNet
lr sched 1x
Memory (M) 4600.0
Backbone Layers 50
inference time (s/im) 0.05263
SHOW MORE
SHOW LESS
Faster R-CNN Libra R-CNN (X-101-64x4d-FPN, 1x, pytorch)

Memory (M) 10800.0
inference time (s/im) 0.11765
File Size 382.03 MB
Training Data MS COCO
Training Resources 8x NVIDIA V100 GPUs
Training Time

Architecture Softmax, RPN, ResNeXt, Balanced L1 Loss, Balanced Feature Pyramid, Convolution, FPN, RoIPool, IoU-Balanced Sampling
lr sched 1x
Memory (M) 10800.0
Backbone Layers 101
inference time (s/im) 0.11765
SHOW MORE
SHOW LESS
RetinaNet Libra R-CNN (R-50-FPN, 1x, pytorch)

Memory (M) 4200.0
inference time (s/im) 0.0565
File Size 146.11 MB
Training Data MS COCO
Training Resources 8x NVIDIA V100 GPUs
Training Time

Architecture Balanced L1 Loss, Balanced Feature Pyramid, FPN, IoU-Balanced Sampling, ResNet, Focal Loss
lr sched 1x
Memory (M) 4200.0
Backbone Layers 50
inference time (s/im) 0.0565
SHOW MORE
SHOW LESS
README.md

Libra R-CNN: Towards Balanced Learning for Object Detection

Introduction

[ALGORITHM]

We provide config files to reproduce the results in the CVPR 2019 paper Libra R-CNN.

@inproceedings{pang2019libra,
  title={Libra R-CNN: Towards Balanced Learning for Object Detection},
  author={Pang, Jiangmiao and Chen, Kai and Shi, Jianping and Feng, Huajun and Ouyang, Wanli and Dahua Lin},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition},
  year={2019}
}

Results and models

The results on COCO 2017val are shown in the below table. (results on test-dev are usually slightly higher than val)

Architecture Backbone Style Lr schd Mem (GB) Inf time (fps) box AP Config Download
Faster R-CNN R-50-FPN pytorch 1x 4.6 19.0 38.3 config model | log
Fast R-CNN R-50-FPN pytorch 1x
Faster R-CNN R-101-FPN pytorch 1x 6.5 14.4 40.1 config model | log
Faster R-CNN X-101-64x4d-FPN pytorch 1x 10.8 8.5 42.7 config model | log
RetinaNet R-50-FPN pytorch 1x 4.2 17.7 37.6 config model | log

Results

Object Detection on COCO minival

Object Detection on COCO minival
MODEL BOX AP
Faster R-CNN Libra R-CNN (X-101-64x4d-FPN, 1x, pytorch) 42.7
Faster R-CNN Libra R-CNN (R-101-FPN, 1x, pytorch) 40.1
Faster R-CNN Libra R-CNN (R-50-FPN, 1x, pytorch) 38.3
RetinaNet Libra R-CNN (R-50-FPN, 1x, pytorch) 37.6