Cascade R-CNN

Introduced by Cai et al. in Cascade R-CNN: Delving into High Quality Object Detection

Cascade R-CNN is an object detection architecture that seeks to address problems with degrading performance with increased IoU thresholds (due to overfitting during training and inference-time mismatch between IoUs for which detector is optimal and the inputs). It is a multi-stage extension of the R-CNN, where detector stages deeper into the cascade are sequentially more selective against close false positives. The cascade of R-CNN stages are trained sequentially, using the output of one stage to train the next. This is motivated by the observation that the output IoU of a regressor is almost invariably better than the input IoU.

Cascade R-CNN does not aim to mine hard negatives. Instead, by adjusting bounding boxes, each stage aims to find a good set of close false positives for training the next stage. When operating in this manner, a sequence of detectors adapted to increasingly higher IoUs can beat the overfitting problem, and thus be effectively trained. At inference, the same cascade procedure is applied. The progressively improved hypotheses are better matched to the increasing detector quality at each stage.

Source: Cascade R-CNN: Delving into High Quality Object Detection

Read Paper See Code

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Object Detection	21	51.22%
Image Compression	2	4.88%
General Classification	2	4.88%
Long-tailed Object Detection	1	2.44%
Pseudo Label	1	2.44%
Data Visualization	1	2.44%
Document AI	1	2.44%
Data Compression	1	2.44%
Video Instance Segmentation	1	2.44%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
RoIAlign	RoI Feature Extractors	(optional)
RPN	Region Proposal	(optional)

Categories

Add Remove

Object Detection Models