Sparse R-CNN is a purely sparse method for object detection in images, without object positional candidates enumerating on all(dense) image grids nor object queries interacting with global(dense) image feature.
As shown in the Figure, object candidates are given with a fixed small set of learnable bounding boxes represented by 4-d coordinate. For the example of the COCO dataset, 100 boxes and 400 parameters are needed in total, rather than the predicted ones from hundreds of thousands of candidates in a Region Proposal Network (RPN). These sparse candidates are used as proposal boxes to extract the feature of Region of Interest (RoI) by RoIPool or RoIAlign.
Source: Sparse R-CNN: End-to-End Object Detection with Learnable ProposalsPaper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Object Detection | 13 | 30.23% |
Object | 10 | 23.26% |
Fire Detection | 1 | 2.33% |
Management | 1 | 2.33% |
Few-Shot Object Detection | 1 | 2.33% |
Decoder | 1 | 2.33% |
Long-tailed Object Detection | 1 | 2.33% |
Pseudo Label | 1 | 2.33% |
Classification | 1 | 2.33% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |