Mask R-CNN Explained | Papers With Code

**Mask R-CNN** extends [Faster R-CNN](http://paperswithcode.com/method/faster-r-cnn) to solve instance segmentation tasks. It achieves this by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. In principle, Mask R-CNN is an intuitive extension of Faster R-CNN, but constructing the mask branch properly is critical for good results.

Most importantly, Faster R-CNN was not designed for pixel-to-pixel alignment between network inputs and outputs. This is evident in how [RoIPool](http://paperswithcode.com/method/roi-pooling), the *de facto* core operation for attending to instances, performs coarse spatial quantization for feature extraction. To fix the misalignment, Mask R-CNN utilises a simple, quantization-free layer, called [RoIAlign](http://paperswithcode.com/method/roi-align), that faithfully preserves exact spatial locations.
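The effect of quantization can be illustrated with a minimal sketch in plain Python; it is not the reference implementation. It assumes a single-channel feature map stored as a list of rows and an RoI given as `(y1, x1, y2, x2)` in feature-map coordinates. The function names `bilinear`, `roi_align`, and `roi_pool`, the 2x2 output size, and the choice of four averaged sample points per bin are illustrative assumptions. `roi_pool` is a simplified stand-in that snaps the RoI to integer cells before max-pooling, while `roi_align` keeps the real-valued coordinates and reads the feature map by bilinear interpolation.

```python
import math


def bilinear(feature, y, x):
    """Bilinearly interpolate a 2D feature map (list of rows) at real-valued (y, x)."""
    h, w = len(feature), len(feature[0])
    # Clamp so that the four neighbouring cells always exist.
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = math.floor(y), math.floor(x)
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feature[y0][x0] * (1 - dx) + feature[y0][x1] * dx
    bottom = feature[y1][x0] * (1 - dx) + feature[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def roi_align(feature, roi, out_size=2, samples=2):
    """Quantization-free pooling: average bilinear samples inside each bin."""
    y1, x1, y2, x2 = roi
    bin_h = (y2 - y1) / out_size
    bin_w = (x2 - x1) / out_size
    out = []
    for i in range(out_size):
        row = []
        for j in range(out_size):
            total = 0.0
            for sy in range(samples):
                for sx in range(samples):
                    # Regularly spaced sample points; no coordinate is rounded.
                    y = y1 + (i + (sy + 0.5) / samples) * bin_h
                    x = x1 + (j + (sx + 0.5) / samples) * bin_w
                    total += bilinear(feature, y, x)
            row.append(total / (samples * samples))
        out.append(row)
    return out


def roi_pool(feature, roi, out_size=2):
    """Simplified quantized pooling: snap the RoI to integer cells, then max-pool."""
    y1, x1 = math.floor(roi[0]), math.floor(roi[1])
    y2, x2 = math.ceil(roi[2]), math.ceil(roi[3])
    bin_h = (y2 - y1) / out_size
    bin_w = (x2 - x1) / out_size
    out = []
    for i in range(out_size):
        row = []
        for j in range(out_size):
            ys = range(y1 + math.floor(i * bin_h), y1 + math.ceil((i + 1) * bin_h))
            xs = range(x1 + math.floor(j * bin_w), x1 + math.ceil((j + 1) * bin_w))
            row.append(max(feature[y][x] for y in ys for x in xs))
        out.append(row)
    return out


# A feature map whose value equals its column index makes spatial shifts visible.
ramp = [[float(x) for x in range(6)] for _ in range(6)]
roi = (0.0, 1.5, 4.0, 4.5)
aligned = roi_align(ramp, roi)
pooled = roi_pool(ramp, roi)
```

On this ramp, the RoI's two horizontal bins are centred at columns 2.25 and 3.75. The aligned output recovers those values, whereas the quantized version returns values taken from the snapped integer grid.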

Secondly, Mask R-CNN *decouples* mask and class prediction: it predicts a binary mask for each class independently, without competition among classes, and relies on the network's RoI classification branch to predict the category. In contrast, an [FCN](http://paperswithcode.com/method/fcn) usually performs per-pixel multi-class categorization, which couples segmentation and classification.
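The decoupling can be sketched in a few lines of plain Python. This is a hedged illustration, not the authors' code: it assumes the mask branch emits one grid of logits per class, that each pixel is scored with a sigmoid and an averaged binary cross-entropy, and that only the channel of the ground-truth class contributes to the loss. The names `mask_loss` and `predict_mask` and the 0.5 threshold are assumptions made for this example.

```python
import math


def mask_loss(mask_logits, gt_class, gt_mask):
    """Average per-pixel binary cross-entropy on the ground-truth class channel.

    mask_logits[k][i][j] is the logit of class k at pixel (i, j).
    Channels of other classes are ignored, so classes do not compete.
    """
    total, count = 0.0, 0
    for logit_row, target_row in zip(mask_logits[gt_class], gt_mask):
        for z, t in zip(logit_row, target_row):
            # Numerically stable form of -[t*log(s(z)) + (1-t)*log(1-s(z))].
            total += max(z, 0.0) - z * t + math.log1p(math.exp(-abs(z)))
            count += 1
    return total / count


def predict_mask(mask_logits, class_scores, threshold=0.5):
    """Pick the class from the classification branch, then binarize its mask."""
    k = max(range(len(class_scores)), key=lambda c: class_scores[c])
    mask = [
        [1 if 1.0 / (1.0 + math.exp(-z)) >= threshold else 0 for z in row]
        for row in mask_logits[k]
    ]
    return k, mask


# Two classes, 2x2 masks; class 1 is the ground truth.
logits = [
    [[5.0, -5.0], [5.0, -5.0]],   # class 0 channel (ignored by the loss)
    [[-3.0, 3.0], [-3.0, 3.0]],   # class 1 channel
]
target = [[0, 1], [0, 1]]
loss = mask_loss(logits, gt_class=1, gt_mask=target)
label, mask = predict_mask(logits, class_scores=[0.2, 0.8])
```

Because the loss reads only the ground-truth channel, changing the class 0 logits leaves `loss` unchanged. A per-pixel softmax across classes would instead tie every channel's output to the others.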

Collections: Instance Segmentation Models, Object Detection Models

Task	Papers	Share
Instance Segmentation	162	21.51%
Object Detection	128	17.00%
General Classification	15	1.99%
Image Segmentation	13	1.73%
Panoptic Segmentation	13	1.73%
Pose Estimation	13	1.73%
Clustering	9	1.20%
Classification	8	1.06%
Autonomous Driving	8	1.06%

Component	Type
Convolution	Convolutions
RoIAlign	RoI Feature Extractors
RPN	Region Proposal
Softmax	Output Functions