PSPNet

Introduced by Zhao et al. in Pyramid Scene Parsing Network

PSPNet, or Pyramid Scene Parsing Network, is a semantic segmentation model that utilises a pyramid parsing module that exploits global context information by different-region based context aggregation. The local and global clues together make the final prediction more reliable. We also propose an optimization

Given an input image, PSPNet use a pretrained CNN with the dilated network strategy to extract the feature map. The final feature map size is $1/8$ of the input image. On top of the map, we use the pyramid pooling module to gather context information. Using our 4-level pyramid, the pooling kernels cover the whole, half of, and small portions of the image. They are fused as the global prior. Then we concatenate the prior with the original feature map in the final part of. It is followed by a convolution layer to generate the final prediction map.

Source: Pyramid Scene Parsing Network

Read Paper See Code

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Semantic Segmentation	33	33.33%
Image Segmentation	8	8.08%
Scene Parsing	6	6.06%
Image Classification	5	5.05%
Scene Understanding	5	5.05%
Autonomous Driving	5	5.05%
Instance Segmentation	4	4.04%
Object Detection	3	3.03%
Few-Shot Semantic Segmentation	2	2.02%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
Auxiliary Classifier	Miscellaneous Components
Dilated Convolution	Convolutions
Pyramid Pooling Module	Semantic Segmentation Modules

Categories

Add Remove

Semantic Segmentation Models