HyperSeg: Patch-wise Hypernetwork for Real-time Semantic Segmentation

CVPR 2021  ·  Yuval Nirkin, Lior Wolf, Tal Hassner ·

We present a novel, real-time, semantic segmentation network in which the encoder both encodes and generates the parameters (weights) of the decoder. Furthermore, to allow maximal adaptivity, the weights at each decoder block vary spatially. For this purpose, we design a new type of hypernetwork, composed of a nested U-Net for drawing higher level context features, a multi-headed weight generating module which generates the weights of each block in the decoder immediately before they are consumed, for efficient memory utilization, and a primary network that is composed of novel dynamic patch-wise convolutions. Despite the usage of less-conventional blocks, our architecture obtains real-time performance. In terms of the runtime vs. accuracy trade-off, we surpass state of the art (SotA) results on popular semantic segmentation benchmarks: PASCAL VOC 2012 (val. set) and real-time semantic segmentation on Cityscapes, and CamVid. The code is available: https://nirkin.com/hyperseg.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Real-Time Semantic Segmentation CamVid HyperSeg-S mIoU 78.4 # 8
Time (ms) 26.3 # 8
Frame (fps) 38.0 # 8
Real-Time Semantic Segmentation CamVid HyperSeg-L mIoU 79.1 # 6
Time (ms) 60.2 # 14
Frame (fps) 16.6 # 13
Real-Time Semantic Segmentation Cityscapes test HyperSeg-M mIoU 75.8% # 10
Time (ms) 27.1 # 16
Frame (fps) 36.9 # 18
Dichotomous Image Segmentation DIS-TE1 HySM max F-Measure 0.695 # 5
weighted F-measure 0.597 # 5
MAE 0.082 # 4
S-Measure 0.761 # 5
E-measure 0.803 # 5
HCE 205 # 5
Dichotomous Image Segmentation DIS-TE2 HySM max F-Measure 0.759 # 5
weighted F-measure 0.667 # 7
MAE 0.085 # 6
S-Measure 0.794 # 5
E-measure 0.832 # 9
HCE 451 # 5
Dichotomous Image Segmentation DIS-TE3 HySM max F-Measure 0.792 # 6
weighted F-measure 0.701 # 7
MAE 0.079 # 6
S-Measure 0.811 # 5
E-measure 0.857 # 8
HCE 887 # 6
Dichotomous Image Segmentation DIS-TE4 HySM max F-Measure 0.782 # 6
weighted F-measure 0.693 # 6
MAE 0.091 # 6
S-Measure 0.802 # 6
E-measure 0.842 # 9
HCE 3331 # 6
Dichotomous Image Segmentation DIS-VD HySM max F-Measure 0.734 # 6
weighted F-measure 0.640 # 8
MAE 0.096 # 8
S-Measure 0.773 # 6
E-measure 0.814 # 9
HCE 1324 # 5
Semantic Segmentation PASCAL VOC 2012 val HyperSeg-L mIoU 80.61% # 9

Methods