Template-Based Automatic Search of Compact Semantic Segmentation Architectures

4 Apr 2019  ·  Vladimir Nekrasov, Chunhua Shen, Ian Reid ·

Automatic search of neural architectures for various vision and natural language tasks is becoming a prominent tool as it allows to discover high-performing structures on any dataset of interest. Nevertheless, on more difficult domains, such as dense per-pixel classification, current automatic approaches are limited in their scope - due to their strong reliance on existing image classifiers they tend to search only for a handful of additional layers with discovered architectures still containing a large number of parameters. In contrast, in this work we propose a novel solution able to find light-weight and accurate segmentation architectures starting from only few blocks of a pre-trained classification network. To this end, we progressively build up a methodology that relies on templates of sets of operations, predicts which template and how many times should be applied at each step, while also generating the connectivity structure and downsampling factors. All these decisions are being made by a recurrent neural network that is rewarded based on the score of the emitted architecture on the holdout set and trained using reinforcement learning. One discovered architecture achieves 63.2% mean IoU on CamVid and 67.8% on CityScapes having only 270K parameters. Pre-trained models and the search code are available at https://github.com/DrSleep/nas-segm-pytorch.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semantic Segmentation CamVid Template-Based NAS-arch0 (480x360 inputs) Mean IoU 63.9% # 13
Semantic Segmentation CamVid Template-Based NAS-arch1 (480x360 inputs) Mean IoU 63.2% # 15
Real-Time Semantic Segmentation Cityscapes test Template-Based-NAS-arch1 mIoU 67.8% # 32
Time (ms) 97 # 22
Frame (fps) 10 # 23
Semantic Segmentation Cityscapes test Template-Based NAS-arch0 Mean IoU (class) 67.7% # 86
Real-Time Semantic Segmentation Cityscapes test Template-Based-NAS-arch0 mIoU 67.7% # 33
Time (ms) 52 # 21
Frame (fps) 19 # 21
Semantic Segmentation Cityscapes test Template-Based NAS-arch1 Mean IoU (class) 67.8% # 85
Semantic Segmentation Cityscapes val Template-Based NAS-arch0 mIoU 68.1% # 77
Semantic Segmentation Cityscapes val Template-Based NAS-arch1 mIoU 69.5% # 74

Methods


No methods listed for this paper. Add relevant methods here