Semantic Segmentation Models

ENet is a semantic segmentation architecture which utilises a compact encoder-decoder architecture. Some design choices include:

  1. Using the SegNet approach to downsampling y saving indices of elements chosen in max pooling layers, and using them to produce sparse upsampled maps in the decoder.
  2. Early downsampling to optimize the early stages of the network and reduce the cost of processing large input frames. The first two blocks of ENet heavily reduce the input size, and use only a small set of feature maps.
  3. Using PReLUs as an activation function
  4. Using dilated convolutions
  5. Using Spatial Dropout
Source: ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation


Paper Code Results Date Stars