Efficient Spatial Pyramid

Introduced by Mehta et al. in ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

An Efficient Spatial Pyramid (ESP) is an image model block based on a factorization principle that decomposes a standard convolution into two steps: (1) point-wise convolutions and (2) a spatial pyramid of dilated convolutions. The point-wise convolutions reduce the computation, while the spatial pyramid of dilated convolutions re-samples the feature maps to learn representations from a large effective receptive field. This makes the block more efficient than other image model blocks such as ResNeXt blocks and Inception modules.
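The saving from this factorization can be illustrated with a small parameter count. The sketch below is a back-of-the-envelope calculation, not the authors' code. It assumes the point-wise step projects M input channels down to d = N/K channels, that K parallel n x n dilated convolutions each map d channels to d channels, and that branch k uses dilation rate 2^(k-1). The function names are hypothetical, and biases and normalization layers are ignored.

```python
def standard_conv_params(m: int, n_out: int, kernel: int) -> int:
    """Weights in a standard kernel x kernel convolution, M -> N channels."""
    return kernel * kernel * m * n_out


def esp_params(m: int, n_out: int, kernel: int, branches: int) -> int:
    """Weights in an ESP-style block under the assumptions stated above."""
    if n_out % branches != 0:
        raise ValueError("this sketch assumes N is divisible by K")
    d = n_out // branches                      # assumed reduced width, d = N / K
    pointwise = m * d                          # 1 x 1 reduction, M -> d
    pyramid = branches * kernel * kernel * d * d  # K dilated convs, d -> d
    return pointwise + pyramid


def effective_kernel_size(kernel: int, branches: int) -> int:
    """Side length covered by the largest branch, assuming dilation 2^(K-1)."""
    largest_dilation = 2 ** (branches - 1)
    return (kernel - 1) * largest_dilation + 1


if __name__ == "__main__":
    m = n_out = 128
    kernel, branches = 3, 4
    std = standard_conv_params(m, n_out, kernel)
    esp = esp_params(m, n_out, kernel, branches)
    print(std)                                      # 147456
    print(esp)                                      # 40960
    print(std / esp)                                # 3.6
    print(effective_kernel_size(kernel, branches))  # 17
```

With these illustrative settings (M = N = 128, n = 3, K = 4), the factorized block uses about 3.6 times fewer weights than a single 3 x 3 convolution while its widest branch covers a 17 x 17 region.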

Source: ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

Tasks

Task	Papers	Share
Speech Recognition	10	18.52%
Automatic Speech Recognition (ASR)	8	14.81%
Semantic Segmentation	5	9.26%
Language Modelling	3	5.56%
Reinforcement Learning (RL)	3	5.56%
Speech Separation	2	3.70%
Real-Time Semantic Segmentation	2	3.70%
Image Generation	1	1.85%
Text Generation	1	1.85%

Components

Component	Type
Dilated Convolution	Convolutions
Hierarchical Feature Fusion	Degridding
Pointwise Convolution	Convolutions

Categories

Image Model Blocks