SENet pioneered channel attention. The core of SENet is a squeeze-and-excitation (SE) block which is used to collect global information, capture channel-wise relationships and improve representation ability. SE blocks are divided into two parts, a squeeze module and an excitation module. Global spatial information is collected in the squeeze module by global average pooling. The excitation module captures channel-wise relationships and outputs an attention vector by using fully-connected layers and non-linear layers (ReLU and sigmoid). Then, each channel of the input feature is scaled by multiplying the corresponding element in the attention vector. Overall, a squeeze-and-excitation block $F_\text{se}$ (with parameter $\theta$) which takes $X$ as input and outputs $Y$ can be formulated as: \begin{align} s = F_\text{se}(X, \theta) & = \sigma (W_{2} \delta (W_{1}\text{GAP}(X))) \end{align} \begin{align} Y = sX \end{align}
Source: Squeeze-and-Excitation NetworksPaper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Semantic Segmentation | 13 | 5.20% |
Super-Resolution | 12 | 4.80% |
Image Classification | 11 | 4.40% |
Image Super-Resolution | 10 | 4.00% |
Image Restoration | 7 | 2.80% |
Denoising | 7 | 2.80% |
Image Denoising | 7 | 2.80% |
Object Detection | 6 | 2.40% |
Classification | 5 | 2.00% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |