Context Enhancement Module Explained

Method Name:*

Method Full Name:*

Description with Markdown (optional):

**Context Enhancement Module (CEM)** is a feature extraction module used in object detection (specifically, [ThunderNet](https://paperswithcode.com/method/thundernet)) which aims to  to enlarge the receptive field. The key idea of CEM is to aggregate multi-scale local context information and global context information to generate more discriminative features. In CEM, the feature maps from three scales are merged: $C\_{4}$, $C\_{5}$ and $C\_{glb}$. $C\_{glb}$ is the global context feature vector by applying a [global average pooling](https://paperswithcode.com/method/global-average-pooling) on $C\_{5}$. We then apply a 1 × 1 [convolution](https://paperswithcode.com/method/convolution) on each feature map to squeeze the number of channels to $\alpha \times p \times p = 245$.

Afterwards, $C\_{5}$ is upsampled by 2× and $C\_{glb}$ is broadcast so that the spatial dimensions of the three feature maps are
equal. At last, the three generated feature maps are aggregated. By leveraging both local and global context, CEM effectively enlarges the receptive field and refines the representation ability of the thin feature map. Compared with prior [FPN](https://paperswithcode.com/method/fpn) structures, CEM involves only two 1×1 convolutions and a fc layer.

Code Snippet URL (optional):

Image

Currently: methods/Screen_Shot_2020-06-30_at_11.22.39_PM_npYoIuB.png Clear
Change:

Attached collections:

FEATURE EXTRACTORS

Add:

New collection name:

Top-level area:

Parent collection (if any):

Description (optional):

Task	Papers	Share
Object Detection	4	44.44%
Mixed Reality	2	22.22%
Semantic Segmentation	2	22.22%
Miscellaneous	1	11.11%

Component	Type	Add Remove
1x1 Convolution	Convolutions
Global Average Pooling	Pooling Operations

Context Enhancement Module

Papers

Tasks

Usage Over Time

Components

Categories

Add Remove