Generalized Mean Pooling

Generalized Mean Pooling (GeM) computes the generalized mean of each channel in a tensor. Formally:

$$ \mathbf{e} = \left[\left(\frac{1}{|\Omega|}\sum_{u\in\Omega}x^{p}_{cu}\right)^{\frac{1}{p}}\right]_{c=1,\ldots,C} $$

where $\Omega$ is the set of spatial locations in the feature map, $x_{cu}$ is the activation of channel $c$ at location $u$, $C$ is the number of channels, and $p > 0$ is the pooling exponent. Setting $p > 1$ increases the contrast of the pooled feature map and focuses on the salient features of the image. GeM generalizes both the average pooling commonly used in classification networks ($p = 1$) and spatial max pooling (the limit $p \to \infty$).
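
The formula can be illustrated with a minimal pure-Python sketch. It is not the MultiGrain implementation: the function name `gem_pool`, the `eps` clamp, and the toy feature map are all assumptions made for illustration, and the inputs are assumed to be non-negative (for example, post-ReLU activations) so that fractional powers are well defined.

```python
import math


def gem_pool(channels, p=3.0, eps=1e-6):
    """Generalized mean pooling, one pooled value per channel.

    channels: list of C lists; each inner list holds the activations
    x_cu of one channel over all spatial locations u (flattened).
    p: pooling exponent, p > 0; math.inf selects max pooling.
    eps: hypothetical lower clamp, assuming non-negative inputs.
    """
    if p <= 0:
        raise ValueError("p must be positive")
    pooled = []
    for activations in channels:
        # Clamp so that x ** p and the 1/p root stay well defined.
        clamped = [max(x, eps) for x in activations]
        if math.isinf(p):
            # Limit p -> infinity: the generalized mean becomes the max.
            pooled.append(max(clamped))
            continue
        mean_of_powers = sum(x ** p for x in clamped) / len(clamped)
        pooled.append(mean_of_powers ** (1.0 / p))
    return pooled


# Toy feature map: 2 channels, 4 spatial locations each.
feature_map = [
    [1.0, 2.0, 3.0, 4.0],
    [0.5, 0.5, 0.5, 8.0],
]

avg_like = gem_pool(feature_map, p=1.0)       # same as average pooling
gem_3 = gem_pool(feature_map, p=3.0)          # weights large activations more
max_like = gem_pool(feature_map, p=math.inf)  # same as max pooling
```

For each channel, the pooled value at $p = 3$ lies between the average and the maximum, which is one way to see how larger exponents shift the result toward the most salient activation.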

Source: MultiGrain

Image Source: Eva Mohedano

Tasks

Task	Papers	Share
Image Retrieval	3	23.08%
Retrieval	3	23.08%
Philosophy	1	7.69%
Gait Recognition	1	7.69%
Content-Based Image Retrieval	1	7.69%
Dimensionality Reduction	1	7.69%
Classification	1	7.69%
General Classification	1	7.69%
Image Classification	1	7.69%

Categories

Pooling Operations