Skip Connections

Residual Connection

Introduced by He et al. in Deep Residual Learning for Image Recognition

Residual Connections are a type of skip-connection that learn residual functions with reference to the layer inputs, instead of learning unreferenced functions.

Formally, denoting the desired underlying mapping as $\mathcal{H}({x})$, we let the stacked nonlinear layers fit another mapping of $\mathcal{F}({x}):=\mathcal{H}({x})-{x}$. The original mapping is recast into $\mathcal{F}({x})+{x}$.

The intuition is that it is easier to optimize the residual mapping than to optimize the original, unreferenced mapping. To the extreme, if an identity mapping were optimal, it would be easier to push the residual to zero than to fit an identity mapping by a stack of nonlinear layers.

Source: Deep Residual Learning for Image Recognition

Papers


Paper Code Results Date Stars

Tasks


Task Papers Share
Language Modelling 55 7.16%
Retrieval 39 5.08%
Semantic Segmentation 29 3.78%
Question Answering 29 3.78%
Large Language Model 28 3.65%
Object Detection 16 2.08%
Image Segmentation 12 1.56%
Information Retrieval 12 1.56%
Benchmarking 12 1.56%

Components


Component Type
Batch Normalization
Normalization (optional)
ReLU
Activation Functions (optional)

Categories