Pruning

Spectral-Normalized Identity Priors

Introduced by Lin et al. in Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior

Spectral-Normalized Identity Priors, or SNIP, is a structured pruning approach that penalizes an entire residual module in a Transformer model toward an identity mapping. It is applicable to any structured module, including a single attention head, an entire attention block, or a feed-forward subnetwork. The method identifies and discards unimportant non-linear mappings in the residual connections by applying a thresholding operator on the function norm. Furthermore, spectral normalization to stabilize the distribution of the post-activation values of the Transformer layers, further improving the pruning effectiveness of the proposed methodology.

Source: Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior

Papers


Paper Code Results Date Stars

Categories