Label Smoothing

Label Smoothing is a regularization technique that introduces noise for the labels. This accounts for the fact that datasets may have mistakes in them, so maximizing the likelihood of $\log{p}\left(y\mid{x}\right)$ directly can be harmful. Assume for a small constant $\epsilon$, the training set label $y$ is correct with probability $1-\epsilon$ and incorrect otherwise. Label Smoothing regularizes a model based on a softmax with $k$ output values by replacing the hard $0$ and $1$ classification targets with targets of $\frac{\epsilon}{k-1}$ and $1-\epsilon$ respectively.

Source: Deep Learning, Goodfellow et al

Image Source: When Does Label Smoothing Help?


Paper Code Results Date Stars


Task Papers Share
Language Modelling 46 6.35%
Decoder 26 3.59%
Question Answering 22 3.04%
Large Language Model 21 2.90%
In-Context Learning 19 2.62%
Retrieval 15 2.07%
Decision Making 14 1.93%
Machine Translation 13 1.80%
Benchmarking 12 1.66%


Component Type
🤖 No Components Found You can add them if they exist; e.g. Mask R-CNN uses RoIAlign