What is Label Smoothing and how does it improve model generalization?
Updated May 16, 2026
Short answer
Label smoothing is a regularization technique that replaces hard labels with softened probability distributions to prevent overconfidence in neural networks.
Deep explanation
In standard classification, labels are one-hot encoded, meaning the correct class has probability 1 and others 0. This can lead to overconfident models that do not generalize well.
Label smoothing modifies this by distributing a small probability mass to incorrect classes.
Formula:
- y' = (1 - ε) * y + ε / K
Where:
- ε is smoothing factor
- K is number of classes
Why it works:
- Prevents extreme confidence.
- Improves calibration.
- Encourages softer decision boundaries.
Benefits:
- Better generalization.
- Improved robustness to noisy labels.
- Better calibrated probabilities.…
Unlock with a Pro subscription to view this section.
View pricingReal-world example
No real-world example available yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProCommon mistakes
No common mistakes listed yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProFollow-up questions
No follow-up questions available yet.
Unlock with a Pro subscription to view this section.
Upgrade to Pro