What is Label Smoothing and how does it improve model generalization?

Updated May 16, 2026

Short answer

Label smoothing is a regularization technique that replaces hard labels with softened probability distributions to prevent overconfidence in neural networks.

Deep explanation

In standard classification, labels are one-hot encoded, meaning the correct class has probability 1 and others 0. This can lead to overconfident models that do not generalize well.

Label smoothing modifies this by distributing a small probability mass to incorrect classes.

Formula:

y' = (1 - ε) * y + ε / K

Where:

ε is smoothing factor
K is number of classes

Why it works:

Prevents extreme confidence.
Improves calibration.
Encourages softer decision boundaries.

Benefits:

Better generalization.
Improved robustness to noisy labels.
Better calibrated probabilities.…

Unlock with a Pro subscription to view this section.

View pricing

Real-world example

No real-world example available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Common mistakes

No common mistakes listed yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Follow-up questions

No follow-up questions available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Short answer

Deep explanation

Real-world example

Common mistakes

Follow-up questions

More Deep Learning interview questions