What is Label Smoothing and how does it improve model generalization?

Updated May 16, 2026

Short answer

Label smoothing is a regularization technique that replaces hard labels with softened probability distributions to prevent overconfidence in neural networks.

Deep explanation

In standard classification, labels are one-hot encoded, meaning the correct class has probability 1 and others 0. This can lead to overconfident models that do not generalize well.

Label smoothing modifies this by distributing a small probability mass to incorrect classes.

Formula:

  • y' = (1 - ε) * y + ε / K

Where:

  • ε is smoothing factor
  • K is number of classes

Why it works:

  • Prevents extreme confidence.
  • Improves calibration.
  • Encourages softer decision boundaries.

Benefits:

  • Better generalization.
  • Improved robustness to noisy labels.
  • Better calibrated probabilities.…

Unlock with a Pro subscription to view this section.

View pricing

Real-world example

No real-world example available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Common mistakes

No common mistakes listed yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Follow-up questions

No follow-up questions available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

More Deep Learning interview questions

View all →