What is label smoothing and why is it used in classification models?

Updated May 17, 2026

Short answer

Label smoothing softens hard labels to prevent overconfident predictions.

Deep explanation

Label smoothing replaces hard one-hot labels (0 and 1) with softened probabilities (e.g., 0.9 and 0.1). This prevents the model from becoming overly confident and improves generalization. It also reduces overfitting and helps neural networks produce better-calibrated probabilities. It is widely used in modern deep learning models like transformers.

Unlock with a Pro subscription to view this section.

View pricing

Real-world example

No real-world example available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Common mistakes

No common mistakes listed yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Follow-up questions

No follow-up questions available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

More Supervised Learning interview questions

View all →