What is label smoothing and why is it used in classification models?
Updated May 17, 2026
Short answer
Label smoothing softens hard labels to prevent overconfident predictions.
Deep explanation
Label smoothing replaces hard one-hot labels (0 and 1) with softened probabilities (e.g., 0.9 and 0.1). This prevents the model from becoming overly confident and improves generalization. It also reduces overfitting and helps neural networks produce better-calibrated probabilities. It is widely used in modern deep learning models like transformers.
Unlock with a Pro subscription to view this section.
View pricingReal-world example
No real-world example available yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProCommon mistakes
No common mistakes listed yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProFollow-up questions
No follow-up questions available yet.
Unlock with a Pro subscription to view this section.
Upgrade to Pro