How does concentration of measure affect neural network training stability?
Updated May 15, 2026
Short answer
It makes gradients and activations more predictable but also more sensitive to small perturbations.
Deep explanation
In high-dimensional neural networks, concentration of measure implies that activations and gradients tend to cluster around expected values. While this stabilizes forward propagation, it also means small parameter changes can have globally consistent effects, increasing sensitivity during optimization. This dual effect explains why normalization layers (BatchNorm, LayerNorm) are critical in deep learning.
Unlock with a Pro subscription to view this section.
View pricingReal-world example
No real-world example available yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProCommon mistakes
No common mistakes listed yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProFollow-up questions
No follow-up questions available yet.
Unlock with a Pro subscription to view this section.
Upgrade to Pro