How does concentration of measure affect neural network training stability?

Updated May 15, 2026

Short answer

It makes gradients and activations more predictable but also more sensitive to small perturbations.

Deep explanation

In high-dimensional neural networks, concentration of measure implies that activations and gradients tend to cluster around expected values. While this stabilizes forward propagation, it also means small parameter changes can have globally consistent effects, increasing sensitivity during optimization. This dual effect explains why normalization layers (BatchNorm, LayerNorm) are critical in deep learning.

Unlock with a Pro subscription to view this section.

View pricing