What is Batch Normalization and why is it used?

Updated May 15, 2026

Short answer

Batch Normalization stabilizes and accelerates neural network training.

Deep explanation

It normalizes layer inputs to zero mean and unit variance, then scales and shifts them using learnable parameters. This reduces internal covariate shift and allows higher learning rates.

Real-world example

Used in ResNet and EfficientNet for stable training.

Common mistakes

Using batch norm incorrectly during inference mode.

Follow-up questions

What is internal covariate shift?
Why does batch norm improve generalization?

More Computer Vision interview questions

View all →

What is multi-head feature interaction in advanced vision transformers?senior
What is stochastic depth in deep vision architectures?senior
What is neural implicit surface reconstruction using signed distance functions?senior
What is contrastive vision-language pretraining (CLIP-style models)?senior
What is hypernetwork-based vision modeling?senior
What is adaptive computation time (ACT) in deep vision models?senior
What is neural field compositionality in 3D vision systems?senior
What is Perceiver IO and how does it handle arbitrary input/output modalities in vision systems?senior