What is L1 and L2 regularization in Gradient Descent?

Updated May 16, 2026

Short answer

L1 adds absolute penalty; L2 adds squared penalty to loss function.

L1 regularization encourages sparsity, while L2 discourages large weights. Both modify Gradient Descent updates by adding penalty gradients.

Feature selection in high-dimensional ML problems.