What is gradient boosting and how is it different from other ensemble methods?

Updated May 17, 2026

Short answer

Gradient boosting builds models sequentially where each new model corrects errors of the previous one using gradients of the loss function.

Deep explanation

Gradient boosting is an ensemble technique that optimizes a loss function by adding weak learners (usually decision trees) in a stage-wise fashion. Each new model is trained to predict the residual errors (negative gradients) of the current ensemble. Unlike bagging, which reduces variance via averaging, boosting reduces bias by focusing on hard-to-learn samples. Algorithms like XGBoost, LightGBM, and CatBoost are optimized implementations with regularization and efficient tree-building strategies.

Real-world example

Search ranking systems improving relevance by iteratively correcting ranking mistakes.

Common mistakes

Using too many estimators without controlling learning rate, leading to overfitting.

Follow-up questions

What is learning rate in boosting?
Why are trees commonly used in boosting?

Short answer

Deep explanation

Real-world example

Common mistakes

Follow-up questions

More Supervised Learning interview questions