What is gradient boosting and how does it differ from AdaBoost?
Updated May 16, 2026
Short answer
Gradient boosting optimizes a loss function using gradients, while AdaBoost reweights misclassified samples.
Deep explanation
AdaBoost adjusts sample weights based on classification errors, forcing subsequent models to focus on difficult cases. Gradient boosting, on the other hand, builds models that fit the negative gradient of a loss function, making it more flexible and general. Gradient boosting supports arbitrary differentiable loss functions, making it more powerful and widely used in modern systems like XGBoost and LightGBM.
Real-world example
XGBoost is widely used in ranking and fraud detection systems.
Common mistakes
- Assuming AdaBoost and Gradient Boosting are the same algorithm.
Follow-up questions
- Why is gradient boosting more flexible?
- What is residual learning?