What is surrogate splitting in Decision Trees?
Updated May 16, 2026
Short answer
Surrogate splitting is a technique where alternative features are used to replicate the behavior of a primary split when data is missing.
Deep explanation
Surrogate splits are used in algorithms like CART to handle missing values during training and prediction. When the primary splitting feature is missing, the model uses another feature that best mimics the original split's partitioning. These surrogate features are ranked by how well they agree with the original split. This allows decision trees to maintain structure without discarding incomplete data, improving robustness in real-world datasets with missing values.
Unlock with a Pro subscription to view this section.
View pricingReal-world example
No real-world example available yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProCommon mistakes
No common mistakes listed yet.
Unlock with a Pro subscription to view this section.
Upgrade to ProFollow-up questions
No follow-up questions available yet.
Unlock with a Pro subscription to view this section.
Upgrade to Pro