What is surrogate splitting in Decision Trees?

Updated May 16, 2026

Short answer

Surrogate splitting is a technique where alternative features are used to replicate the behavior of a primary split when data is missing.

Deep explanation

Surrogate splits are used in algorithms like CART to handle missing values during training and prediction. When the primary splitting feature is missing, the model uses another feature that best mimics the original split's partitioning. These surrogate features are ranked by how well they agree with the original split. This allows decision trees to maintain structure without discarding incomplete data, improving robustness in real-world datasets with missing values.

Unlock with a Pro subscription to view this section.

View pricing

Real-world example

No real-world example available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Common mistakes

No common mistakes listed yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Follow-up questions

No follow-up questions available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

More Decision Trees interview questions

View all →