senior · Random Forest
How does Random Forest behave in high-dimensional low-sample-size (HDLSS) regimes?
Updated May 17, 2026
Short answer
Random Forest can overfit in HDLSS settings because a small number of samples covers a high-dimensional feature space only sparsely.
Deep explanation
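In HDLSS problems (p >> n), each tree searches many candidate features against very few samples, so some pure-noise feature is likely to separate the training labels well by chance, and the tree records a spurious split. Feature subsampling at each node reduces this risk, but instability remains because the training data is too small to represent the true distribution. As a result, feature importance rankings become highly unstable, and variance across trees increases.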
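The spurious-split effect can be illustrated with a minimal sketch that uses only the Python standard library. It is not a Random Forest implementation: it searches for the best single threshold split over features that are pure noise, with labels that are independent of every feature. The sample size, feature counts, seed, and the helper names `best_split_accuracy` and `best_accuracy_over_features` are all assumptions chosen for illustration.

```python
import random


def best_split_accuracy(column, y):
    """Best training accuracy of any single threshold split on one feature."""
    n = len(y)
    order = sorted(range(n), key=lambda i: column[i])
    total_pos = sum(y)
    left_pos = 0  # number of label-1 samples on the left side of the split
    best = 0.0
    for k in range(1, n):  # the left side holds the k smallest values
        left_pos += y[order[k - 1]]
        # Correct predictions if we predict 1 on the left and 0 on the right
        correct = left_pos + (n - k) - (total_pos - left_pos)
        # The opposite labelling gets n - correct right; keep the better one
        best = max(best, correct / n, (n - correct) / n)
    return best


def best_accuracy_over_features(columns, y):
    """Best single-split training accuracy over all candidate features."""
    return max(best_split_accuracy(col, y) for col in columns)


rng = random.Random(0)  # fixed seed so the run is repeatable
n, p_max = 20, 2000     # hypothetical HDLSS sizes: 20 samples, up to 2000 features
y = [i % 2 for i in range(n)]  # balanced labels, unrelated to any feature
noise_columns = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(p_max)]

results = {}
for p in (1, 10, 100, 2000):
    # Nested feature sets: the first p noise columns are the candidates
    results[p] = best_accuracy_over_features(noise_columns[:p], y)
    print(f"p={p:5d}  best single-split training accuracy = {results[p]:.2f}")
```

Because the candidate sets are nested, the best training accuracy can only stay the same or rise as p grows, even though no feature carries signal. A forest node searches only a random subset of features, which shrinks this effect but does not remove it when p is far larger than n.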
Real-world example
No real-world example available yet.
Common mistakes
No common mistakes listed yet.
Follow-up questions
No follow-up questions available yet.