How does Random Forest behave in high-dimensional low-sample-size (HDLSS) regimes?

Updated May 17, 2026

Short answer

Random Forest may overfit in HDLSS settings due to sparse sampling of feature space.

Deep explanation

In HDLSS problems (p >> n), each tree may find spurious splits due to high dimensionality. Although feature subsampling reduces risk, instability remains because training data is insufficient to represent true distributions. Feature importance becomes highly unstable, and variance across trees increases.

Unlock with a Pro subscription to view this section.

View pricing

Real-world example

No real-world example available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Common mistakes

No common mistakes listed yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

Follow-up questions

No follow-up questions available yet.

Unlock with a Pro subscription to view this section.

Upgrade to Pro

More Random Forest interview questions

View all →