How does training dataset size affect bias and variance?

Updated May 15, 2026

Short answer

More training data typically reduces variance but does not significantly change bias.

Deep explanation

Increasing dataset size helps stabilize model learning by reducing sensitivity to noise, which reduces variance. However, bias is mainly determined by model assumptions, not data size. Therefore, even with large datasets, a simple model can still have high bias.

Real-world example

A recommendation system improves stability when trained on millions of user interactions instead of thousands.

Common mistakes

  • Thinking more data always fixes underfitting.

Follow-up questions

  • When does more data stop helping?
  • Does more data increase computation cost?

More Bias & Variance interview questions

View all →