What is prior probability in Naïve Bayes and how is it estimated?

Updated May 17, 2026

Short answer

The prior probability P(C) is the probability of a class before any features are observed. It is usually estimated as the relative frequency of that class in the training data.

Deep explanation

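The prior P(C) is usually estimated as the relative frequency of class C in the training data: P(C) = N_C / N, where N_C is the number of training examples labeled C and N is the total number of training examples. It encodes a baseline belief about the class distribution and enters the classifier through P(C | x) ∝ P(C) · ∏ P(x_i | C). On imbalanced datasets the prior pulls predictions toward the majority class, so it may need reweighting or correction, for example when the class proportions in the training data differ from those expected in deployment.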
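
As a minimal sketch of the frequency estimate, the snippet below counts labels and divides by the total. The function name estimate_priors and the toy label list are assumptions made for illustration, not part of any particular library.

```python
from collections import Counter


def estimate_priors(labels):
    """Return {class: relative frequency} for a list of training labels."""
    if not labels:
        raise ValueError("labels must be non-empty")
    counts = Counter(labels)
    total = len(labels)
    return {cls: n / total for cls, n in counts.items()}


# Toy labels, made up for illustration: 2 spam and 8 ham messages.
labels = ["spam"] * 2 + ["ham"] * 8
priors = estimate_priors(labels)
print(priors)
```

The estimated priors always sum to 1 because every label is counted exactly once.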

Real-world example

In spam detection, if spam makes up 20% of the training emails, then P(spam) = 0.2 and P(not spam) = 0.8.
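
To see how such a prior feeds into a prediction, here is a small sketch that combines it with class-conditional likelihoods via Bayes' rule. The likelihood values are hypothetical numbers chosen for illustration, the class name "ham" stands for non-spam, and the helper name posterior is an assumption.

```python
def posterior(priors, likelihoods):
    """Combine priors P(C) with likelihoods P(x | C) into posteriors P(C | x)."""
    joint = {c: priors[c] * likelihoods[c] for c in priors}
    evidence = sum(joint.values())  # P(x), the normalizing constant
    return {c: j / evidence for c, j in joint.items()}


priors = {"spam": 0.2, "ham": 0.8}

# Hypothetical likelihoods of one email's features under each class.
likelihoods = {"spam": 0.6, "ham": 0.2}

post = posterior(priors, likelihoods)
print(post)
```

With these assumed numbers the features are three times more likely under spam, yet the low spam prior keeps the posterior for spam below 0.5.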

Common mistakes

Ignoring class imbalance, which biases predictions toward the majority class.
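
The effect can be sketched by scoring the same evidence under two sets of priors: one estimated from an imbalanced training set and one set manually to uniform. All numbers and the function name predict are illustrative assumptions; whether uniform priors are the right correction depends on the class proportions expected in deployment.

```python
import math


def predict(priors, likelihoods):
    """Return the class with the highest log P(C) + log P(x | C)."""
    scores = {c: math.log(priors[c]) + math.log(likelihoods[c]) for c in priors}
    return max(scores, key=scores.get)


# Hypothetical likelihoods of one email's features under each class.
likelihoods = {"spam": 0.6, "ham": 0.2}

empirical_priors = {"spam": 0.2, "ham": 0.8}  # from an imbalanced training set
uniform_priors = {"spam": 0.5, "ham": 0.5}    # manual override

print(predict(empirical_priors, likelihoods))
print(predict(uniform_priors, likelihoods))
```

Under the empirical priors the majority class wins even though the features favor spam; with uniform priors the decision follows the likelihoods alone.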

Follow-up questions

How does class imbalance affect Naïve Bayes?
Can priors be manually set?
