midNaïve Bayes
What is the mathematical formulation of Multinomial Naïve Bayes and where is it used?
Updated May 17, 2026
Short answer
Multinomial Naïve Bayes models class probabilities using word counts or frequency features under a multinomial distribution assumption.
Deep explanation
Multinomial Naïve Bayes computes P(C|X) ∝ P(C) * Π P(x_i|C), where x_i represents feature counts. It assumes features are generated from a multinomial distribution conditioned on class labels. It is widely used in text classification where each feature corresponds to word frequency in a document. The model estimates likelihood using normalized counts with smoothing.
Real-world example
Classifying news articles into categories like sports, politics, and tech based on word frequencies.
Common mistakes
- Using raw continuous values instead of count-based features.
Follow-up questions
- Why is Multinomial NB ideal for text data?
- What happens if features are negative?