The multinomial distribution or polynomial distribution is a probability distribution in stochastics . It is a discrete probability distribution and can be understood as a multivariate generalization of the binomial distribution . In Bayesian statistics, it has the Dirichlet distribution as conjugate a priori distribution .
Definition and model
Be and with . Then the counting density of the multinomial distribution is given by

![p_ {1}, \ dotsc, p_ {k} \ in [0,1]](https://wikimedia.org/api/rest_v1/media/math/render/svg/32b7968c2a4789ebc3d6c43e281f537ae1229b5a)


- 
 . .
Here is the multinomial coefficient .

Application and motivation
The multinomial distribution can be motivated based on an urn model with replacement. In an urn there are sorts of balls. The proportion of varieties balls in the urn is . One ball is removed from the urn , its property (type) is noted and the ball is then returned to the urn.



One is now interested in the number of spheres of each type in this sample. Since the multinomial distribution follows, the sample has the probability:




- 
 . .
Taking an urn containing varieties balls, each with a ball per variety, we get the classic dice: You throw these times, it is possible outputs and interested in how big is the probability that straight times occurs just times and so on. Furthermore, the respective descriptions describe the probabilities of the faces of the cube and thus whether it is a fair or an unfair cube.








properties
Expected value
For each the random variable is binomially distributed with the parameters and , thus has the expected value

 
 

 
Variance
The following applies to
 the variance
- 
 . .
Covariance and Correlation Coefficient
The covariance of two random variables and with is calculated as



- 
 , ,
and for the correlation coefficient (according to Pearson) it follows:
- 
 . .
Probability generating function
The multivariate probability generating function is
 
example
There are 31 pupils in a school class, 12 from village A, 11 from village B and 8 from village C. Every day a pupil is drawn to wipe the blackboard. What is the probability that no student from village A, two students from village B and 3 students from village C have to wipe the blackboard in one week? It is and , since every student should be drawn equally likely. Then

 
Relationship to other distributions
Relationship to the binomial distribution
In the special case , the binomial distribution results , more precisely is the joint distribution of and for a -distributed random variable .






Relationship to the multivariate hypergeometric distribution
The multinomial distribution and the multivariate hypergeometric distribution are related because they come from the same urn model. The only difference is that the multivariate hypergeometric distribution draws without replacement. The multivariate hypergeometric distribution can be approximated under certain circumstances by the multinomial distribution, see the article on the multivariate hypergeometric distribution.
literature
- 
Ulrich Krengel : Introduction to probability theory and statistics . 8th edition, Vieweg, 2005. ISBN 978-3-834-80063-3
- Hans-Otto Georgii: Stochastics: Introduction to Probability Theory and Statistics , 4th edition, de Gruyter, 2009. ISBN 978-3-110-21526-7
- Christian Hesse: Applied probability theory : a well-founded introduction with over 500 realistic examples and tasks, Vieweg, Braunschweig / Wiesbaden 2003, ISBN 978-3-528-03183-1 .
Discrete univariate distributions
 
Continuous univariate distributions
 
Multivariate distributions