Generalized hypergeometric distribution
The multivariate hypergeometric distribution , also called generalized hypergeometric distribution , general hypergeometric distribution or polyhypergeometric distribution , is a multivariate probability distribution and is one of the discrete probability distributions . It is a multivariate generalization of the hypergeometric distribution and can be derived from the urn model .
definition
A random variable with values in is called multivariate hypergeometrically distributed to the parameters with and , if they are the probability function
owns. Then you write or as with the hypergeometric distribution.
Derivation from the urn model
The multivariate hypergeometric distribution can be clearly derived from the urn model. Consider an urn with a total of balls, each of which is colored in a different color. There are balls of the same color . The probability of pulling balls of the same color when pulling them once without replacing them is multivariate hypergeometrically distributed.
properties
Expected value
If the number of balls is the same color , then is the expected value
Variance
The variance is
Covariance
The following applies for the covariance between the number of balls
if .
example
There is an urn with 5 black, 10 white and 15 red balls. The probability of drawing exactly two balls of each color when drawing six times is
- ,
so just under eight percent. It is . This follows, for example, for the expected value of the black balls .
Relationship to other distributions
Relationship to the hypergeometric distribution
The hypergeometric distribution is a special case of the multivariate hypergeometric distribution with and . Please note the different parameterizations here.
Relationship to the multinomial distribution
The multivariate hypergeometric distribution and the multinomial distribution are related because they arise from the same urn model, with the difference that it is covered in the multinomial model. In particular, it can be shown that if and holds such that is, and which define a probability function on , then converges point by point to the multinomial distribution with the parameters and . The multivariate hypergeometric distribution can thus be approximated by the multinomial distribution.
literature
- Achim Klenke: Probability Theory . 3. Edition. Springer-Verlag, Berlin Heidelberg 2013, ISBN 978-3-642-36017-6 , doi : 10.1007 / 978-3-642-36018-3 .
- Hans-Otto Georgii: Stochastics . Introduction to probability theory and statistics. 4th edition. Walter de Gruyter, Berlin 2009, ISBN 978-3-11-021526-7 , doi : 10.1515 / 9783110215274 .