Generalized hypergeometric distribution

from Wikipedia, the free encyclopedia

The multivariate hypergeometric distribution , also called generalized hypergeometric distribution , general hypergeometric distribution or polyhypergeometric distribution , is a multivariate probability distribution and is one of the discrete probability distributions . It is a multivariate generalization of the hypergeometric distribution and can be derived from the urn model .

definition

A random variable with values ​​in is called multivariate hypergeometrically distributed to the parameters with and , if they are the probability function

owns. Then you write or as with the hypergeometric distribution.

Derivation from the urn model

The multivariate hypergeometric distribution can be clearly derived from the urn model. Consider an urn with a total of balls, each of which is colored in a different color. There are balls of the same color . The probability of pulling balls of the same color when pulling them once without replacing them is multivariate hypergeometrically distributed.

properties

Expected value

If the number of balls is the same color , then is the expected value

Variance

The variance is

Covariance

The following applies for the covariance between the number of balls

if .

example

There is an urn with 5 black, 10 white and 15 red balls. The probability of drawing exactly two balls of each color when drawing six times is

,

so just under eight percent. It is . This follows, for example, for the expected value of the black balls .

Relationship to other distributions

Relationship to the hypergeometric distribution

The hypergeometric distribution is a special case of the multivariate hypergeometric distribution with and . Please note the different parameterizations here.

Relationship to the multinomial distribution

The multivariate hypergeometric distribution and the multinomial distribution are related because they arise from the same urn model, with the difference that it is covered in the multinomial model. In particular, it can be shown that if and holds such that is, and which define a probability function on , then converges point by point to the multinomial distribution with the parameters and . The multivariate hypergeometric distribution can thus be approximated by the multinomial distribution.

literature