Geometric distribution

from Wikipedia, the free encyclopedia
Geometric distribution
Probability distribution
Geometric distribution.PNG
Probability function of the geometric distribution (variant B) for (blue), (green) and (red)
Distribution function
parameter p ∈ (0,1) - single success probability
Expected value (A) or (B)
Variance
Crookedness
Bulge

The geometric distribution is a probability distribution in stochastics that is univariate and belongs to the discrete probability distributions . It is derived from independent Bernoulli experiments and defined in two variants:

option A
the probability distribution of the number of Bernoulli attempts necessary to be successful. This distribution is defined on the set .
Variant B
the probability distribution of the number of failed attempts before the first success. This distribution is defined on the set .

The two variants are related . Which of these is called “geometric distribution” is either determined in advance or the one that is more appropriate is chosen.

The geometric distribution is used:

  • in the analysis of the waiting times until a certain event occurs.
    • in determining the service life of devices and components, d. H. waiting until the first failure
  • when determining the number of frequent events between consecutive rare events such as errors:
    • Determination of the reliability of devices ( MTBF )
    • Determination of Risk in Actuarial Science
    • Determination of the error rate in the data transmission, for example the number of successfully transmitted TCP packets between two packets with retransmission

Definition of the geometric distribution

A discrete random variable or with the parameter (probability of success), (probability of failure) is sufficient for the geometric distribution if:

option A
The following applies to the probability that one needs exactly attempts in order to achieve the first success
Variant B
The following applies to the probability of having failed attempts before the first success

In both cases the values ​​for the probabilities form a geometric sequence .

The geometric distribution thus has the following distribution functions

option A
Variant B

properties

Expected value

The expected values ​​of the two geometric distributions are

option A
Variant B
.

The expectation value can be derived in different ways:

  • .


.
Where is because is the probability function .
  • The expected value can be broken down by case distinction. The first experiment is likely to be successful, i.e. it will be realized with 1. Likely the first experiment is unsuccessful, but the expected value for the number of still following experiments is due to the memoryless again . So it applies
, so .
  • If you carry out experiments, the expected value for the number of successful experiments is . Therefore, the expected interval between two successful experiments (including one successful experiment) is .

Variance

The variances of the two geometric distributions are

.

The derivation can be made via

.

Memory loss

The geometric distribution is a memoryless distribution, i.e. i.e., it applies to

option A

Variant B

So if a geometrically distributed random variable is known to be greater than the value (variant A) or at least has the value (variant B), the probability that it exceeds this value by is just as great as that an identical random variable takes on the value at all .

The lack of memory is a defining quality; the geometric distribution is therefore the only possible memoryless discrete distribution. Its constant counterpart here is the exponential distribution .

Relation to reproductivity

The sum of independent geometrically distributed random variables with the same parameter is not geometrically distributed , but negatively binomially distributed . Thus the family of geometric probability distributions is not reproductive .

Crookedness

The skew results for both variants as follows:

.

Bulge

The curvature can also be shown closed for both variants as

.

That’s the excess

.

mode

option A

In variant A, mode is 1.

Variant B

With variant B the mode is 0.

Median

option A

In variant A, the median is

.

Here is the Gauss bracket . The median is not necessarily unique.

Variant B

Here is the median

.

It doesn't have to be unique either.

entropy

The entropy of both variants is

.

Characteristic function

The characteristic function has the form

option A
.
Variant B
.

Moment generating function

The moment generating function of the geometric distribution is

option A
Variant B
.

Probability generating function

The probability generating function of the geometric distribution is

option A
Variant B
.

Relationships with other distributions

Relationship to the negative binomial distribution

Generalization to several successes
A generalization of the geometric distribution is the negative binomial distribution , which indicates the probability that attempts are necessary for successes or (in an alternative representation) that the -th success occurs after failures have already occurred.

Conversely, the geometric distribution is a negative binomial distribution with . The geometric distribution thus applies to the convolution .

Relationship to the exponential distribution

Convergence of the geometrical distribution
For a sequence of geometrically distributed random variables with parameters hold with a positive constant . Then the sequence for large
converges to an exponentially distributed random variable with parameters .

In analogy to the discrete geometric distribution, the continuous exponential distribution determines the waiting time until the first occurrence of a rare Poisson-distributed event. The exponential distribution is therefore the continuous analogue of the discrete geometric distribution.

Relationship to the composite Poisson distribution

The geometric distribution in variant B arises as a special case of the composite Poisson distribution in combination with the logarithmic distribution . Select and as the parameter . This means that the geometric distribution is also infinitely divisible .

Relationship to the urn model

The geometric distribution can be derived from the urn model if is. Then the geometric distribution arises when pulling with replacement from an urn with balls, of which are marked. It is then the waiting time for the first success.

Random numbers

Random numbers for the geometric distribution are usually generated using the inversion method. This method is particularly suitable for geometric distribution, since the individual probabilities of simple recursion are sufficient. The inversion method can only be carried out here with rational operations ( addition , multiplication ) and without calculating and storing the distribution function beforehand, which guarantees a fast algorithm for the simulation.

Web links