Probability function of the discrete uniform distribution , d. H.
The discrete uniform distribution is a special probability distribution in stochastics . A discrete random variable with a finite number of occurrences has a discrete uniform distribution if the probability is the same for each of its occurrences . It then applies to . The discrete uniform distribution is univariate and, as its name suggests, is one of the discrete probability distributions .
Typically, this probability distribution is used in random experiments , the results of which are equally frequent. If one assumes (with or without justification) that the natural events are equally likely, one speaks of a Laplace experiment. Common examples of Laplace experiments are the Laplace cube (a perfect six-sided cube where any number from one to six has a probability of falling) and the Laplace coin (a perfect coin where either side has a probability of falling). See also continuous uniform distribution , Laplace's formula .
A distinction is made between different cases for discrete equal distribution. These differ in the result sets and, accordingly, differently defined probability functions and distribution functions . In all cases, the uniform distribution is denoted by, where the carrier is.
In the most general case, the results that occur are any with and if is. So the carrier is . The probability function of the discrete uniform distribution is then
and thus it satisfies the distribution function
In particular, unnatural numbers are also permitted here.
On any whole numbers
Probability function for
The associated distribution function
One chooses two with , one selects as a carrier, the amount
and defines the probability function
and the distribution function
On natural numbers up to n
As a special case of the two definitions above (set or ) one chooses as carrier
and receives as a probability function
as well as the distribution function
Here is the rounding function .
The expected value is in the general case
In the second case one obtains
what to do in the third case
simplified. The proof follows the Gaussian sum formula .
The representation of the variance is already confusing for the general case, since no simplifications are possible:
For the second case it results
In the third case
In the second and third case, the discrete probability distribution is symmetrical about its expected value. In the general case, no statement can be made.
For the last two variants, the skewness is equal to zero, in the first case a symmetrical distribution is required in order to be able to deduce the skewness zero.
Bulge and excess
The excess is in the second case
and with that is the bulge
In the third case this is simplified to excess
and to the bulge
The entropy of the discrete uniform distribution is for all three variants
measured in bits .
In the general case, the median of the discretely uniformly distributed random variable coincides with the median of the values :
In the second case is then
and accordingly in the third case
The mode can be specified, but has little informative value. It corresponds exactly to the carrier of the distribution, i.e. , or or .
Probability generating function
If in the second case , the probability generating function is given by
In the third case this then results
Both cases can be shown elementarily by means of the geometric series .
Moment generating function
The torque-generating function results for any as
The characteristic function results for any as
The problem of estimating the parameter for a uniformly distributed random variable is also called the taxi problem . This name arises from the consideration that one stands at the train station and can watch the numbers of the taxis. Assuming that all the numbers are evenly distributed, the taxis correspond to the sampling and the parameters of the total number of taxis in the city. If a discretely evenly distributed sample is out , the maximum likelihood estimator for the parameter is given by
In particular, it is not true to expectations , since it tends to underestimate the real value and never overestimate it, but only asymptotically true to expectations . The introduction of a correction term leads to the estimator
Or you can estimate the mean distance between the values in the sample and get another estimator
This one is unbiased, just like
The taxi problem is a standard example of estimation theory to show that several different estimators for the same problem can be found without problems, of which it is not clear a priori which is better. Variants of the taxi problem were apparently important during World War II in order to draw conclusions about the number of tanks in the opposing army from the serial numbers of shot down tanks. This would correspond to the estimation of , if one assumes that the serial numbers are evenly distributed.
Relationship to other distributions
Relationship to the Bernoulli distribution
The Bernoulli distribution with is a discrete uniform distribution on .
Relationship to the beta binomial distribution
The beta binomial distribution with is a discrete uniform distribution on .
Relationship to the two-point distribution
The two-point distribution for a discrete uniform distribution .
Relationship to the Rademacher distribution
The Rademacher distribution is a discrete uniform distribution on
Relationship to the urn model
The discrete uniform distribution is the basis of all considerations that are made in the urn model , since the pulling of each of the balls from the urn should be equally likely. Depending on how the balls are colored, numbered or put back (or not), the discrete uniform distribution results in a variety of other important distributions such as: B. the binomial distribution , geometric distribution , hypergeometric distribution , negative binomial distribution and multinomial distribution .
Sum of uniformly distributed random variables
The sum of two independent, uniformly distributed random variables is trapezoidal ; if the random variables are also distributed identically, the sum is triangularly distributed .
The discrete uniform distribution can easily be generalized to real intervals or any measurable quantities with positive volume. It is then called a constant uniform distribution .
Six sided Laplace cube
The random experiment is: A die is thrown once. The possible values of the random variables are: . According to the classical concept of probability, the probability is the same for every expression. It then has the probability function
with the expected value for and :
and the variance
Marketing decision problem
An application in practice could be an operations research ( marketing ) problem . A company wants to introduce a new product on the market:
One tries to quantitatively estimate the success of the product. For the sake of simplicity, we assume 5 different quantities sold: 0, 1,000, 5,000, 10,000 and 50,000. Since it is not possible to make a reliable estimate of the probability of the individual sales figures, the same probabilities are used for the sake of simplicity.
You can now start the decision-making process, i. H. Objectify the individual purchase decision , i.e. determine the expected average sales and consider, for example using decision trees , to what extent increased advertising expenditure could increase sales figures.
The discrete uniform distribution is often named after Pierre-Simon Laplace (Laplace cube). However, it has nothing to do with the continuous Laplace distribution .
↑ Ann Largey, John E. Spencer: Estimation of the parameters in the discrete "Taxi" problem, With and Without Replacement . In: The Economic and Social Review . tape 27 , no. 2 , 1996, p. 119-136 ( tara.tcd.ie [PDF]).