Marginal distribution

from Wikipedia, the free encyclopedia
Representation of the standard deviation ellipse of a two-dimensional normal distribution , as well as the two marginal distributions (red and blue).

As marginal distributions or marginal distribution in the are stochastic , the probability distributions of sub-families of a given family of random variables , respectively. The distribution of the entire family is also called the joint distribution of the random variables for clarity . For example, if and are random variables (on the same probability space ), then the distributions of the individual variables and the marginal distributions of the random vector are called .

Marginal distributions can be calculated for both discrete and continuous features. As with distributions in general, one differentiates accordingly:

In addition, the marginal distribution can be formed for both absolute frequencies and relative frequencies . The individual values ​​of the marginal distribution are then called marginal frequencies (also marginal frequencies or marginal frequencies). The marginal frequencies for categorically subdivided (distinct) features can be read off the margin of a contingency table . Here they are the sums of the frequencies across the neglected feature.

Example using contingency tables

Marginal distributions of discrete features can be represented in contingency tables. At the edge of this table, the edge frequencies, which together form the edge distribution, can be read off as sums of the neglected feature.

For example, a contingency table with absolute frequencies can be seen here. The same would also be conceivable with relative frequencies.

man woman Marginal frequencies
grade 10 10 10 20th
11th grade 4th 16 20th
Marginal frequencies 14th 26th 40

The marginal frequency of being in class 10, neglecting whether one is male or female, is 20. The corresponding marginal frequency for class 11 is also 20. The marginal distribution is evenly distributed because there are the same number of students in both classes. The attribute class is distinct, that is, divided into clearly defined categories.

However, there are also characteristics that are not divided into categories, such as height. These features are continuous because there are smooth transitions between all possible manifestations of the feature. Such characteristics cannot be represented in tables. In order to still enable the representation in a contingency table, it is possible to divide the characteristic into classes (here, categories are meant) by defining so-called class boundaries. One could divide the constant characteristic of height by setting 142 cm as the class boundary and dividing people into people taller than 142 cm and not taller than 142 cm. For these groups, which are divided into classes, class frequencies can now be measured again , which are entered in a contingency table. Since a person who is in one class (> 142) cannot be in another class (≤ 142) at the same time, one speaks of a division into disjoint sets.

definition

A higher-dimensional random variable with a multivariate distribution as a probability measure is given . Then it's called the distribution

the i-th marginal distribution or the i-th marginal distribution of . Alternatively, it is also defined as

.

In the two-dimensional with would be the first marginal distribution

.

More generally, marginal distributions can also be defined for each subset . If , they are called m-dimensional marginal distributions. They are then defined by

For

.

Elementary properties

Derived terms

Edge distribution function

If the distribution function has , an edge distribution function can also be specified as a distribution function of the edge distributions. For the one-dimensional marginal distributions it is defined as

.

All components except for the i-th are therefore set to infinity. One proceeds analogously for the m-dimensional boundary distribution functions. All components in are retained, all others are set to infinity. The first marginal distribution function then results for the two-dimensional case with

.

Edge density

Likewise, probability densities can also be specified for marginal distributions , which are called marginal densities . Those are the functions for which

applies. Has a common density , the edge density can also be called

define. For m-dimensional edge densities one proceeds analogously, one then integrates over all components that are not contained in. In the two-dimensional with , one then obtains by means of integration over the respective other component as edge densities

Edge probability function

Just as edge densities, edge probability functions can also be specified. Essentially, only the integration is replaced by the summation. Has a common probability function , the i-th edge probability function is given as

.

The m-dimensional marginal distributions are also obtained by summation if the components of interest are not also summed. In two-dimensional with it then results

.

Example multinomial distribution

In two dimensions

Be an example two-dimensional multinomialverteilt so . So has the probability function

.

Here is the multinomial coefficient . If you place it , it results directly

.

The probability function can therefore be represented independently of . Accordingly, the edge density of , which arises by adding up over all, is again exactly the probability function of , but without as a variable. So it is

,

the marginal distribution of the multionomial distribution is thus a binomial distribution with the parameters and .

In several dimensions

Let and -dimensional multinomial distribution, i.e. with . The probability function is then

.

To calculate the first marginal distribution one now adds up over all . To simplify the calculation, group and . With the help of the multinomial theorem it then follows that the marginal distribution is again binomially distributed with the parameters and .

Related concepts

Often, on the one hand, marginal distributions are to be generated with a special distribution . On the other hand, the common distribution of random variables and their dependencies should be shown correctly. Such multivariate distributions are not only described by the marginal distributions and the correlation , rather the dependency often has to be described and modeled more precisely. For example, when modeling bond returns, one might expect that the spread between the two returns also remains within a plausible range. Therefore, when modeling multivariate distributions, it is often necessary or useful to model the marginal distributions and their interdependence separately. This is done by calibrating a copula .

Using the marginal distributions, the conditional distribution can be determined from a multivariate distribution . It models that there is already knowledge about the value of a random variable.

literature

Individual evidence

  1. P. Heinz Müller (Ed.): Probability calculation and mathematical statistics. Lexicon of Stochastics. Akademie-Verlag, Berlin 1980, p. 116 and p. 124.