Exponential family

from Wikipedia, the free encyclopedia

In probability theory and statistics , an exponential family (or exponential family ) is a class of probability distributions of a very specific form. One chooses this special form in order to take advantage of certain computational advantages or for reasons of generalization. In a certain sense, exponential families are very natural distributions and a dominated distribution class , which brings many simplifications in handling with it. The concept of exponential families goes back to EJG Pitman , G. Darmois , and BO Koopman (1935–6).

One-parameter exponential family

definition

A family of probability measures on the measurement space with is called a one-parameter exponential family if there is a σ-finite measure such that all are a density function of the shape

regarding own. Usually it is

It is

a measurable function , the natural sufficient statistics or canonical statistics of the exponential family. Likewise is

a measurable function. The function

is called the normalization function or normalization constant and guarantees that the normalization required in the definition of a probability measure is given. Furthermore is

any real function of the parameter.

Examples

The binomial distributions on with are an elementary example . They have the probability function (or the density function with regard to the count)

with . Thus the binomial distribution is part of an exponential family and is characterized by

.

Another example is the exponential distributions . They are defined with and have the probability density function

So in this case

.

It should be noted that a one-parameter exponential family can be a multivariate distribution . Single-parameter here only means that the dimensionality of the "shape parameter" is one. Whether the defined probability distribution is univariate or multivariate depends on the dimensionality of the basic space , which is not subject to any requirements.

Alternative definitions

The definitions of an exponential family usually differ in the following points:

  • Not all authors write the functions and as a product in front of the exponential function, sometimes they are also used as a sum in the exponential function, sometimes with a negative sign. This is how the definitions can be found
.
These differently defined functions can usually be converted into one another without any problems. However, when specifying the functions and, it is important to note how precisely they are defined.
  • Some authors provide the density function with a characteristic function with respect to a set . The density function is then given as
.
The choice of the amount should be independent of the parameter . This definition enables certain criteria that are based on the positivity of the density function to be more general. Such criteria can be found, for example, in regular statistical models .

k -parametric exponential family

definition

A family of probability measures on the measurement space with is called a k-parametric exponential family if there is a σ-finite measure such that all the density function

regarding own. The parameter is often written. Are there

measurable functions and

Functions of the k-dimensional parameter . Here, as in the one-parameter case, the function is called the natural sufficient statistics or the canonical statistics.

example

A classic example of a 2-parametric exponential family is the normal distribution . It is as well . Each is then of the form . With the parameterizations and one obtains the normal distribution from the usual density function

.

Thus, the normal distribution is part of a two-parameter exponential family with

.

Here, too, the following applies again: a k-parametric exponential family can definitely describe a probability distribution in only one dimension. The number k only indicates the number of shape parameters, not the dimensionality of the distribution. In the above example, the normal distribution is one-dimensional, but part of a 2-parametric exponential family.

Another example of a 2-parametric exponential family is the gamma distribution .

Alternative definitions

For the k-parametric exponential family, the same variants exist in the definition as were already discussed in the one-parametric case. In addition, some authors require in the definition that the following two properties apply:

  1. The functions are linearly independent
  2. The functions are almost certainly linearly independent for everyone .

With these additional requirements, statements can be made about the covariance matrix of , for example .

The natural parameterization

In both the one-parametric and the k-parametric case, it is said that the exponential family is present in the natural parameterization if is.

properties

Sufficiency

For the exponential family, canonical statistics are always sufficient statistics . This follows directly from the Neyman criterion for sufficiency. Therefore it is also referred to as natural sufficient statistics.

Score function

For a one-parameter exponential family the score function is given by

.

With natural parameterization this is simplified to

.

Fisher information

The Fisher information can be derived from the score function . it is

.

With natural parameterization, the Fisher information thus results

.

Role in statistics

Classic appreciation: sufficiency

According to the Pitman-Koopman-Darmois theorem , among probability families whose support does not depend on the parameters there are sufficient statistics only among the exponential families , the dimensions of which remain limited as the sample size increases. A little more detailed: Let be independently and identically distributed random variables whose probability distribution family is known. Only if this family is an exponential family is there a (possibly vectorial) sufficient statistic whose number of scalar components does not increase if the sample size is increased.

Bayesian estimation: conjugate distributions

Exponential families are also important for Bayesian statistics . In Bayesian statistics, an a priori probability distribution is multiplied by a likelihood function and then normalized in order to arrive at the a posteriori probability distribution (see Bayes' theorem ). If the likelihood belongs to an exponential family, there also exists a family of conjugate a priori distributions, which is often also an exponential family. An a priori conjugate distribution for the parameter of an exponential family is defined by

where and are hyperparameters (parameters that are not estimated, but rather determined within the framework of the model).

In general, the likelihood function does not belong to any exponential family, which is why there is generally no conjugate a priori distribution. The posterior distribution must then be calculated using numerical methods.

Hypothesis tests: equally best test

The one-parametric exponential family is one of the distribution classes with a monotonic density quotient in canonical statistics if is monotonically increasing. Therefore, for the one-sided test problem with

a consistently best test at a given level . An explicit description of the test with sketched derivation from the Neyman-Pearson lemma can be found here .

literature

Individual evidence

  1. Erling Andersen: Sufficiency and Exponential Families for Discrete Sample Spaces . In: Journal of the American Statistical Association . 65, No. 331, September 1970, pp. 1248-1255. doi : 10.2307 / 2284291 .
  2. ^ E. Pitman: Sufficient statistics and intrinsic accuracy . In: Proc. Camb. phil. Soc. . 32, 1936, pp. 567-579.
  3. G. Darmois: Sur les lois de probabilites a estimation exhaustive . In: CR Acad. sci. Paris . 200, 1935, pp. 1265-1266.
  4. ^ B Koopman: On distribution admitting a sufficient statistic . In: Trans. Amer. math. Soc. . 39, 1936, pp. 399-409. doi : 10.2307 / 1989758 .