Statistics (function)

from Wikipedia, the free encyclopedia

A statistic is a special mathematical function in the subfield of mathematics of the same name, statistics . Statistics do not differ structurally from estimators such as point estimators , but are distinguished from them because of the fundamentally different tasks for which they are used.

definition

A statistical model and a measuring room are given . Then it is called a measurable function

a statistic. Measurability means that for all of the σ-algebra the archetypes are contained in the σ-algebra .

Comment on the definition

The measurability of a function is guaranteed, for example, if it is continuous from the one mapped in the and the corresponding Borel σ-algebras are selected as σ-algebras . Usually these σ-algebras are chosen by default.

The measurability is necessary in order to be able to define the distribution of the statistics analogous to the procedure for random variables . This means that you can also use expressions like

want to investigate. This is then defined as the image dimension via

.

The measurability guarantees here that the right side is well defined.

Demarcation

Mathematically, the terms "measurable function", "random variable", "(point) estimator" and "statistics" agree. The central point of their common definition is that it enables the construction of distributions and image dimensions.

An important point when distinguishing between estimator and statistics is the use and interpretation of the function. Statistics organize and structure existing information (such as order statistics ) or are aids for the construction of procedures (such as test statistics ).

In contrast to this, the estimation functions evaluate existing data, try to guess a value as well as possible and are subject to certain quality criteria.

The distinction is sometimes difficult. Consider a statistical model that formalizes the tossing of a possibly asymmetrical coin n times, that is

  and   .

Ber stands for the Bernoulli distribution . Then the sample mean is

  defined by  

a point estimator for the parameter . The function

  defined by  

differs from the point estimator only by the prefactor , but can be used as a statistic that reduces the observation depth. It reduces the complete description of the experiment with the order of the throws to the information how often the desired side was thrown.

literature