Central fluctuation interval

from Wikipedia, the free encyclopedia

The central fluctuation interval is a term from mathematical statistics . It says something about the precision of the position estimation of a parameter (for example a mean value). The fluctuation interval includes a range around the true value of the parameter in the population, which - to put it simply - contains the parameter estimated from the sample with a previously determined certainty probability.

idea

An estimation function is a random variable for an unknown true parameters of a population . Hence it has a distribution and we can give intervals with respect to the realization with the probability .

In other words, if we take a sample of the values , we can calculate an estimate and, with a given probability, specify an interval in which we expect the estimate .

The central fluctuation intervals have a disadvantage: the interval limits contain the unknown parameter (in contrast to the confidence interval ). Nevertheless, the central fluctuation interval provides valuable information, namely the size of the deviation of a parameter estimated from the sample from the true parameter.

parameter condition Central fluctuation interval
, known
, unknown
arbitrarily distributed, ( known)

( unknown)

, known
, unknown
Bernoulli distributed with parameters or.

Are there

  • the probability of security,
  • , and the quantiles of the standard normal , t and chi-square distributions with degrees of freedom,
  • the corrected sample variance and
  • the estimated proportional value from the sample.

Formal definition

The central fluctuation interval for an estimator is the interval for which applies or , i.e.

.

The central fluctuation interval can, but does not have to, be symmetrical about the unknown parameter. The values or depend

  • on the distribution type of the estimator (see , ) and
  • the variance of the estimator :
.

Special central fluctuation intervals

For the population mean

The estimator is used for the unknown mean of the population . There are two cases for the distribution :

  1. , then applies (reproductive property of normal distribution) or
  2. (arbitrarily distributed) and fulfills the requirements of the central limit value theorem , then applies .

This results in three fluctuation intervals:

1a. known , then and
1b. unknown , then and
2. It applies and
.

The values or are the - quantiles of the standard normal distribution or of the Student t-distribution with degrees of freedom.

For the population variance

If the sample variables are distributed, then there are two different possible estimators for the variance :

  1. If is known , then it arises .
  2. If is unknown , then it arises .

In the first case it is distributed and the central fluctuation interval is

and in the second case it is distributed and the central fluctuation interval is given by

.

The values are the - quantiles of the chi-square distribution with degrees of freedom.

In both cases the central fluctuation interval is not symmetrical .

For the proportional value of the population

A dichotomous random variable number of successes in draws with replacement is binomially distributed depending on the unknown probability of success . When the approximation conditions are met, the distribution is normal and so is the estimator . The central fluctuation interval is therefore given by

.

For the practical calculations you can either estimate with . Alternatively, you can replace with , and is the proportional value from the sample.

Examples

Example 1 : If we want to estimate the mean length of study in semesters of students precisely with a certainty probability , then this means that the central fluctuation interval must not deviate from the true value by more than semesters. The length of the central fluctuation interval must therefore be semester.

For the mean study duration, it is not known whether it is normally distributed, i.e. H. it follows

,

d. H. Depending on ( ), a sample size can be determined in order to achieve this accuracy:

.

With the semester, 1537 students have to be surveyed, if the semester is, then 6147 students would be needed. In this example only the position, but not the width of the central fluctuation interval, depends on the true parameter.

Example 2 : In election surveys, around 1000 eligible voters are usually asked. With what accuracy, given a probability of certainty of, can an election researcher predict the outcome of a party?

The length of the central fluctuation interval is

,

and with , results in a length of . I.e. there is a 95% probability that the proportional value from the random sample will deviate from the true proportional value at most . With a true proportional value of , the central fluctuation interval is thus ; This great inaccuracy is one of the reasons why the press / polling institutes rarely mention the accuracy of forecasts.

Central fluctuation interval and confidence interval

Derivation

The confidence intervals are derived directly from the central fluctuation intervals :

  • Subtract from

  • Subtract from

  • Multiplication of

And that gives the confidence interval.

differences

The following table sums up some of the differences between the central fluctuation interval and the confidence interval.

Central fluctuation interval Confidence interval
Limits Are the same for every sample, i.e. fixed values Change with every sample, so they are random variables
location Includes the unknown population parameter Includes the estimated parameter of the sample
interpretation Indicates the probability with which the parameter estimated from the sample is included in the interval Indicates what proportion of the estimation intervals contain the true parameter