The F distribution or Fisher distribution, also Fisher-Snedecor distribution (after Ronald Aylmer Fisher and George W. Snedecor ), is a continuous probability distribution . An F-distributed random variable is obtained as the quotient of two respectively by the corresponding number of degrees of freedom divided chi-square distributed random variables. The F-distribution has two independent degrees of freedom as parameters and thus forms a two-parameter distribution family .
The F-distribution is often used in a test ( F-test ) to determine whether the difference between two sample variances is due to statistical variability or whether it indicates different populations . In the analysis of variance , an F-statistic is also used to test for significant differences between populations (groups).
definition
A continuous random variable satisfies the F-distribution , with degrees of freedom in the numerator and degrees of freedom in the denominator, if it has the probability density
owns. It is with the gamma function at the place designated.
Historically, the following definition forms the origin of the F-distribution as the distribution of size
where and are independent, chi-squared distributed random variables with degrees of freedom, respectively .
properties
Expected value
The expected value only exists for and then has the value
-
.
Variance
The variance is only defined for and then reads
-
.
Distribution function
The values of the distribution are usually determined numerically and given in a table . A complete tabulation with regard to all degrees of freedom is i. A. not necessary, so that most distribution tables give the quantiles in terms of selected degrees of freedom and probabilities. The relationship is also used here:
where the -quantile of the F-distribution with and means degrees of freedom.
The F-distribution can be expressed in closed form as
where is the regularized incomplete beta function.
maximum
For takes at the point
the maximum.
entropy
The entropy of the F distribution (expressed in nats ) is
where denotes the digamma function .
Relationships with other distributions
In the following, the symbol means “is distributed like”.
Relationship to Beta Distribution
The random variable
is beta distributed with parameters and the following applies:
where and are independent chi-square distributed random variables with and degrees of freedom, respectively .
Relationship to the chi-square distribution
From the independent and chi-square-distributed random variables with or degrees of freedom,
to construct. This random variable is -distributed.
Relationship to the non-central F-distribution
For independent random variables and is
distributed according to the non-central F-distribution with non-centrality parameter . There is a non-central chi-square distribution with non-centrality parameters and degrees of freedom. The central F-distribution results for .
Density of the non-central F distribution
The function is a special hypergeometric function , also called Kummer's function, and represents the density of the central F-distribution given above .
The expectation and variance of the non-central F-distribution are given by
-
With
and
-
With
Both result in the formulas of the central F-distribution.
Relationship to normal distribution
If the independent normally distributed random variables are the parameters
the respective sample variances and are independent, and the following applies :
Therefore the random variable is subject to
an F-distribution with degrees of freedom in the numerator and degrees of freedom in the denominator.
Relationship to Student's t-distribution
If ( Student's t-distribution ) then is
The square of a t-distributed random variable with degrees of freedom follows an F-distribution with and degrees of freedom.
Derivation of the density
The probability density of the F-distribution can be derived (cf. derivation of the density of Student's t-distribution ) from the common density of the two independent random variables and , which are both chi-square distributed.
-
.
With the transformation
we get the common density of and , where and applies.
The Jacobide terminant of this transformation is:
The value is not important because it is multiplied by 0 when calculating the determinant. So the new density function is written
We are now looking for the marginal distribution as an integral over the variable that is not of interest :
Quantile functions
The -quantile of the F-distribution is the solution of the equation and therefore in principle to be calculated using the inverse function. Specifically applies here
with as the inverse of the regularized incomplete beta function. This value can be found in the F-distribution table under the coordinates , and or in the quantile table of the Fisher distribution .
For some values , the quantile functions can be calculated explicitly. The beta integral is solved with where invertible functions occur for a few indices:
The general expressions for higher indices can even be read from the complete row and column. One finds:
See also
literature
- Joachim Hartung, Bärbel Elpelt, Karl-Heinz Klösener: Statistics. 12th edition, Oldenbourg 1999, p. 156 ff., ISBN 3-486-24984-3 .
Web links
Individual evidence
-
^ PR Kinnear, CD Gray (2004): SPSS 12 MADE SIMPLE. Psychology Press. New York. Pp. 208-209.
-
↑ Eric W. Weisstein : Snedecor's F-Distribution . In: MathWorld (English).
-
↑ Frodesen, Skjeggestad, Tofte: Probability and Statistics in Particle Physics. Universitetsforlaget, Bergen - Oslo - Tromsø p. 145 f.
Discrete univariate distributions
Continuous univariate distributions
Multivariate distributions