Global F test

from Wikipedia, the free encyclopedia

The global F test ( English Overall F test ), also global test , overall test , test for overall significance of a model , F test of overall significance , test for the overall context of a model, represents a global test of the regression function. It is checked whether at least one variable provides an explanatory content for the model and the model is therefore significant as a whole. If this hypothesis is rejected, the model is useless. This variant of the F -test is the most common application of the F -test.

Underlying model

The underlying model is that of linear multiple regression , so

.

It is assumed here that the disturbance variables are independent and homoscedastic and that they follow a normal distribution, i.e. H.

.

Null and alternative hypothesis

The null hypothesis of the global F test states that all explanatory variables have no influence on the dependent variable. Both the dependent variable and the independent variables can be binary ( categorical ) or metric. The Wald test can then test the global null hypothesis (without including the absolute term ):

  against   .

This test can be interpreted as if you were testing the overall quality of the regression, i.e. the population coefficient of determination of the regression. For this reason, the global F test is also known as the goodness of fit test. The term “goodness of fit test” is somewhat misleading because, strictly speaking, it does not check the fit of the regression line to the data, but rather whether at least one of the explanatory variables makes a significant contribution to the explanation. If the null hypothesis applies , the so-called null model results . The null model is a model that consists of only one absolute term .

Test statistics

The test statistics of this test can be obtained by first looking at the R -square notation of the F -statistics . The general form of the F statistic is given by

,

wherein test restrictions and the number of residual sum of the restricted and represents the residual sum of the full model. In the present case, since the null hypothesis is, restrictions are tested. This means that the test statistic can also be written as and under the null hypothesis

,

where represents the multiple coefficient of determination . The test statistic of a global F -test is given by the quotient of the "mean square of the explained deviations" and the " mean square of the residuals ". It is distributed under the null hypothesis F -distributed with and degrees of freedom . The calculation of the F test statistic can be summarized in the following table of the analysis of variance :

Source of variation Squared deviation Number of degrees of freedom Mean square of deviation F test statistics
Regression (explained) ( explained sum of squares )
Residuals (unexplained) ( Residual sum of squares )
total ( total sum of squares )

Procedure and interpretation

If the empirical F -value exceeds the critical F -value (the - quantile of the F -distribution with and degrees of freedom) at a significance level determined a priori , the null hypothesis is rejected:

.

This is then sufficiently large and at least one explanatory variable probably contributes enough information to explain . It makes sense to reject the null hypothesis in the case of high F values, since a high coefficient of determination leads to a high F value. If the Wald test rejects the null hypothesis for one or more independent variables, then one can assume that the associated regression parameters are not equal to zero, so that the variables should be included in the model. If only one independent variable is involved (   vs.   ) then a t test is used to check whether the parameter is significant. For a single parameter, the result of the Wald statistic agrees with the result of the square of the t statistic.

Individual evidence

  1. ^ A b Karl Mosler and Friedrich Schmid: Probability calculation and conclusive statistics. Springer-Verlag, 2011, p. 310.
  2. ^ Ludwig Fahrmeir , Rita artist, Iris Pigeot , Gerhard Tutz : Statistics. The way to data analysis. 8., revised. and additional edition. Springer Spectrum, Berlin / Heidelberg 2016, ISBN 978-3-662-50371-3 , p. 458.
  3. Jeffrey Marc Wooldridge: Introductory econometrics: A modern approach. 5th edition. Nelson Education, 2015, p. 146.
  4. ^ Ludwig Fahrmeir , Rita artist, Iris Pigeot , Gerhard Tutz : Statistics. The way to data analysis. 8., revised. and additional edition. Springer Spectrum, Berlin / Heidelberg 2016, ISBN 978-3-662-50371-3 , p. 458.
  5. ^ William H. Greene: Econometric Analysis. 5th edition. Prentice Hall International, 2002, ISBN 0-13-110849-2 , p. 33.