Goodness of fit

from Wikipedia, the free encyclopedia

The goodness or goodness of fit ( English goodness of fit ) indicates "how well" can explain a lot of observations a valued model. Measures of the goodness of fit allow a statement to be made about the discrepancy between the theoretical values ​​of the random variables examined , which are expected or forecast on the basis of the model, and the values ​​actually measured.

The quality of the adaptation of a model to existing data can be assessed with the help of statistical tests or suitable key figures.

Adjustment measures can be used in the hypothesis test, for example, to test for normality in the residuals , to check whether two samples come from populations with the same distribution or to test whether certain frequencies follow a certain distribution (see also Pearson's Chi Square test ).

Regression analysis

Linear regression

With linear regression, there is the coefficient of determination . The coefficient of determination measures how well the measured values ​​fit a regression model (goodness of fit). It is defined as the proportion of the " explained variation " in the " total variation " and is therefore between:

  • (or ): no linear relationship and
  • (or ): perfect linear relationship.

The closer the coefficient of determination is to the value one, the higher the “specificity” or “quality” of the adjustment. Is , then the “ bestlinear regression model consists only of the intercept while is. The closer the value of the coefficient of determination is, the better the regression line explains the true model . If , then the dependent variable can be fully explained by the linear regression model. The measurement points then clearly lie on the non-horizontal regression line. In this case, there is no stochastic relationship, but a deterministic one.

Adaptation tests

A fit test ( English goodness-of-fit test ) is in the inferential statistics , a nonparametric hypothesis test , the unknown probability distribution of a random variable on (approximate) consequences of a particular distribution model (eg. As often the normal distribution is to check). It is about the hypothesis that a given sample comes from a distribution with a certain distribution function . This is often realized through asymptotic considerations of the empirical distribution function (see also Glivenko-Cantelli theorem ). Well-known adaptation tests are for example:

example

When Pearson chi-square test is the chi-square statistic, known as chi-square sum ( english goodness of fit statistic ) the total divided by the expected frequencies squared differences between the observed and expected frequencies:

= Number of observations of type
= Total number of observations
= Expected frequency of type
= Number of cells in the table

The result can be compared to the chi-square distribution to determine the goodness of fit.

Quality criteria

Various quality criteria have been established in structural equation models:

Individual evidence

  1. Bernd Rönz, Hans G. Strohe (1994), Lexicon Statistics , Gabler Verlag
  2. Lothar Sachs , Jürgen Hedderich: Applied Statistics: Collection of Methods with R. 8., revised. and additional edition. Springer Spectrum, Berlin / Heidelberg 2018, ISBN 978-3-662-56657-2 , p. 470