Analysis of variance

Analysis of variance, VA for short (English: analysis of variance, ANOVA for short), also called scatter analysis or scatter decomposition, refers to a large group of data-analytical and structure-testing statistical methods that allow many different applications.

What they have in common is that they compute variances and test statistics in order to obtain information about the regularities behind the data. The variance of one or more target variables is explained by the influence of one or more influencing variables (factors). The simplest form of analysis of variance tests the influence of a single nominally scaled variable on an interval-scaled variable by comparing the means of the dependent variable within the groups defined by the categories of the independent variable. Thus, the analysis of variance in its simplest form is an alternative to the t-test that is suitable for comparisons between more than two groups. Analysis-of-variance models are usually special linear regression models. The method of analysis of variance goes back essentially to Ronald Aylmer Fisher.

Overview

Basic concepts

The dependent variable is called the target variable :

  • The metric random variable whose values are to be explained by the categorical variables (factors).

The independent variable is called the influencing variable or factor :

  • The categorical variable (= factor) that specifies the groups. Its influence is to be checked; it is nominally scaled .
  • The categories of a factor are then called factor levels . This designation is not identical to that used in factor analysis .

Number of target variables

Depending on whether one or more target variables are present, a distinction is made between two types of analysis of variance:

  • the univariate analysis of variance, abbreviated ANOVA after the English term analysis of variance
  • the multivariate analysis of variance, abbreviated MANOVA after the English term multivariate analysis of variance

Depending on whether one or more factors are present, a distinction is made between one-way (single-factor) and multi-way (multi-factor) analysis of variance.

Number of examination units

In the simplest case, the same number of observations is considered at each factor level. In this case, one also speaks of an orthogonal analysis of variance or a balanced model . Working with and interpreting data whose factor levels contain different numbers of elements (e.g. due to missing values) is more difficult (see unbalanced model ).

Fixed and random effects

A common distinction among analysis-of-variance models is whether the factors have fixed effects ( English fixed factors ) or random effects ( English random factors ). One speaks of fixed effects when the influencing factors occur in a finite number of factor levels, all of which have been observed, or when the statement of interest relates only to these factor levels. One speaks of models with random effects when only a selection of all possible factor levels can be observed (see also linear panel data models ).
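
For illustration, a minimal sketch of how this distinction might look in practice, using Python with statsmodels; the data, column names and grouping variable are hypothetical and not taken from any real study. A fixed factor enters the model as an ordinary categorical regressor, while a random factor is modeled as a random intercept per group.

```python
# Sketch: fixed vs. random factor in statsmodels (hypothetical data and columns).
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
rows = []
for treatment in ["A", "B"]:                       # fixed factor: all levels of interest observed
    for field in ["f1", "f2", "f3", "f4"]:         # random factor: a sample of many possible fields
        effect = 1.0 if treatment == "B" else 0.0
        for value in 5.0 + effect + rng.normal(0, 0.5, size=5):
            rows.append({"response": value, "treatment": treatment, "field": field})
df = pd.DataFrame(rows)

# Fixed-effects view: both factors as categorical regressors.
fixed_model = smf.ols("response ~ C(treatment) + C(field)", data=df).fit()

# Random-effects view: a random intercept per field (mixed model).
mixed_model = smf.mixedlm("response ~ C(treatment)", data=df, groups=df["field"]).fit()

print(fixed_model.summary())
print(mixed_model.summary())
```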

Basic idea

The total variance can be easily broken down into groups if the variability between the factor levels is large, but the variability within them is low.

The procedures investigate whether (and if so, how) the expected values of the metric random variables differ between groups (also called classes ). The test statistics of the method are used to test whether the variance between the groups is greater than the variance within the groups . In this way it can be determined whether the grouping is useful or not, i.e. whether the groups differ significantly or not.

If they differ significantly, it can be assumed that different laws operate in the groups. For example, it can be clarified whether the behavior of a control group is identical to that of an experimental group. If, for example, the variance of one of these two groups has already been traced back to known causes (sources of variance) and the variances are equal, it can be concluded that no new source of variance (e.g. the experimental conditions) was added in the other group.

See also: discriminant analysis , coefficient of determination

Requirements and alternatives

The reliability of the significance tests in the context of the analysis of variance depends on the extent to which their requirements are met. These requirements vary slightly depending on the application; the following generally apply:

  • The error terms (residuals) must be normally distributed.
  • The variances of the groups must be equal ( variance homogeneity , homoscedasticity ).
  • The measured values or groups must be independent of one another.

The check is carried out with tests outside of the analysis of variance proper, which are nowadays included as standard options in statistical programs. The normal distribution of the residuals can be checked with the Shapiro-Wilk test , and homogeneity of variance with the Levene test .

Analyses of variance are considered robust against deviations from the normal-distribution assumption, especially with larger sample sizes (see central limit theorem ). Inhomogeneous variances are a problem with unequal group sizes. In the case of one-way analyses of variance, the Brown-Forsythe test can be used if necessary. A transformation of the dependent variable may also be considered in order to equalize the variances of the groups, for example by taking the logarithm . If the prerequisites are not sufficiently met, distribution-free, nonparametric methods are also available; they are robust but have lower statistical power and test different parameters than the analysis of variance, since they are based on ranks.
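
As a minimal sketch of how these checks might look in practice, the Shapiro-Wilk and Levene tests mentioned above are available in scipy; the group data below are hypothetical illustrative values.

```python
# Sketch: checking ANOVA assumptions with scipy (hypothetical data).
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
group_a = rng.normal(loc=5.0, scale=1.0, size=30)
group_b = rng.normal(loc=5.5, scale=1.0, size=30)
group_c = rng.normal(loc=6.0, scale=1.0, size=30)

# Normality of the residuals: deviations of each value from its group mean.
residuals = np.concatenate([g - g.mean() for g in (group_a, group_b, group_c)])
print("Shapiro-Wilk:", stats.shapiro(residuals))

# Homogeneity of variances across groups (Levene's test).
print("Levene:", stats.levene(group_a, group_b, group_c))

# Brown-Forsythe variant: Levene's test using the median as center.
print("Brown-Forsythe:", stats.levene(group_a, group_b, group_c, center="median"))
```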

Simple analysis of variance

One-way analysis of variance ( English one-way analysis of variance , one-way ANOVA for short), also called single-factor analysis of variance , examines the influence of an independent variable (factor) with k levels (groups) on the characteristics of a random variable. For this purpose, the mean values of the groups are compared with one another; more precisely, the variance between the groups is compared with the variance within the groups. Because the total variance is composed of these two components, it is called analysis of variance. The one-way analysis of variance is the generalization of the t-test to more than two groups. For k = 2 it is equivalent to the t-test.

Requirements

  • The error components must be normally distributed. Error components denote the respective variances (total, treatment and error variance). The validity of this prerequisite also requires a normal distribution of the measured values ​​in the respective population.
  • The error variances between the groups (i.e. the k factor levels) must be equal or homogeneous ( homoscedasticity ).
  • The measured values ​​or factor levels must be independent of one another.

Example

This form of analysis of variance is indicated if, for example, one wants to examine whether smoking has an influence on aggressiveness. Smoking is the independent variable here, which can be divided into three levels ( factor levels): non-smokers , light smokers and heavy smokers . Aggressiveness, recorded by a questionnaire, is the dependent variable. To carry out the investigation, the test subjects are assigned to the three groups. Then the questionnaire measuring aggressiveness is administered.
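
A minimal sketch of such a one-way design in Python follows; the aggressiveness scores are invented illustrative values, not data from any real study.

```python
# Sketch: one-way ANOVA for the smoking/aggressiveness example (hypothetical scores).
from scipy import stats

non_smokers   = [12, 15, 11, 14, 13, 10, 12, 14]
light_smokers = [14, 16, 15, 17, 13, 15, 16, 14]
heavy_smokers = [18, 17, 19, 16, 20, 18, 17, 19]

# H0: all three group means are equal.
f_stat, p_value = stats.f_oneway(non_smokers, light_smokers, heavy_smokers)
print(f"F = {f_stat:.2f}, p = {p_value:.4f}")
# A small p-value (e.g. < 0.05) would indicate that at least two group means differ.
```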

Hypotheses

Let $\mu_i$ be the expected value of the dependent variable in the $i$-th group. The null hypothesis of a one-way analysis of variance is:

$H_0\colon \mu_1 = \mu_2 = \dots = \mu_k$

The alternative hypothesis is:

$H_1\colon \exists\, i, j\colon \mu_i \neq \mu_j$ (at least two group expected values differ)

The null hypothesis therefore states that there is no difference between the expected values of the groups (which correspond to the factor values or factor levels). The alternative hypothesis states that there is a difference between at least two expected values. For example, if we have five factor levels, the alternative hypothesis is supported if at least two of the group means differ. However, three, four or all five expected values can also differ significantly from one another.

If the null hypothesis is rejected, the analysis of variance does not provide any information about how many or between which factor levels there is a difference. We then only know with a certain probability (see level of significance ) that at least two values ​​show a significant difference.

One can now ask whether it would be permissible to carry out individual pairwise comparisons between the mean values using several t-tests. If only two groups (i.e. two mean values) are compared with the analysis of variance, then the t-test and the analysis of variance lead to the same result. However, if there are more than two groups, testing the global null hypothesis of the analysis of variance using pairwise t-tests is not permissible: so-called alpha error accumulation or alpha error inflation occurs. With the help of multiple comparison techniques, after a significant analysis-of-variance result it can be checked between which pairs of mean values the difference or differences lie. Examples of such comparison techniques are the Bonferroni test for the least significant difference and the Scheffé test (see also post-hoc test ). The advantage of these methods is that they take the aspect of alpha error inflation into account.
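
As a minimal sketch of Bonferroni-corrected pairwise comparisons after a significant one-way result, reusing the hypothetical smoking groups from above; the correction is applied by hand here rather than through a dedicated post-hoc routine.

```python
# Sketch: pairwise t-tests with Bonferroni correction (hypothetical data).
from itertools import combinations
from scipy import stats

groups = {
    "non_smokers":   [12, 15, 11, 14, 13, 10, 12, 14],
    "light_smokers": [14, 16, 15, 17, 13, 15, 16, 14],
    "heavy_smokers": [18, 17, 19, 16, 20, 18, 17, 19],
}

pairs = list(combinations(groups, 2))
alpha = 0.05
alpha_corrected = alpha / len(pairs)  # Bonferroni: divide alpha by the number of comparisons

for name_a, name_b in pairs:
    t_stat, p_value = stats.ttest_ind(groups[name_a], groups[name_b])
    significant = p_value < alpha_corrected
    print(f"{name_a} vs {name_b}: t = {t_stat:.2f}, p = {p_value:.4f}, "
          f"significant at corrected alpha {alpha_corrected:.4f}: {significant}")
```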

Basic idea of the calculation

  • When calculating the analysis of variance, one first calculates the total variance observed in all groups. To do this, all the measured values ​​from all groups are summarized and the total mean and the total variance are calculated.
  • Then one would like to determine the variance component of the total variance that is solely due to the factor. If the entire observed variance were due to the factor, then all measured values within a factor level would have to be identical; in that case there would only be differences between the groups. Since all measured values within a group share the same factor value, they would consequently all have to have the same value, because the factor would be the only source generating variance. In practice, however, measured values will also differ within a factor level. These differences within the groups must therefore come from other influences (either chance or so-called interfering variables ).
In order to calculate which part of the variance can be traced back to the levels of the factor, the data are temporarily "idealized", so to speak: the mean value of the respective factor level is assigned to every measured value within that factor level. This makes all values within a factor level equal, and the only differences that remain are those between the factor levels. The variance is now calculated again from this "idealized" data. It characterizes the variance that results from the factor ("variance of the treatments", treatment variance ).
If you divide the variance of the treatments by the total variance, you get the relative proportion of the variance attributable to the factor.
  • There is usually a discrepancy between the total variance and the variance of the treatments - the total variance is greater than the variance of the treatments. The variance that is not due to the factor (the treatment) is called the error variance. This is based either on chance or on other variables that have not been investigated (interfering variables).
The error variance can be calculated by rearranging your data: For each individual measured value, you calculate the deviation from the respective group mean of its factor level. The entire variance is calculated again from this. This then characterizes the error variance.
An important relationship between the components is the additivity of the sums of squares. The sum of squares is the part of the variance formula that appears in the numerator. If the denominator (the number of degrees of freedom ) is omitted when calculating the variance of the treatments, one obtains the treatment sum of squares. The total sum of squares (i.e. the total variance without the denominator) results from the sum of the treatment sum of squares and the residual sum of squares.
  • The final significance test is carried out using an "ordinary" F -test . One can show mathematically that, if the null hypothesis of the analysis of variance is valid, the treatment variance and the error variance must be equal. An F-test can be used to test the null hypothesis that two variances are equal by forming their quotient.
In the case of the analysis of variance, the quotient is formed from the treatment variance divided by the error variance. This quotient is F -distributed with $k - 1$ numerator degrees of freedom and $N - k$ denominator degrees of freedom ($k$ is the number of groups, $N$ is the total number of all test subjects, $n_i$ is the respective number of test subjects per factor level).
In tables of the F distribution one can then look up the corresponding F value with the corresponding degrees of freedom and read off what percentage of the F distribution's density this value "cuts off". For example, if we agree on a significance level of 5% before performing the analysis of variance, then the F value would have to cut off at least 95% of the F distribution on the left. If this is the case, then we have a significant result and can reject the null hypothesis at the 5% level. A small computational sketch of this procedure follows below.
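
To make the procedure concrete, here is a minimal computational sketch with hypothetical values: the treatment, error and total sums of squares are computed by hand, the F statistic is formed from the mean squares, and the result is compared with scipy's built-in one-way ANOVA.

```python
# Sketch: manual variance decomposition for a one-way ANOVA (hypothetical data).
import numpy as np
from scipy import stats

groups = [
    np.array([12.0, 15, 11, 14, 13, 10, 12, 14]),
    np.array([14.0, 16, 15, 17, 13, 15, 16, 14]),
    np.array([18.0, 17, 19, 16, 20, 18, 17, 19]),
]

all_values = np.concatenate(groups)
grand_mean = all_values.mean()
k = len(groups)          # number of factor levels
N = all_values.size      # total number of observations

# Treatment sum of squares: deviations of the group means from the grand mean.
sq_treatment = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
# Error (residual) sum of squares: deviations of the values from their group mean.
sq_error = sum(((g - g.mean()) ** 2).sum() for g in groups)
# Total sum of squares: deviations of all values from the grand mean.
sq_total = ((all_values - grand_mean) ** 2).sum()
assert np.isclose(sq_total, sq_treatment + sq_error)  # additivity of the sums of squares

# Mean squares and F statistic.
mq_treatment = sq_treatment / (k - 1)
mq_error = sq_error / (N - k)
f_stat = mq_treatment / mq_error
p_value = stats.f.sf(f_stat, k - 1, N - k)  # right-tail probability of the F distribution

print(f"manual: F = {f_stat:.3f}, p = {p_value:.4f}")
print("scipy :", stats.f_oneway(*groups))
```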

Mathematical model

The one-way analysis of variance regards each measured value $y_{ij}$ (the $j$-th observation at factor level $i$, with $i = 1, \dots, k$ and $j = 1, \dots, n_i$) as the sum of a "component independent of the factor effect" $\mu$, the "factor effect" $\alpha_i$ and the experimental error $\varepsilon_{ij}$. Each measured value can thus be generated by the following data-generating process

$y_{ij} = \mu_i + \varepsilon_{ij} = \mu + \alpha_i + \varepsilon_{ij}$.

The second equality results from the fact that the fixed mean value $\mu_i$ that depends on the factor level (the mean of $y_{ij}$ under test condition $i$) can be split into a component $\mu$ that is independent of the factor effect and the factor effect $\alpha_i$ itself. It therefore holds that

$\mu_i = \mu + \alpha_i$.

The experimental error $\varepsilon_{ij}$ is assumed to be normally distributed at every factor level and for every repetition, with an expected value of zero and a homoscedastic unknown error variance $\sigma^2$ (independent of the factor level). This assumption can be interpreted as meaning that the experimental errors balance out on average and that the variability is the same in all groups. It is further assumed that the experimental errors of different repetitions are independent. In summary, one writes for the experimental error: $\varepsilon_{ij} \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0, \sigma^2)$. The goal is to estimate the model parameters $\mu$, $\alpha_i$ and $\sigma^2$ statistically, i.e. to find point estimators $\hat{\mu}$, $\hat{\alpha}_i$ and $\hat{\sigma}^2$. With the help of a so-called analysis of variance table, the $i$-th factor level mean

$\bar{y}_{i\cdot} = \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij}$

and the $i$-th factor level variance

$s_i^2 = \frac{1}{n_i - 1} \sum_{j=1}^{n_i} (y_{ij} - \bar{y}_{i\cdot})^2$

can be calculated. The overall mean represents the mean of the factor level means, weighted with the numbers of cases $n_i$:

$\bar{y} = \frac{1}{N} \sum_{i=1}^{k} n_i \bar{y}_{i\cdot}$,

where $N = \sum_{i=1}^{k} n_i$ represents the total size of the samples over all factor levels. The global expected value or the global mean ( English grand mean ) $\mu$ is set equal to the weighted mean of the level expected values $\mu_i$:

$\mu = \frac{1}{N} \sum_{i=1}^{k} n_i \mu_i$.

An additional condition on the model parameters in order to ensure the identifiability of the regression model is the so-called reparameterization condition , with which a new parameterization is carried out. In the one-way analysis of variance it is

$\sum_{i=1}^{k} n_i \alpha_i = 0$,

i.e. the sum of the factor effects, weighted with the numbers of cases, equals zero. In this case one speaks of effect coding. Using the reparameterization condition, the effects can be estimated uniquely. The global mean $\mu$ is estimated by the overall mean $\bar{y}$, the parameter $\mu_i$ is estimated by the factor level mean $\bar{y}_{i\cdot}$, and the factor effect $\alpha_i$ is estimated by the deviation $\hat{\alpha}_i = \bar{y}_{i\cdot} - \bar{y}$. The respective deviation between the measured value and the estimated value (the residual ) is given by

$\hat{\varepsilon}_{ij} = y_{ij} - \bar{y}_{i\cdot}$.

The residual is thus the deviation of the measured value from the level mean and is an expression of the random variation of the variable at level $i$ of the factor. It can be viewed as a realization of the experimental error $\varepsilon_{ij}$ in the $j$-th repetition at the $i$-th factor level. Each realization of the target variable $y_{ij}$ is made up of the overall mean, the estimated factor effect and the residual:

$y_{ij} = \bar{y} + (\bar{y}_{i\cdot} - \bar{y}) + (y_{ij} - \bar{y}_{i\cdot})$.
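
A minimal sketch of these estimators in Python follows; the data are hypothetical, and the group sizes are deliberately unequal to show the case-weighted overall mean.

```python
# Sketch: point estimators of the one-way ANOVA model (hypothetical data).
import numpy as np

groups = [
    np.array([12.0, 15, 11, 14, 13]),          # factor level 1
    np.array([14.0, 16, 15, 17, 13, 15, 16]),  # factor level 2
    np.array([18.0, 17, 19, 16]),              # factor level 3
]

n_i = np.array([len(g) for g in groups])
level_means = np.array([g.mean() for g in groups])        # estimates of mu_i
grand_mean = (n_i * level_means).sum() / n_i.sum()         # estimate of mu (case-weighted)
effects = level_means - grand_mean                         # estimates of alpha_i
residuals = [g - m for g, m in zip(groups, level_means)]   # estimates of epsilon_ij

# Reparameterization condition: the case-weighted effects sum to zero.
assert np.isclose((n_i * effects).sum(), 0.0)

# Unbiased estimate of the error variance sigma^2 (pooled within-group variance).
N, k = n_i.sum(), len(groups)
sigma2_hat = sum((r ** 2).sum() for r in residuals) / (N - k)
print(level_means, grand_mean, effects, sigma2_hat)
```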

Sums of squares

The "total sum of squares" or " total sum of squares " just SQT ( S umme the Q uadrate the T otalen deviations ), can be broken down into two parts. One part relates to group membership and the other part, the rest, is attributed to chance. The first part, d. H. The “sum of squares due to factor A”, SQA for short , can be expressed as the sum of the squares of the deviations of the mean values from the total mean of the groups. By the regression "unexplained square sum" and the residual sum of squares , short SQR ( S umme the Q uadrate the R estabweichungen (or "residuals")) , which relates to the differences within the groups is expressed as the total deviation from the Mean values ​​in the groups. The following applies:

.

Where:

,
,

and

.

The two sums of squares $SQA$ and $SQR$ are stochastically independent. In the case of groups of equal size, it can be shown that under the null hypothesis the following holds:

$\frac{SQA}{\sigma^2} \sim \chi^2_{k-1}$, i.e. the sum of squares $SQA$ follows a chi-square distribution with $k - 1$ degrees of freedom,

and

$\frac{SQR}{\sigma^2} \sim \chi^2_{N-k}$, i.e. the sum of squares $SQR$ follows a chi-square distribution with $N - k$ degrees of freedom.

Test variable

One usually also defines the " mean squares of deviations" (often incorrectly called mean sums of squares ):

$MQA = \frac{SQA}{k - 1}$,

and

$MQR = \frac{SQR}{N - k}$.

The test variable or the F statistic can thus be defined as follows:

$F = \frac{MQA}{MQR}$.

In the case of groups of equal size, $F$ is, under the null hypothesis, F -distributed with $k - 1$ degrees of freedom in the numerator and $N - k$ degrees of freedom in the denominator.

If the test statistic is significant, at least two groups differ from one another. Post-hoc tests can then be used to determine between which groups the differences lie.

Sample calculation

In the following example, a simple analysis of variance with two groups (also known as a two-sample F-test ) is carried out. In one experiment, two groups of animals each receive a different diet. After a certain time, the weight gain of each animal in Group 1 and Group 2 is measured.

The aim is to investigate whether the different diets have a significant influence on weight. The mean value $\bar{y}_i$ and the variance (here the estimated value, i.e. the empirical variance ) $s_i^2$ of each of the two groups are calculated. From these, the treatment sum of squares $SQA$ and the residual sum of squares $SQR$ can be calculated.
The underlying probability model assumes that the weights of the animals are normally distributed and have the same variance in each group. The null hypothesis to be tested is

$H_0\colon \mu_1 = \mu_2$ : "The mean values of the two groups are equal"

Obviously, the sample means $\bar{y}_1$ and $\bar{y}_2$ differ. However, this deviation could also lie within the range of natural fluctuation. To check whether the difference is significant, the test statistic $F = MQA / MQR$ is calculated.

According to the underlying model, the quantity $F$ is a random variable with an $F_{k-1,\,N-k}$ distribution, where $k$ is the number of groups (factor levels) and $N$ the number of measured values. The indices are called degrees of freedom . The value of the F distribution for given degrees of freedom (the F quantile ) can be looked up in an F table. A desired level of significance (the probability of error) must also be specified. In the present case, the F quantile for a type I error of 5% is 4.41. This means that the null hypothesis cannot be rejected for values of the test statistic up to 4.41. Since the computed test statistic exceeds 4.41, the null hypothesis can be rejected with the present values.

It can therefore be assumed that the animals in the two groups really differ in average weight. The probability of accepting a difference even though none exists is less than 5%.
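
Since the original measured values are not reproduced above, here is an analogous sketch with invented weight gains for two groups; it computes the two-sample F statistic via one-way ANOVA and also illustrates the equivalence with the t-test mentioned earlier ($F = t^2$ for two groups).

```python
# Sketch: two-group analysis of variance vs. t-test (hypothetical weight gains).
from scipy import stats

group_1 = [5.1, 6.0, 5.4, 6.3, 5.8, 5.5, 6.1, 5.7]
group_2 = [6.4, 7.1, 6.8, 7.4, 6.9, 7.2, 6.6, 7.0]

f_stat, p_anova = stats.f_oneway(group_1, group_2)
t_stat, p_ttest = stats.ttest_ind(group_1, group_2)

print(f"ANOVA : F = {f_stat:.3f}, p = {p_anova:.4f}")
print(f"t-test: t = {t_stat:.3f}, t^2 = {t_stat**2:.3f}, p = {p_ttest:.4f}")
# For two groups, F equals t squared and both p-values coincide.
```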

Two-way analysis of variance

The two-way analysis of variance ( English two-way analysis of variance , two-way ANOVA for short), also called two-factor analysis of variance , takes two factors (factor A and factor B) into account in order to explain the target variable.

Example

This form of analysis of variance is indicated, for example, in studies that want to show the influence of smoking and coffee drinking on nervousness. Smoking is factor A here, which can be divided into, e.g., three levels (factor levels): non-smokers , light smokers and chain smokers . Factor B can be the amount of coffee consumed daily, with the following levels: 0 cups, 1-3 cups, 4-8 cups, more than 8 cups. Nervousness is the dependent variable. To carry out the investigation, test persons are divided into 12 groups according to the combinations of factor levels. The measurement of nervousness provides metric data.

Basic idea of the calculation

The model (for the case with fixed effects) in effect representation is:

$y_{ijk} = \mu + \alpha_i + \beta_j + (\alpha\beta)_{ij} + \varepsilon_{ijk}, \qquad i = 1, \dots, I, \quad j = 1, \dots, J, \quad k = 1, \dots, K.$

In it are:

$y_{ijk}$ : target variable; assumed to be normally distributed in the groups
$I$ : number of factor levels of the first factor (A)
$J$ : number of factor levels of the second factor (B)
$K$ : number of observations per factor level combination (here the same for all combinations of factor levels)
$\alpha_i$ : effect of the $i$-th factor level of factor A
$\beta_j$ : effect of the $j$-th factor level of factor B
$(\alpha\beta)_{ij}$ : interaction of the factors at the factor level combination $(i, j)$.

The interaction describes a special effect that only occurs when the factor level combination $(i, j)$ is present.

$\varepsilon_{ijk}$ : disturbance variables, independent and normally distributed with expected value $0$ and equal variances $\sigma^2$.


The total sum of squares $SQT$ is broken down into four independent sums of squares (sum-of-squares decomposition):

$SQT = SQA + SQB + SQAB + SQR$

In it are:

$SQT$ : the total sum of squares,
$SQR$ : the residual sum of squares,
$SQAB$ : the sum of squares due to the interaction of A and B,
$SQA$ : the sum of squares due to factor A,
$SQB$ : the sum of squares due to factor B.

The expected values of the sums of squares are:

$E(SQR) = IJ(K - 1)\,\sigma^2$
$E(SQA) = (I - 1)\,\sigma^2 + JK \sum_{i} \alpha_i^2$
$E(SQB) = (J - 1)\,\sigma^2 + IK \sum_{j} \beta_j^2$
$E(SQAB) = (I - 1)(J - 1)\,\sigma^2 + K \sum_{i}\sum_{j} (\alpha\beta)_{ij}^2$

The sums of squares divided by $\sigma^2$ are, under appropriate assumptions, chi-square distributed, namely:

$SQR / \sigma^2$ with $IJ(K - 1)$ degrees of freedom,
$SQA / \sigma^2$ with $I - 1$ degrees of freedom if $\alpha_i = 0$ for all $i$,
$SQB / \sigma^2$ with $J - 1$ degrees of freedom if $\beta_j = 0$ for all $j$,
$SQAB / \sigma^2$ with $(I - 1)(J - 1)$ degrees of freedom if $(\alpha\beta)_{ij} = 0$ for all $i, j$.

The mean squares of deviation result from dividing the sums of squares by their degrees of freedom:

$MQA = \frac{SQA}{I - 1}, \quad MQB = \frac{SQB}{J - 1}, \quad MQAB = \frac{SQAB}{(I - 1)(J - 1)}, \quad MQR = \frac{SQR}{IJ(K - 1)}.$

The relevant test statistics are calculated as the quotients of the mean squares, with $MQR$ in the denominator:

$F_A = \frac{MQA}{MQR}, \quad F_B = \frac{MQB}{MQR}, \quad F_{AB} = \frac{MQAB}{MQR}.$

One now calculates the mean squares for the individual factors and for the interaction of A and B. The hypothesis to be tested is: there is no interaction, i.e. $(\alpha\beta)_{ij} = 0$ for all $i, j$. Again, the hypothesis is tested using a test statistic, here $F_{AB}$, which is the quotient of the mean square of the interaction of A and B and the residual mean square. One then compares $F_{AB}$ with the F quantile after specifying a desired level of significance. If the test statistic is greater than the quantile (which can be read from the relevant tables), then the null hypothesis is rejected, i.e. there is an interaction between the factors A and B. A minimal code sketch of such a two-way analysis follows below.
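
As a minimal sketch of the two-way design, one can fit the model with an interaction term and obtain the ANOVA table directly; the nervousness scores and group sizes below are invented for illustration, and statsmodels is assumed to be available.

```python
# Sketch: two-way ANOVA with interaction for the smoking/coffee example (hypothetical data).
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from statsmodels.stats.anova import anova_lm

rng = np.random.default_rng(1)
smoking_levels = ["non", "light", "chain"]
coffee_levels = ["0", "1-3", "4-8", ">8"]

rows = []
for s in smoking_levels:
    for c in coffee_levels:
        # Invented cell means plus noise; 5 subjects per factor level combination.
        base = 10 + 2 * smoking_levels.index(s) + 1.5 * coffee_levels.index(c)
        for value in base + rng.normal(0, 1.0, size=5):
            rows.append({"nervousness": value, "smoking": s, "coffee": c})
df = pd.DataFrame(rows)

# 'smoking * coffee' expands to the main effects of A and B plus their interaction.
model = smf.ols("nervousness ~ C(smoking) * C(coffee)", data=df).fit()
print(anova_lm(model, typ=2))  # sums of squares, degrees of freedom, F statistics, p-values
```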

Analysis of variance table

In a practical analysis, the results are summarized in the table of the analysis of variance:

Source of variation | Sum of squares of deviation ( SQ ) | Number of degrees of freedom ( FG ) | Mean square of deviation ( MQ ) | F statistic ( F )
Factor A | $SQA$ | $I - 1$ | $MQA = \frac{SQA}{I-1}$ | $F_A = \frac{MQA}{MQR}$
Factor B | $SQB$ | $J - 1$ | $MQB = \frac{SQB}{J-1}$ | $F_B = \frac{MQB}{MQR}$
Interaction | $SQAB$ | $(I - 1)(J - 1)$ | $MQAB = \frac{SQAB}{(I-1)(J-1)}$ | $F_{AB} = \frac{MQAB}{MQR}$
Residual | $SQR$ | $IJ(K - 1)$ | $MQR = \frac{SQR}{IJ(K-1)}$ |
Total | $SQT$ | $IJK - 1$ | |

Multiple analysis of variance (more than two factors)

Several factors are also possible. This type of analysis of variance is known as multiple analysis of variance or multi-factorial analysis of variance . However, the amount of data required to estimate the model parameters increases sharply with the number of factors. Representations of the model (e.g. in tables) also become less clear as the number of factors increases. Models with more than three factors can only be represented with difficulty.

See also

Literature
