Randomized test

from Wikipedia, the free encyclopedia

In test theory , a branch of mathematical statistics, a special class of statistical tests is called randomized tests . In contrast to the non-randomized tests, they do not always make an unambiguous yes / no decision, but instead, if certain data occurs, require a lottery to be carried out to determine the decision.

One of the advantages of randomized tests is that they are easier to handle mathematically. This makes it easier to show optimality properties for randomized tests than for non-randomized tests. An example of this is the Neyman-Pearson test , which is randomized to make the most of its level.

definition

A statistical model is given . A test is a statistic

,

which assigns a decision to each observation . The decision is coded with 1 = "Rejection of the null hypothesis " and 0 = "Retention of the null hypothesis"

A test is now called randomized if it not only assumes the values ​​0 and 1, but also values ​​in the interval . The amount

is then called the randomization range of the test and contains all values ​​for which the test does not make a clear decision.

interpretation

If the decision is coded as above with 1 = "Rejection of the null hypothesis" and 0 = "Retention of the null hypothesis", then the values ​​between 0 and 1 are interpreted as probabilities of making a decision to reject the null hypothesis. A value of the randomized testing at the site , that would therefore that in observation of mean with a probability of 50%, the null hypothesis is rejected and is retained with a probability of 50% of the null hypothesis. To determine the decision, a fair coin would have to be tossed, which then decides whether to reject it or keep it. More generally, it means that there is a probability that the null hypothesis will be rejected and there is a probability that the null hypothesis will be retained.

example

Is given as a basic set is provided with the power set as σ-algebra , that is . This set can be provided, for example, with the binomial distribution with and as a probability distribution. This and the exact choice of hypotheses are not relevant for the definition of a randomized test for the time being.

For example, a randomized test would be given by

.

For values ​​less than or equal to five, the null hypothesis is retained, for values ​​greater than or equal to seven, the null hypothesis is rejected, and if the value is six, the null hypothesis is rejected with a probability of . Randomization range would be the six here, so .

If the value six occurs in this test, a fair dice could be thrown. If the number is one, two, three or four, the null hypothesis is rejected, otherwise the null hypothesis is retained.

properties

As already mentioned in the introduction, optimality and existence statements can be derived better for randomized tests. This is mainly because the randomized tests form a convex set . This does not apply to non-randomized tests. For convex sets, there are many far-reaching structural statements about topological properties and the existence of minimal places of functionals . These then enable the corresponding optimality statements to be derived.

supporting documents