Neyman-Pearson lemma

from Wikipedia, the free encyclopedia

The Neyman-Pearson lemma , also called the Neyman-Pearson Fundamentalsemma or the Fundamentalsallemma of mathematical statistics , is a central set of test theory and thus also of mathematical statistics , which makes an optimality statement about the construction of a hypothesis test . The subject of the Neyman-Pearson lemma is the simplest conceivable scenario of a hypothesis test, which is also called the Neyman-Pearson test : Both the null hypothesis and the alternative hypothesis are simple ; That is, they each correspond to a single probability distribution , the associated probability densities of which are referred to below as and . Then, according to the statement of the Neyman-Pearson lemma, the best test is obtained by a decision in which the null hypothesis is rejected if the likelihood quotient falls below a certain value.

The lemma is named after Jerzy Neyman and Egon Pearson , who proved it in 1933.

situation

We are looking for a hypothesis test that is as "good" as possible and that should lead to a decision between null and alternative hypotheses with high reliability. It is assumed that the null and alternative hypotheses each correspond to exactly one probability distribution that applies to the observation results. Under this prerequisite, the probability of a wrong test decision can be calculated exactly for each definition of a rejection area: In detail, these are the two probabilities for an error of the first type and an error of the second type . Therefore, given an upper limit for an error of the first type given by the significance level, the theoretically conceivable test decisions can be compared with one another in a particularly simple manner in qualitative terms.

Formal description of the situation

Realizations of a real random vector with dimension with values ​​in the measurement space are observed . The exact distribution of . The null hypothesis “ ” is to be tested against the alternative “ ” for two probability measures over the given measurement range. The dimensions and have densities or with respect to the Lebesgue measure , i.e. i.e., they are continuous distributions based on .

A decision-making process is now characterized by the definition of a rejection area , with the help of which the null hypothesis is rejected precisely when the observed realization of in lies. This test must not exceed a specified level ,

,

d. In other words, the probability of a false rejection of the null hypothesis, the so-called error of type I , must not be greater than . Of all tests that adhere to this level, the one called the strongest test that maximizes the so-called test strength , i.e. a minimal error of the 2nd type ,

,

the probability that the null hypothesis will not be rejected incorrectly.

formulation

The Neyman-Pearson lemma

In the above situation one considers the extended likelihood quotient for a realization

The case is only defined for the sake of completeness, since it does not occur with any positive probability.

Now a test of the hypothesis “ ” against the alternative “ ” is optimal at a given level (strongest test) if and only if one exists, so that its rejection area meets the requirements

  1. such as
  2. for almost certainly everyone and
  3. for almost certainly everyone

Fulfills. The almost certain properties from 2. and 3. relate to the probability measure , i.e. H. they must almost certainly with respect to and enter.

If a fault area fulfills requirements 1–3., It is also called a Neyman-Pearson area . In discrete models, such a rejection area only exists at certain levels ; in order to fully exploit a given level, randomized tests may have to be used.

special cases

At least the following special cases were not considered by the above lemma:

  • The fault area is the strongest test at the test level ; H. the test has no Type I error. The corresponding test parameter is .
  • The fault area is the strongest test of the level because it has the test strength , i. H. the test has no type 2 error. The corresponding test parameter is .

literature

Individual evidence

  1. ^ Neyman-Pearson Lemma . In: Michiel Hazewinkel (Ed.): Encyclopaedia of Mathematics . Springer-Verlag , Berlin 2002, ISBN 978-1-55608-010-4 (English, online ).

Web links