General test

from Wikipedia, the free encyclopedia
The articles general test and decision-making function overlap thematically. Help me to better differentiate or merge the articles (→  instructions ) . To do this, take part in the relevant redundancy discussion . Please remove this module only after the redundancy has been completely processed and do not forget to include the relevant entry on the redundancy discussion page{{ Done | 1 = ~~~~}}to mark. Universalamateur ( discussion ) 09:21, Oct. 11, 2018 (CEST)

A general test or decision-making process is an abstract instrument of mathematical statistics . Almost all statistical tests, such as hypothesis tests or parameter point estimates , can be mathematically recorded in the form of a general test. The aim of a general test is, on the basis of the (observed) realization of one or more previously defined random variables , whose exact probability distribution i. d. As a rule, it is not known to make a decision with regard to an issue under consideration.

Example: A pharmaceutical company wants to test a newly developed drug for its (unknown) effectiveness. A certain number of patients are given the drug for this purpose. Based on the measured effect of the drug on the patient, the pharmaceutical company now has to decide whether to launch the new drug on the market or whether to continue to use a tried and tested drug.

If the pharmaceutical company decides to launch the new drug, there is a risk that the decision-making process used will only incorrectly classify it as better than the old drug. In this case, the pharmaceutical company would suffer unnecessary damage. In order to avoid this, every general test is based on a so-called damage function, with the help of which one tries to minimize the risk of a decision by choosing a "suitable" decision function .

definition

Given a measuring room and a family of probability measures on . includes all possible realizations or observations. Next is a lot of possible decisions.

  • A mapping is called the damage function .
  • A mapping is called a general test , decision function or decision procedure if and only if the mapping is just - measurable for each . Here denotes the Borel σ-algebra over .

Quality criteria

risk

Let it be a class of decision functions. For an element one denotes

fortune

as a risk function . This indicates the average damage caused by the application of the test under the distribution . Because of this, this always exists, but possibly improperly. Next is called

than the risk of .

It has now further an algebra over and a probability at given defined as an a priori distribution , or (subjective) prior distribution on the set of parameters. If the risk function is measurable with respect to , then the so-called Bayesian risk of the test with respect to it can be introduced, and one then sets

.

Efficiency

With the help of the risk and the risk function, two general tests can now be compared with one another. It is said is at least as efficient as when

.

In the case of a preliminary assessment, the tests can also be compared using the Bayesian risk. One then says is at least as efficient as when .

Optimality

The optimality of a test can be introduced in many different ways. A test is called a

  • highly efficient in , if .
  • Minimax procedure in , if applies.
  • Bayesian solution in re . , If applies.
  • multisubjective optimal or -minimax method in , if a family of probability measures is on and holds .

With a fixed parameter , the unavoidable damage for each test is in . For a good test you will therefore require that

becomes as small as possible ("minimal regret"). Therefore it is further referred to as

  • strictest test in when applies.

Correlation : In the case of the optimality criteria listed here, the highest efficiency can be classified as the strongest requirement, because if a test is extremely efficient in , it is already the minimax method, Bayesian solution, multisubjectively optimal and also the strictest test.

Examples

Hypothesis test

In a hypothesis or significance test , one looks at two mutually exclusive hypotheses and one of which, as a rule, one tries , for example , to reject on the basis of an observation . The set of possible decisions is therefore of the form , where one defines:

"Hypothesis can be rejected."
"The hypothesis cannot be rejected, so no conclusion can be drawn from the experiment."

Parameter point estimation

Given a random variable with respect to two measurement spaces and , which is subject to the distribution family . The "true" parameter is unknown here . This, or more generally a value that depends on , is to be estimated. That is why one regards as a decision-making space . Often used as a damage function

.

This results in the mean square deviation of the estimate from the value to be estimated for a test as a risk function, because

.

Parameter range estimation

The random variable is considered again . One would like to estimate an area in which one suspects the "true" parameter . One sets for this . The empty set is excluded as a decision, since estimating it would not make sense. As a payload, the mapping provides with on. With it you get the risk function for a test

d. H. is precisely the probability with which the parameter is not in the estimated set. The error probability of the procedure for the parameter is therefore also called . The risk is called the significance limit of .