Smit's paradox

from Wikipedia, the free encyclopedia

In classic mathematical statistics, the following applies, to put it casually: the larger the sample , the better the estimate (for more details, see main theorem of mathematical statistics ). In the statistics of random processes , however, it is possible - and is then usually perceived as paradoxical - that an estimate becomes worse as the sample size increases . SJ Wilenkin was the first to notice this in 1959, but there were mistakes in his work, so that JC Smit became the namesake of the paradox in 1961.

The paradox

Let be a weakly stationary random process with an unknown constant expectation value and a (known) covariance function . The process can be watched for Be (discrete) observations and continuous observation of the process over the whole . Than are

unbiased estimates for . Intuitively it seems to be clear that is better than because it uses more information, namely information from the whole , while only uses point information. But even for simple special cases, the opposite is shown: it is better than if one takes the variance of the estimators as a criterion:

example

Be , d. H. discreet observatories. Then it follows , i. H. is better than . If one includes further observations between the previous places, i. H. at , then the variance of from worsens to , i.e. h, a “condensation” of the observations leads to a worse result.

Resolution of the paradox

The estimate is for not the best linear unbiased estimate ( English Best Linear Unbiased Estimator , BLUE for short ), so it is compared with a non-optimal estimate. The BLUE for results after a set of Grenander in the form of a Stieltjesintegrales as a solution of the integral equation with .

Continued example

See also. With the same settlements as in the example above, the result is

.

in contrast to extra weights, places them on the edge of the observation interval ( ). The discrete five-point estimate approximates this marginal weighting better than and is therefore naturally the better estimator.

Practical meaning

The phenomenon described for stochastic processes also applies to random fields . In geostatistics in particular , it is important to know that network densification in geographic information systems does not automatically lead to better estimation results.

Individual evidence

  1. SJ Wilenkin: Whether ocenke srednego v stacionarnych processach. In: Teorija Verojatnost. IV, 1959, pp. 451-453.
  2. JC Smit: Estimation of the mean of a stationary stochastic process by equidistant observations. In: Trabojos de estadistica. 12, 1961, pp. 35-45.
  3. U. Grenander: Stochastic processes and statistical inference. In: Arkiv för Matematik. 1, 1950, pp. 195-277.
  4. ^ W. Näther: Effective Observation of Random Fields. (= Teubner texts on mathematics. Volume 72). Teubner Verlag, Leipzig 1985.
  5. W. Näther: Good and bad examples from the design of experiments for stochastic processes and fields. In: Series of publications by the Institute for Mine Surveying and Geodesy at the TU Bergakademie Freiberg. Issue 2, 2004, pp. 8-19.