In probability theory, the Chernoff inequality, named after Herman Chernoff but going back to Herman Rubin, gives an upper bound on the probability that a sequence of independent Bernoulli experiments deviates from its expected number of successes.
The Chernoff inequality is a versatile and widely used tool in the analysis of randomized algorithms in computer science.
Statement
Let $X_1, X_2, \ldots, X_n$ be a sequence of $n$ independent Bernoulli experiments with $P[X_i = 1] = p$ and $P[X_i = 0] = 1 - p$. Accordingly, $pn$ is the expected number of successes ($X_i = 1$) of the experiment.
1. For every $\delta > 0$ the following holds:

$$P\left[\sum X_i \geq (1+\delta)\cdot pn\right] \leq \exp\left(-\frac{\min\{\delta,\delta^{2}\}}{3}\,pn\right)$$

2. For every $\delta \in [0,1]$ the following holds:

$$P\left[\sum X_i \leq (1-\delta)\cdot pn\right] \leq \exp\left(-\frac{\delta^{2}}{2}\,pn\right)$$
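Both bounds are straightforward to evaluate numerically. The following Python sketch (the parameter values for $p$, $n$ and $\delta$ are illustrative assumptions, not taken from the article) compares the two bounds with Monte Carlo estimates of the corresponding tail probabilities:

```python
import math
import random

def chernoff_upper(delta, p, n):
    """First Chernoff bound on P[sum X_i >= (1 + delta) * p * n]."""
    return math.exp(-min(delta, delta**2) / 3 * p * n)

def chernoff_lower(delta, p, n):
    """Second Chernoff bound on P[sum X_i <= (1 - delta) * p * n]."""
    return math.exp(-delta**2 / 2 * p * n)

def empirical_tails(delta, p, n, trials=100_000):
    """Monte Carlo estimates of both tail probabilities."""
    upper = lower = 0
    for _ in range(trials):
        s = sum(1 for _ in range(n) if random.random() < p)
        if s >= (1 + delta) * p * n:
            upper += 1
        if s <= (1 - delta) * p * n:
            lower += 1
    return upper / trials, lower / trials

p, n, delta = 0.5, 100, 0.2          # illustrative parameters, not from the article
emp_up, emp_lo = empirical_tails(delta, p, n)
print("upper tail:", emp_up, "bound:", chernoff_upper(delta, p, n))
print("lower tail:", emp_lo, "bound:", chernoff_lower(delta, p, n))
```

As expected, the empirical tail frequencies stay below the bounds; for moderate $n$ the bounds are usually far from tight.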
Proof
First Chernoff bound
Let $t > 0$ be an arbitrary constant. To simplify the notation, let $Y$ denote the new random variable $Y = \exp\left(t \sum X_i\right)$. Since the map $x \mapsto \exp(tx)$ is monotone, it follows that
$$P\left[\sum X_i \geq (1+\delta)\cdot pn\right] = P\left[Y \geq \exp\left(t(1+\delta)\cdot pn\right)\right] = P\left[Y \geq k\,\mathrm{E}[Y]\right] \leq \frac{1}{k},$$
where $k$ is defined as $k = \frac{\exp(t(1+\delta)pn)}{\mathrm{E}[Y]}$ and the last estimate follows from Markov's inequality. Now,
$$\mathrm{E}\left[\exp(tX_i)\right] = (1-p)e^{0} + pe^{t} = 1 + (e^{t}-1)p$$
and thus
$$\mathrm{E}[Y] = \mathrm{E}\left[\exp\left(t\sum X_i\right)\right] = \mathrm{E}\left[\prod \exp(tX_i)\right] = \prod \mathrm{E}\left[\exp(tX_i)\right] = \left(1+(e^{t}-1)p\right)^{n},$$

where the third equality uses the independence of the $X_i$.
Using $1 + x \leq e^{x}$, it follows that
$$\frac{1}{k} = e^{-t(1+\delta)pn}\left(1+(e^{t}-1)p\right)^{n} \leq e^{-t(1+\delta)pn}\cdot e^{(e^{t}-1)pn} = e^{-t(1+\delta)pn+(e^{t}-1)pn}.$$
Now choose $t = \log(1+\delta)$. Then
$$\frac{1}{k} \leq e^{-(\log(1+\delta))(1+\delta)pn+\delta pn} = e^{(\delta-(1+\delta)\log(1+\delta))pn}.$$
For the part $L(\delta) = (1+\delta)\log(1+\delta)$ of the exponent of the right-hand term, one can show by elementary curve analysis and Taylor series expansion that $L(\delta) \geq \delta + \tfrac{1}{3}\min\{\delta,\delta^{2}\}$ always holds (a short numerical check is sketched after the proof). Since the exponential function is monotone, it follows that
$$\frac{1}{k} \leq e^{(\delta-(\delta+\frac{1}{3}\min\{\delta,\delta^{2}\}))pn} = \exp\left(-\frac{\min\{\delta,\delta^{2}\}}{3}\,pn\right).$$
Together with the first estimate, this proves the assertion.
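The estimate $L(\delta) \geq \delta + \tfrac{1}{3}\min\{\delta,\delta^{2}\}$ is only asserted above; the following short Python check on a grid of test points is a plausibility sketch, not a substitute for the curve analysis:

```python
import math

def L(delta):
    """Exponent term L(delta) = (1 + delta) * log(1 + delta) from the proof."""
    return (1 + delta) * math.log(1 + delta)

# Check L(delta) >= delta + min(delta, delta**2) / 3 on a grid of test points.
for k in range(1, 2001):
    delta = k / 100.0                      # delta ranges over (0, 20]
    assert L(delta) >= delta + min(delta, delta**2) / 3, delta
print("inequality holds at all tested points")
```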
Second Chernoff bound
The proof of the second bound proceeds analogously to that of the first.
Variants
A more general variant of the Chernoff inequality can be formulated using the standard deviation. Let $X_1, X_2, \ldots, X_n$ be discrete, independent random variables with $\mathrm{E}[X_i] = 0$ and $|X_i| \leq 1$. Let $\sigma^{2}$ denote the variance of $X = \sum X_i$. Then for every $0 < \lambda \leq 2\sigma$:
$$P\left[\left|\sum X_i\right| \geq \lambda\sigma\right] \leq 2\exp\left(-\frac{\lambda^{2}}{4}\right)$$
The proof is technically analogous to the one shown above.
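As an illustration of this variant, the following sketch uses fair $\pm 1$ random variables (this concrete choice of $X_i$ and the parameter values are assumptions made for the example, not taken from the article) and compares an empirical tail estimate with the bound $2\exp(-\lambda^{2}/4)$:

```python
import math
import random

def empirical_tail(n, lam, trials=100_000):
    """Estimate P[|sum X_i| >= lam * sigma] for X_i uniform on {-1, +1}."""
    sigma = math.sqrt(n)                  # Var(X_i) = 1, hence Var(sum X_i) = n
    hits = 0
    for _ in range(trials):
        s = sum(random.choice((-1, 1)) for _ in range(n))
        if abs(s) >= lam * sigma:
            hits += 1
    return hits / trials

n, lam = 100, 3.0                         # lam must satisfy 0 < lam <= 2 * sigma
print(empirical_tail(n, lam), "<=", 2 * math.exp(-lam**2 / 4))
```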
Examples
Consider the following question: how likely is it to get at least seven tails when a fair coin is tossed ten times? The coin tosses yield Bernoulli trials $X_1, X_2, \ldots, X_{10}$ with $pn = \tfrac{1}{2}\cdot 10 = 5$. By the first Chernoff bound:
$$P\left[\sum X_i \geq 7\right] = P\left[\sum X_i \geq \left(1+\frac{4}{10}\right)\cdot 5\right] \leq \exp\left(-\frac{\min\{\frac{4}{10},\frac{16}{100}\}}{3}\cdot 5\right) = \exp\left(-\frac{4}{15}\right) \approx 0.766\ldots$$
Now rephrase the above example slightly and ask instead: how likely is it to get at least seventy tails in a hundred tosses of a fair coin? Here the first Chernoff bound immediately turns out to be much stronger:
$$P\left[\sum X_i \geq 70\right] = P\left[\sum X_i \geq \left(1+\frac{4}{10}\right)\cdot 50\right] \leq \exp\left(-\frac{\min\{\frac{4}{10},\frac{16}{100}\}}{3}\cdot 50\right) = \exp\left(-\frac{8}{3}\right) \approx 0.069\ldots$$
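For comparison, the exact tail probabilities of both coin-toss examples can be computed from the binomial distribution; the following sketch (Python standard library only) contrasts them with the Chernoff bounds $\exp(-4/15)$ and $\exp(-8/3)$ derived above:

```python
import math

def binomial_tail(n, p, k):
    """Exact P[sum X_i >= k] for n coin tosses with success probability p."""
    return sum(math.comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# (number of tosses, threshold, Chernoff bound computed in the examples above)
for n, k, bound in [(10, 7, math.exp(-4 / 15)), (100, 70, math.exp(-8 / 3))]:
    print(f"n={n:4d}: exact {binomial_tail(n, 0.5, k):.6f} <= bound {bound:.4f}")
```

This also shows that the bound is rather loose for ten tosses but clearly informative for a hundred.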
References
Herman Chernoff: A career in statistics. In: Xihong Lin, Christian Genest, David L. Banks, Geert Molenberghs, David W. Scott, Jane-Ling Wang (eds.): Past, Present, and Future of Statistics. CRC Press, 2014, ISBN 978-1-4822-0496-4, p. 35 (crcpress.com).
John Bather: A Conversation with Herman Chernoff. In: Statistical Science. 11, No. 4, November 1996, pp. 335-350. doi:10.1214/ss/1032280306.