The Fritz-John conditions (abbreviated to FJ conditions ) are a necessary first-order optimality criterion in mathematics in non-linear optimization . They are a generalization of the Karush-Kuhn-Tucker conditions and, in contrast to these, do not have any regularity conditions. They are named after the American mathematician of German descent, Fritz John .
Framework
The Fritz-John-Conditions allow statements about an optimization problem of the form
![\ min _ {{x \ in D}} f (x)](https://wikimedia.org/api/rest_v1/media/math/render/svg/d85bc03086db20718f40f8b780d1fb1f500711b7)
under the constraints
![g_ {i} (x) \ leq 0 ~, 1 \ leq i \ leq m](https://wikimedia.org/api/rest_v1/media/math/render/svg/fe3e17e52c7aa87c00575e5caf869822271edee6)
-
.
All functions considered are continuously differentiable and is a non-empty subset of the .
![D.](https://wikimedia.org/api/rest_v1/media/math/render/svg/f34a0c600395e5d4345287e21fb26efd386990e6)
![{\ mathbb {R}} ^ {n}](https://wikimedia.org/api/rest_v1/media/math/render/svg/c510b63578322050121fe966f2e5770bea43308d)
statement
A point is called the Fritz-John point or FJ point for short in the above optimization problem if it meets the following conditions:
![(z ^ {*}, x ^ {*}, \ mu ^ {*}, \ lambda ^ {*}) \ in {\ mathbb {R}} ^ {{1 + n + m + l}}](https://wikimedia.org/api/rest_v1/media/math/render/svg/afd6be69673e30dd1024c45e2578ea8c1f78c34d)
![z ^ {*} \ nabla f (x ^ {*}) + \ sum _ {{{i = 1}} ^ {m} \ mu _ {i} ^ {*} \ nabla g_ {i} (x ^ { *}) + \ sum _ {{j = 1}} ^ {l} \ lambda _ {j} ^ {*} \ nabla h_ {j} (x ^ {*}) = 0](https://wikimedia.org/api/rest_v1/media/math/render/svg/34b79549a332edd7779a505aabe512cf1695ee5c)
![g_ {i} (x ^ {*}) \ leq 0, {\ mbox {for}} i = 1, \ ldots, m](https://wikimedia.org/api/rest_v1/media/math/render/svg/0a2d8ac2f336293f6c2af725678afad5fec5ed13)
![h_ {j} (x ^ {*}) = 0, {\ mbox {for}} j = 1, \ ldots, l \, \!](https://wikimedia.org/api/rest_v1/media/math/render/svg/d2d948a390b49ed32e5d229225bc464d1fc30da0)
![\ mu _ {i} ^ {*} \ geq 0, {\ mbox {for}} i = 1, \ ldots, m](https://wikimedia.org/api/rest_v1/media/math/render/svg/f6ce79f7863ffaf0ec525f7277f30f9f0849cd92)
![\ mu _ {i} ^ {*} g_ {i} (x ^ {*}) = 0, {\ mbox {for}} \; i = 1, \ ldots, m.](https://wikimedia.org/api/rest_v1/media/math/render/svg/457c8ab57f2941953fe09bfc9f3271a77c69661f)
![z ^ {*} \ geq 0](https://wikimedia.org/api/rest_v1/media/math/render/svg/c2d3461c23990d8137d5c0dc2260069dbf6f4fcb)
These conditions are called the Fritz-John-Conditions or FJ-Conditions for short .
If the point is the local minimum of the optimization problem, then there is such that is an FJ point and is not equal to the zero vector.
![x ^ {*}](https://wikimedia.org/api/rest_v1/media/math/render/svg/e5be23ee5d433f8b576e63bcb47518128ee0b6bb)
![\ mu ^ {*}, \ lambda ^ {*}, z ^ {*}](https://wikimedia.org/api/rest_v1/media/math/render/svg/551a4fa373fcb619893ac66e3278837507dade76)
![(z ^ {*}, x ^ {*}, \ mu ^ {*}, \ lambda ^ {*})](https://wikimedia.org/api/rest_v1/media/math/render/svg/a41cbc7dc915d6ebb17d2af610af3cbcf9082a63)
![(z ^ {*}, \ mu ^ {*}, \ lambda ^ {*})](https://wikimedia.org/api/rest_v1/media/math/render/svg/49c2e1a2a4234094e60da05224306d3f41335e87)
Thus the FJ conditions are a necessary first order optimality criterion .
Relationship to the Karush-Kuhn-Tucker Terms
For the FJ conditions correspond exactly to the Karush-Kuhn-Tucker conditions . Is a FJ-point, so also with a FJ-point. It can therefore be assumed that if is, a KKT point is already present; this is also generated by rescaling . Then the KKT point belonging to an FJ point is. Conversely, the constraint qualifications of the KKT conditions can now be interpreted in such a way that they guarantee the FJ conditions .
![{\ displaystyle z ^ {*} = 1}](https://wikimedia.org/api/rest_v1/media/math/render/svg/f96d9d89c3fb07423ebbdb6912371287b04a7296)
![(z ^ {*}, x ^ {*}, \ mu ^ {*}, \ lambda ^ {*})](https://wikimedia.org/api/rest_v1/media/math/render/svg/a41cbc7dc915d6ebb17d2af610af3cbcf9082a63)
![{\ displaystyle (sz ^ {*}, x ^ {*}, s \ mu ^ {*}, s \ lambda ^ {*})}](https://wikimedia.org/api/rest_v1/media/math/render/svg/d567dc1e91bd1f1bf065f6bc43f382bc7e66bc05)
![{\ displaystyle s> 0}](https://wikimedia.org/api/rest_v1/media/math/render/svg/76beea94b6662bd490c61c0628dddd8a8cd35538)
![{\ displaystyle z ^ {*}> 0}](https://wikimedia.org/api/rest_v1/media/math/render/svg/bb8a79a7446b98db6cc36efebf7565c40cb94331)
![{\ displaystyle z ^ {*}}](https://wikimedia.org/api/rest_v1/media/math/render/svg/a5b376dccffe5ae946dcdb7e98bf41beae28dc9e)
![{\ displaystyle (x ^ {*}, \ mu ^ {*} / z ^ {*}, \ lambda ^ {*} / z ^ {*})}](https://wikimedia.org/api/rest_v1/media/math/render/svg/831be9d419889dc4ac02ef8d980554c4020b23d7)
![z ^ {*}> 0](https://wikimedia.org/api/rest_v1/media/math/render/svg/bb8a79a7446b98db6cc36efebf7565c40cb94331)
Examples
FJ without KKT
As an example, consider the optimization problem
![{\ displaystyle \ min _ {x \ in X} -x_ {1}}](https://wikimedia.org/api/rest_v1/media/math/render/svg/4b7944d33532b896f909481da9390567d1817f7d)
with restriction amount
-
.
The minimum of the problem is the point . Hence there is an FJ point such that
![{\ displaystyle x ^ {*} = (0,0)}](https://wikimedia.org/api/rest_v1/media/math/render/svg/f23512eeb84d826208d77bb2da77e462cb3ae5f1)
![{\ displaystyle (z ^ {*}, 0,0, \ mu _ {1} ^ {*}, \ mu _ {2} ^ {*})}](https://wikimedia.org/api/rest_v1/media/math/render/svg/be2bda3ace73152e6fc325cb1b83e51598f5008b)
-
.
It follows directly that for an FJ point.
![{\ displaystyle z ^ {*} = 0, \, \ mu _ {1} ^ {*} = \ mu _ {2} ^ {*}> 0}](https://wikimedia.org/api/rest_v1/media/math/render/svg/a0110ce27df5df8a0d513df838c831f39c1a1c68)
In particular, there is no associated KKT point. If one sets , the system of equations for the gradients cannot be solved. In fact, no regularity condition is fulfilled in the point , especially not the most general one, the Abadie CQ .
![{\ displaystyle z ^ {*} = 1}](https://wikimedia.org/api/rest_v1/media/math/render/svg/f96d9d89c3fb07423ebbdb6912371287b04a7296)
![x ^ {*}](https://wikimedia.org/api/rest_v1/media/math/render/svg/e5be23ee5d433f8b576e63bcb47518128ee0b6bb)
FJ and KKT
As an example, consider the optimization problem
![{\ displaystyle \ min _ {x \ in X} -x_ {2} + x_ {1} ^ {2}}](https://wikimedia.org/api/rest_v1/media/math/render/svg/a401b0a7ed50d5e860c7869b6e0dde80f578aa85)
with restriction amount
-
.
The restriction set is the unit circle with the curvature of the circle removed from the first quadrant. The minimum of the problem is the point . Hence there is an FJ point so that
![{\ displaystyle x ^ {*} = (0.1)}](https://wikimedia.org/api/rest_v1/media/math/render/svg/908a7fd8ab718de6cf473b13b730f4ad38b371d9)
![{\ displaystyle (z ^ {*}, 0,1, \ mu _ {1} ^ {*}, \ mu _ {2} ^ {*})}](https://wikimedia.org/api/rest_v1/media/math/render/svg/4ba40964bcc3271c3f25a7eedc1227ab51921ff5)
![{\ displaystyle z ^ {*} (0, -1) ^ {T} + \ mu _ {1} ^ {*} (1,1) ^ {T} + \ mu _ {2} ^ {*} ( 0.2) ^ {T} = (0.0) ^ {T}}](https://wikimedia.org/api/rest_v1/media/math/render/svg/65d367dfa003e2bcd307b262e7799b7519570074)
applies. One solution would be what leads to the FJ point . Rescaling with leads to the KKT point . In fact, the LICQ is also fulfilled in this point , which is why the KKT conditions also apply here.
![{\ displaystyle z ^ {*} = 2, \, \ mu _ {1} ^ {*} = 0, \, \ mu _ {2} ^ {*} = 1}](https://wikimedia.org/api/rest_v1/media/math/render/svg/3f72f67e989490fa2ed2a8e148a26a24fdf623ae)
![{\ displaystyle (2,0,1,0,1)}](https://wikimedia.org/api/rest_v1/media/math/render/svg/9b36a843f4b10aa7c0fb5a792ce2c74385f15c74)
![{\ displaystyle z ^ {*}}](https://wikimedia.org/api/rest_v1/media/math/render/svg/a5b376dccffe5ae946dcdb7e98bf41beae28dc9e)
![{\ displaystyle (x ^ {*}, \ mu _ {1} ^ {*} / z ^ {*}, \ mu _ {2} ^ {*} / z ^ {*}) = (0,1, 0.1 / 2)}](https://wikimedia.org/api/rest_v1/media/math/render/svg/96eb503cbc11bd5233760be2c18020ccd432ea96)
![x ^ {*}](https://wikimedia.org/api/rest_v1/media/math/render/svg/e5be23ee5d433f8b576e63bcb47518128ee0b6bb)
Related concepts
For convex optimization problems, in which the functions are not continuously differentiable, there are the saddle point criteria of the Lagrange function . If all the functions involved are continuously differentiable, then they are structurally similar to the Fritz-John conditions and equivalent to the KKT conditions.
literature
Web links
Individual evidence
-
↑ F. John: Extremum problems with inequalities as subsidiary conditions . In: Kurt Friedrichs, Otto Neugebauer, JJ Stoker (Ed.): Studies and Essays . Courant Anniversary Volume, Wiley, 1948, pp. 187-204, reprinted in: Fritz John: Collected Papers . Birkhäuser 1985, pp. 543-560