Danskin's theorem

From The Right Wiki
Jump to navigationJump to search

In convex analysis, Danskin's theorem is a theorem which provides information about the derivatives of a function of the form f(x)=maxzZϕ(x,z). The theorem has applications in optimization, where it sometimes is used to solve minimax problems. The original theorem given by J. M. Danskin in his 1967 monograph [1] provides a formula for the directional derivative of the maximum of a (not necessarily convex) directionally differentiable function. An extension to more general conditions was proven 1971 by Dimitri Bertsekas.

Statement

The following version is proven in "Nonlinear programming" (1991).[2] Suppose ϕ(x,z) is a continuous function of two arguments, ϕ:n×Z where Zm is a compact set. Under these conditions, Danskin's theorem provides conclusions regarding the convexity and differentiability of the function f(x)=maxzZϕ(x,z). To state these results, we define the set of maximizing points Z0(x) as Z0(x)={z:ϕ(x,z)=maxzZϕ(x,z)}. Danskin's theorem then provides the following results.

Convexity
f(x) is convex if ϕ(x,z) is convex in x for every zZ.
Directional semi-differential
The semi-differential of f(x) in the direction y, denoted yf(x), is given by yf(x)=maxzZ0(x)ϕ(x,z;y), where ϕ(x,z;y) is the directional derivative of the function ϕ(,z) at x in the direction y.
Derivative
f(x) is differentiable at x if Z0(x) consists of a single element z. In this case, the derivative of f(x) (or the gradient of f(x) if x is a vector) is given by fx=ϕ(x,z)x.

Example of no directional derivative

In the statement of Danskin, it is important to conclude semi-differentiability of f and not directional-derivative as explains this simple example. Set Z={1,+1},ϕ(x,z)=zx, we get f(x)=|x| which is semi-differentiable with f(0)=1,+f(0)=+1 but has not a directional derivative at x=0.

Subdifferential

If ϕ(x,z) is differentiable with respect to x for all zZ, and if ϕ/x is continuous with respect to z for all x, then the subdifferential of f(x) is given by f(x)=conv{ϕ(x,z)x:zZ0(x)} where conv indicates the convex hull operation.

Extension

The 1971 Ph.D. Thesis by Dimitri P. Bertsekas (Proposition A.22) [3] proves a more general result, which does not require that ϕ(,z) is differentiable. Instead it assumes that ϕ(,z) is an extended real-valued closed proper convex function for each z in the compact set Z, that int(dom(f)), the interior of the effective domain of f, is nonempty, and that ϕ is continuous on the set int(dom(f))×Z. Then for all x in int(dom(f)), the subdifferential of f at x is given by f(x)=conv{ϕ(x,z):zZ0(x)} where ϕ(x,z) is the subdifferential of ϕ(,z) at x for any z in Z.

See also

References

  1. Danskin, John M. (1967). The theory of Max-Min and its application to weapons allocation problems. New York: Springer.
  2. Bertsekas, Dimitri P. (1999). Nonlinear programming (Second ed.). Belmont, Massachusetts. ISBN 1-886529-00-0.{{cite book}}: CS1 maint: location missing publisher (link)
  3. Bertsekas, Dimitri P. (1971). Control of Uncertain Systems with a Set-Membership Description of Uncertainty (PDF) (PhD). Cambridge, MA: MIT.