Convergence proof techniques
Convergence proof techniques are canonical patterns of mathematical proofs that sequences or functions converge to a finite limit when the argument tends to infinity. There are many types of sequences and modes of convergence, and different proof techniques may be more appropriate than others for proving each type of convergence of each type of sequence. Below are some of the more common and typical examples. This article is intended as an introduction aimed to help practitioners explore appropriate techniques. The links below give details of necessary conditions and generalizations to more abstract settings. Proof techniques for the convergence of series, a particular type of sequences corresponding to sums of many terms, are covered in the article on convergence tests.
Convergence in Rn
It is common to want to prove convergence of a sequence or function , where and refer to the natural numbers and the real numbers, respectively, and convergence is with respect to the Euclidean norm, . Useful approaches for this are as follows.
First principles
The analytic definition of convergence of to a limit is that[1] for all there exists a such for all , . The most direct proof technique from this definition is to find such a and prove the required inequality. If the value of is not known in advance, the techniques below may be useful.
Contraction mappings
In many cases, the function whose convergence is of interest has the form for some transformation . For example, could map to for some conformable matrix , so that , a matrix generalization of the geometric progression. Alternatively, may be an elementwise operation, such as replacing each element of by the square root of its magnitude. In such cases, if the problem satisfies the conditions of Banach fixed-point theorem (the domain is a non-empty complete metric space) then it is sufficient to prove convergence to prove that is a contraction mapping to prove that it has a fixed point. This requires that for some constant which is fixed for all and . The composition of two contraction mappings is a contraction mapping, so if , then it is sufficient to show that and are both contraction mappings.
Example
Famous examples of applications of this approach include
- If has the form for some matrices and , then converges to if the magnitudes of all eigenvalues of are less than 1[citation needed].
Non-expansion mappings
If both above inequalities in the definition of a contraction mapping are weakened from "strictly less than" to "less than or equal to", the mapping is a non-expansion mapping. It is not sufficient to prove convergence to prove that is a non-expansion mapping. For example, is a non-expansion mapping, but the sequence does not converge for any . However, the composition of a contraction mapping and a non-expansion mapping (or vice versa) is a contraction mapping.
Contraction mappings on limited domains
If is not a contraction mapping on its entire domain, but it is on its codomain (the image of the domain), that is also sufficient for convergence. This also applies for decompositions. For example, consider . The function is not a contraction mapping, but it is on the restricted domain , which is the codomain of for real arguments. Since is a non-expansion mapping, this implies is a contraction mapping.
Convergent subsequences
Every bounded sequence in has a convergent subsequence, by the Bolzano–Weierstrass theorem. If these subsequences all have the same limit, then the original sequence also converges to that limit. If it can be shown that all of the subsequences of must have the same limit, such as by showing that there is a unique fixed point of the transformation and that there are no invariant sets of that contain no fixed points of , then the initial sequence must also converge to that limit.
Monotonicity (Lyapunov functions)
Every bounded monotonic sequence in converges to a limit. This fact can be used directly and can also be used to prove the convergence of sequences that are not monotonic using techniques and theorems named for Aleksandr Lyapunov. In these cases, one defines a function such that is monotonic in and thus converges. If satisfies the conditions to be a Lyapunov function then Lyapunov's theorem implies that is also convergent. Lyapunov's theorem is normally stated for ordinary differential equations, but it can also be applied to sequences of iterates by replacing derivatives with discrete differences. The basic requirements on to be a Lyapunov function are that
- for all and
- for (discrete case) or for (continuous case)
- is "radially unbounded", i.e., that for any sequence with .
In many cases a quadratic Lyapunov function of the form can be found, although more complex forms are also common, for instance entropies in the study of convergence of probability distributions. For delay differential equations, a similar approach applies with Lyapunov functions replaced by Lyapunov functionals also called Lyapunov-Krasovskii functionals. If the inequality in the condition 2 is weak, LaSalle's invariance principle may be used.
Convergence of sequences of functions
To consider the convergence of sequences of functions,[2] it is necessary to define a distance between functions to replace the Euclidean norm. These often include
- Convergence in the norm (strong convergence) -- a function norm, such as is defined, and convergence occurs if . For this case, all of the above techniques can be applied with this function norm.
- Pointwise convergence -- convergence occurs if for each , . For this case, the above techniques can be applied for each point with the norm appropriate for .
- uniform convergence -- In pointwise convergence, some (open) regions can converge arbitrarily slowly. With uniform convergence, there is a fixed convergence rate such that all points converge at least that fast. Formally, where is the domain of each .
See also
Convergence of random variables
Random variables[3] are more complicated than simple elements of . (Formally, a random variable is a mapping from an event space to a value space . The value space may be , such as the roll of a dice, and such a random variable is often spoken of informally as being in , but convergence of sequence of random variables corresponds to convergence of the sequence of functions, or the distributions, rather than the sequence of values.) There are multiple types of convergence, depending on how the distance between functions is measured.
- Convergence in distribution -- pointwise convergence of the distribution functions of the random variables to the limit
- Convergence in probability
- Almost sure convergence -- pointwise convergence of the mappings to the limit, except at a set in with measure 0 in the limit.
- Convergence in the mean
Each has its own proof techniques, which are beyond the current scope of this article. See also
- Dominated convergence
- Carleson's theorem establishing the pointwise (Lebesgue) almost everywhere convergence of Fourier series of L2 functions
- Doob's martingale convergence theorems a random variable analogue of the monotone convergence theorem
Topological convergence
For all of the above techniques, some form the basic analytic definition of convergence above applies. However, topology has its own definitions of convergence. For example, in a non-Hausdorff space, it is possible for a sequence to converge to multiple different limits.
References
- ↑ Ross, Kenneth. Elementary Analysis: The Theory of Calculus. Springer.
- ↑ Haase, Markus. Functional Analysis: An Elementary Introduction. American Mathematics Society.
- ↑ Billingsley, Patrick (1995). Probability and Measure. John Wesley.