Quasilinearization

From The Right Wiki
Jump to navigationJump to search
Two curves on a gridded graph, each marked with crosses at the interpolation nodes. The top shallow curve u_1 has a minimum about 0.71 and the bottom deeper curve u_2 has a minimum about -4. Both curves touch y=1 at x=1 and at x=-1.
Two numerical solutions of the nonlinear example boundary value problem y=y2, y(1)=y(1)=1. Solved by a spectral Chebyshev method and quasilinearization. The top curve u1 used 21 interpolation nodes, and the bottom curve u2 used 34. Both used 3 iterations.

In mathematics, quasilinearization is a technique which replaces a nonlinear differential equation or operator equation (or system of such equations) with a sequence of linear problems, which are presumed to be easier, and whose solutions approximate the solution of the original nonlinear problem with increasing accuracy. It is a generalization of Newton's method; the word "quasilinearization" is commonly used when the differential equation is a boundary value problem.[1][2]

Abstract formulation

Quasilinearization replaces a given nonlinear operator N with a certain linear operator which, being simpler, can be used in an iterative fashion to approximately solve equations containing the original nonlinear operator. This is typically performed when trying to solve an equation such as N(y) = 0 together with certain boundary conditions B for which the equation has a solution y. This solution is sometimes called the "reference solution". For quasilinearization to work, the reference solution needs to exist uniquely (at least locally). The process starts with an initial approximation y0 that satisfies the boundary conditions and is "sufficiently close" to the reference solution y in a sense to be defined more precisely later. The first step is to take the Fréchet derivative of the nonlinear operator N at that initial approximation, in order to find the linear operator L(y0) which best approximates N(y)-N(y0) locally. The nonlinear equation may then be approximated as N(y) = N(yk) + L(yk)( y - yk) + O( y-yk )2, taking k=0. Setting this equation to zero and imposing zero boundary conditions and ignoring higher-order terms gives the linear equation L(yk)( y - yk ) = - N(yk). The solution of this linear equation (with zero boundary conditions) might be called yk+1. Computation of yk for k=1, 2, 3,... by solving these linear equations in sequence is analogous to Newton's iteration for a single equation, and requires recomputation of the Fréchet derivative at each yk. The process can converge quadratically to the reference solution, under the right conditions. Just as with Newton's method for nonlinear algebraic equations, however, difficulties may arise: for instance, the original nonlinear equation may have no solution, or more than one solution, or a multiple solution, in which cases the iteration may converge only very slowly, may not converge at all, or may converge instead to the wrong solution. The practical test of the meaning of the phrase "sufficiently close" earlier is precisely that the iteration converges to the correct solution. Just as in the case of Newton iteration, there are theorems stating conditions under which one can know ahead of time when the initial approximation is "sufficiently close".

Contrast with discretizing first

One could instead discretize the original nonlinear operator and generate a (typically large) set of nonlinear algebraic equations for the unknowns, and then use Newton's method proper on this system of equations. Generally speaking, the convergence behavior is similar: a similarly good initial approximation will produce similarly good approximate discrete solutions. However, the quasilinearization approach (linearizing the operator equation instead of the discretized equations) seems to be simpler to think about, and has allowed such techniques as adaptive spatial meshes to be used as the iteration proceeds.[3]

Example

As an example to illustrate the process of quasilinearization, we can approximately solve the two-point boundary value problem for the nonlinear node d2dx2y(x)=y2(x), where the boundary conditions are y(1)=1 and y(1)=1. The exact solution of the differential equation can be expressed using the Weierstrass elliptic function ℘, like so: y(x)=6(xα|0,β) where the vertical bar notation means that the invariants are g2=0 and g3=β. Finding the values of α and β so that the boundary conditions are satisfied requires solving two simultaneous nonlinear equations for the two unknowns α and β, namely 6(1α|0,β)=1 and 6(1α|0,β)=1. This can be done, in an environment where ℘ and its derivatives are available, for instance by Newton's method.[lower-alpha 1] Applying the technique of quasilinearization instead, one finds by taking the Fréchet derivative at an unknown approximation yk(x) that the linear operator is L(ε)=d2dx2ε(x)2yk(x)ε(x). If the initial approximation is y0(x)=1 identically on the interval 1x1, then the first iteration (at least) can be solved exactly, but is already somewhat complicated. A numerical solution instead, for instance by a Chebyshev spectral method using n=21 Chebyshev—Lobatto points xk=cos(π(n1k)/(n1)) for k=0,1,,n1 gives a solution with residual less than 5109 after three iterations; that is, y3(x) is the exact solution to d2dx2y(x)y2(x)=5109v(x), where the maximum value of |v(x)| is less than 1 on the interval 1x1. This approximate solution (call it u1) agrees with the exact solution 6(xα|0,β) with {α3.524459420,β0.006691372637}. Other values of α and β give other continuous solutions to this nonlinear two-point boundary-value problem for ODE, such as {α2.55347391110,β1.24923895273}. The solution corresponding to these values plotted in the figure is called u2. Yet other values of the parameters can give discontinuous solutions because ℘ has a double pole at zero and so y(x) has a double pole at x=α. Finding other continuous solutions by quasilinearization requires different initial approximations to the ones used here. The initial approximation y0=5x24 approximates the exact solution u2 and can be used to generate a sequence of approximations converging to u2. Both approximations are plotted in the accompanying figure.

Notes

  1. For more information about elliptic functions, see Lawden (1989).[4]

See also

References

  1. Ascher, Uri M.; Mattheij, Robert M.; Russell, Robert D. (1995). Numerical solution of boundary value problems for ordinary differential equations. SIAM.
  2. Sylvester, R.J.; Meyer, F. (June 1965). "Two point boundary problems by quasilinearization". Journal of the Society for Industrial and Applied Mathematics. 13 (2): 586–602. doi:10.1137/0113038. JSTOR 2946451. Retrieved 31 March 2022.
  3. Bornemann, Folkmar (1991). An Adaptive Multilevel Approach to Parabolic Equations in Two Space Dimensions. ZIB. Retrieved 6 March 2022.
  4. Lawden, Derek F. (1989). Elliptic Functions and Applications. New York: Springer-Verlag. ISBN 0-387-96965-9.

Further reading