Laguerre's method

From The Right Wiki
Jump to navigationJump to search

In numerical analysis, Laguerre's method is a root-finding algorithm tailored to polynomials. In other words, Laguerre's method can be used to numerically solve the equation p(x) = 0 for a given polynomial p(x). One of the most useful properties of this method is that it is, from extensive empirical study, very close to being a "sure-fire" method, meaning that it is almost guaranteed to always converge to some root of the polynomial, no matter what initial guess is chosen. However, for computer computation, more efficient methods are known, with which it is guaranteed to find all roots (see Root-finding algorithm § Roots of polynomials) or all real roots (see Real-root isolation). This method is named in honour of the French mathematician, Edmond Laguerre.

Definition

The algorithm of the Laguerre method to find one root of a polynomial p(x) of degree n is:

  • Choose an initial guess x0
  • For k = 0, 1, 2, ...
    • If p(xk) is very small, exit the loop
    • Calculate G=p(xk)p(xk)
    • Calculate H=G2p(xk)p(xk)
    • Calculate a=nG±(n1)(nHG2), where the sign is chosen to give the denominator with the larger absolute value, to avoid catastrophic cancellation.
    • Set xk+1=xka
  • Repeat until a is small enough or if the maximum number of iterations has been reached.

If a root has been found, the corresponding linear factor can be removed from p. This deflation step reduces the degree of the polynomial by one, so that eventually, approximations for all roots of p can be found. Note however that deflation can lead to approximate factors that differ significantly from the corresponding exact factors. This error is least if the roots are found in the order of increasing magnitude.

Derivation

The fundamental theorem of algebra states that every nth degree polynomial p can be written in the form

p(x)=C(xx1)(xx2)(xxn),

so that x1,x2,,xn, are the roots of the polynomial. If we take the natural logarithm of both sides, we find that

ln|p(x)|=ln|C|+ln|xx1|+ln|xx2|++ln|xxn|.

Denote the logarithmic derivative by

G=ddxln|p(x)|=1xx1+1xx2++1xxn[0.3em]=p(x)|p(x)|,

and the negated second derivative by

H=d2dx2ln|p(x)|=1(xx1)2+1(xx2)2++1(xxn)2[0.3em]=p(x)|p(x)|+(p(x)p(x))2sgn(p(x)).

We then make what Acton (1970)[page needed] calls a 'drastic set of assumptions', that the root we are looking for, say, x1 is a short distance,a, away from our guess x, and all the other roots are all clustered together, at some further distance b. If we denote these distances by

axx1

and

bxx2xx3xxn,

or exactly,

bharmonicmean{xx2,xx3,xxn}

then our equation for G may be written as

G=1a+n1b

and the expression for H becomes

H=1a2+n1b2.

Solving these equations for a, we find that

a=nG±(n1)(nHG2),

where in this case, the square root of the (possibly) complex number is chosen to produce largest absolute value of the denominator and make a as small as possible; equivalently, it satisfies:

R{G(n1)(nHG2)}>0,

where R denotes real part of a complex number, and G is the complex conjugate of G; or

a=p(x)p(x){1n+n1n1nn1p(x)p(x)p(x)2}1,

where the square root of a complex number is chosen to have a non-negative real part. For small values of p(x) this formula differs from the offset of the third order Halley's method by an error of 𝒪{(p(x))3}, so convergence close to a root will be cubic as well.

Fallback

Even if the 'drastic set of assumptions' does not work well for some particular polynomial p(x), then p(x) can be transformed into a related polynomial r for which the assumptions are viable; e.g. by first shifting the origin towards a suitable complex number w, giving a second polynomial q(x) = p(xw), that give distinct roots clearly distinct magnitudes, if necessary (which it will be if some roots are complex conjugates). After that, getting a third polynomial r from q(x) by repeatedly applying the root squaring transformation from Graeffe's method, enough times to make the smaller roots significantly smaller than the largest root (and so, clustered comparatively nearer to zero). The approximate root from Graeffe's method, can then be used to start the new iteration for Laguerre's method on r. An approximate root for p(x) may then be obtained straightforwardly from that for r. If we make the even more extreme assumption that the terms in G corresponding to the roots x2,x3,,xn are negligibly small compared to the root x1, this leads to Newton's method.

Properties

File:Attraction zones of Laguerre's.png
Attraction zones of Laguerre's method for the polynomial p(x)=x4+2x3+3x2+4x+1.

If x is a simple root of the polynomial p(x), then Laguerre's method converges cubically whenever the initial guess, x(0), is close enough to the root x1. On the other hand, when x1 is a multiple root convergence is merely linear, with the penalty of calculating values for the polynomial and its first and second derivatives at each stage of the iteration. A major advantage of Laguerre's method is that it is almost guaranteed to converge to some root of the polynomial no matter where the initial approximation is chosen. This is in contrast to other methods such as the Newton–Raphson method and Stephensen's method, which notoriously fail to converge for poorly chosen initial guesses. Laguerre's method may even converge to a complex root of the polynomial, because the radicand of the square root may be of a negative number, in the formula for the correction, a, given above – manageable so long as complex numbers can be conveniently accommodated for the calculation. This may be considered an advantage or a liability depending on the application to which the method is being used. Empirical evidence shows that convergence failure is extremely rare, making this a good candidate for a general purpose polynomial root finding algorithm. However, given the fairly limited theoretical understanding of the algorithm, many numerical analysts are hesitant to use it as a default, and prefer better understood methods such as the Jenkins–Traub algorithm, for which more solid theory has been developed and whose limits are known. The algorithm is fairly simple to use, compared to other "sure-fire" methods, and simple enough for hand calculation, aided by a pocket calculator, if a computer is not available. The speed at which the method converges means that one is only very rarely required to compute more than a few iterations to get high accuracy.

References

  • Acton, Forman S. (1970). Numerical Methods that USUALLY Work. Harper & Row. ISBN 0-88385-450-3 – via Internet Archive (archive.org).
  • Goedecker, S. (1994). "Remark on algorithms to find roots of polynomials". SIAM Journal on Scientific Computing. 15 (5): 1059–1063. Bibcode:1994SJSC...15.1059G. doi:10.1137/0915064.
  • Mekwi, Wankere R. (2001). Iterative methods for roots of polynomials (Master's thesis). Mathematics. Oxford, UK: University of Oxford. Archived from the original on 23 December 2012.
  • Pan, V.Y. (1997). "Solving a polynomial equation: Some history and recent progress". SIAM Review. 39 (2): 187–220. Bibcode:1997SIAMR..39..187P. doi:10.1137/S0036144595288554.
  • Press, W.H.; Teukolsky, S.A.; Vetterling, W.T.; Flannery, B.P. (2007). "Section 9.5.3   Laguerre's method". Numerical Recipes: The art of scientific computing (3rd ed.). New York, NY: Cambridge University Press. ISBN 978-0-521-88068-8.
  • Ralston, Anthony; Rabinowitz, Philip (1978). A First Course in Numerical Analysis. McGraw-Hill. ISBN 0-07-051158-6.