Is it feasible to use Newton's method to find all the roots (if more than one) of a polynomial? Are there any efficient algorithms for finding all the roots?
-  $\begingroup$ Yes. Once you find a root $a$, divide the polynomial by $x-a$ and then apply the algorithm again. $\endgroup$Ben Webster– Ben Webster2010-12-09 05:45:34 +00:00Commented Dec 9, 2010 at 5:45
-  $\begingroup$ Are these real or complex polynomials? Are the coefficients integers? rational? Newton's method can be slow near multiple or closely spaced roots although exact multiple roots can be found using the $\gcd$ of a polynomial and its derivative. Once the roots are approximately located, (Sturm's theorem is helpful) Newtons method can be fairly robust. $\endgroup$Aaron Meyerowitz– Aaron Meyerowitz2010-12-09 05:46:15 +00:00Commented Dec 9, 2010 at 5:46
-  $\begingroup$ The coefficients and the roots wanted are real integral values. $\endgroup$George Tyler– George Tyler2010-12-09 06:22:03 +00:00Commented Dec 9, 2010 at 6:22
-  $\begingroup$ How do we choose good intervals to test for Sturm's theorem? $\endgroup$George Tyler– George Tyler2010-12-09 07:44:34 +00:00Commented Dec 9, 2010 at 7:44
-  1$\begingroup$ Your usage of "multiple roots" seems erroneous; "multiple" is used to denote a root that is counted more than once, e.g. 1 is a multiple (double) root of the polynomial $x^2-2x+1$. $\endgroup$J. M. isn't a mathematician– J. M. isn't a mathematician2010-12-09 09:21:07 +00:00Commented Dec 9, 2010 at 9:21
5 Answers
The Newton's method has driven a lot of attention during the past decades in the community of complex analysis, because of its nice behaviour as a dynamical system. From the point of view of the approximation of the roots of a polynomial $P$, it has contrasted properties. On the one hand, it is a fast method for calculating one root $a$. But once you have a good approximation $\alpha$ of it, you face the problem of dividing $P$ by $X-\alpha$. This division is not exact, and it is unstable if $P$ has an other root close to $a$. The latter case happens frequently in large matrices, either random or with physical meaning.
This is why the codes employed by computers to calculate the set of roots follows an other idea. Form the companion matrix $B_P$, which turns out to be of Hessenberg form ($b_{ij}=0$ if $i\ge j+2$). Then apply the QR method. This one enjoys crucial properties:
- it preserves the Hessenberg form,
- one iteration is fast when applied to a Hessenberg matrix ($O(n^2)$ operations),
- it is stable, in the sense that the roundoff errors are not amplified along the iterations. This is because one conjugates by unitary matrices.
Naively, we could think that the calculation of eigenvalues of a matrix must be done in two steps, first calculate the characteristic polynomial, then solve it. But the QR method is so much efficient that the converse is actually true: to find the roots of a polynomial, form the companion matrix and then apply QR.
If you fear that $P$ has multiple roots, you had better to eliminate them first by calculating the g.c.d. of $P$ and $P'$. This is not easy in general, but if $P\in\mathbb Z[X]$, you do it exactly.
Later. I realize that you seem to be interested only on polynomials whose roots are real. If this is correct, then you can accelerate the calculation by the following trick. First form a Sturm sequence of polynomials $P_0=P,P_1=P',\ldots$. Then use it to form a tridiagonal symmetric matrix $H$ whose characteristic polynomial is $P$: $P_{n-j}$ is the characteristic polynomial of the principal $j\times j$ matrix. Now apply QR to $H$. This is really fast. Each iteration requires only $O(n)$ operation, and the method becomes of cubic order.
-  $\begingroup$ +1. As explained in Chapter 13 of springer.com/mathematics/algebra/book/978-1-4419-7682-6. $\endgroup$Did– Did2010-12-09 08:18:26 +00:00Commented Dec 9, 2010 at 8:18
-  1$\begingroup$ I always feel some misgivings when recommending the companion matrix scheme; while it can be accurate (assuming you balanced your companion matrix before subjecting to QR if the coefficients vary widely in magnitude), using an $O(n^3)$ method for a problem with $O(n)$ parameters (and can thus ostensibly be solved in $O(n^2)$ flops) is a bit of a waste, if you ask me. On the other hand, it might come as a surprise that there are in fact modifications of the QR scheme that exploit the sparsity of the Frobenius companion matrix, e.g. scg.ece.ucsb.edu/publications/oadv2.pdf $\endgroup$J. M. isn't a mathematician– J. M. isn't a mathematician2010-12-09 09:24:46 +00:00Commented Dec 9, 2010 at 9:24
-  1$\begingroup$ Regarding the edit: if the polynomial with all real roots has multiple roots, using the symmetric tridiagonal companion matrix has the side benefit of it having zero sub- and superdiagonal matrix entries, such that the eigenproblem actually decouples (your eigenroutine processes a series of small matrices instead of one big matrix). $\endgroup$J. M. isn't a mathematician– J. M. isn't a mathematician2010-12-09 10:01:49 +00:00Commented Dec 9, 2010 at 10:01
-  $\begingroup$ @J.M. Just right ! $\endgroup$Denis Serre– Denis Serre2010-12-09 10:31:04 +00:00Commented Dec 9, 2010 at 10:31
-  $\begingroup$ When you use the Sturm sequence trick, an additional cost comes from computing the Sturm sequence, which should take $O(n^2)$ ops. $\endgroup$Federico Poloni– Federico Poloni2010-12-09 13:24:05 +00:00Commented Dec 9, 2010 at 13:24
Yes: see e.g. MR1859017 (2002i:37059) Reviewed Hubbard, John(1-CRNL); Schleicher, Dierk(D-MNCH-MI); Sutherland, Scott(1-SUNYS-IM) How to find all roots of complex polynomials by Newton's method. (English summary) Invent. Math. 146 (2001), no. 1, 1–33.
Another good choice is the Aberth method http://en.wikipedia.org/wiki/Aberth_method. It has some advantages wrt the QR-based approaches, especially when the coefficients have very different magnitudes.
-  $\begingroup$ This is in fact very similar to the (Weirestrass-)Durand-Kerner method I mentioned in the comments, in that both can be considered as Newton-Raphson methods applied to an appropriate system of $n$ equations in $n$ unknowns. There is a good discussion of these methods in McNamee's book (which I also linked to in the comments). The advantage of (Ehrlich-)Aberth(-Maehly) over (W)DK is that this has a cubic convergence rate (over (W)DK's quadratic convergence rate) if the polynomial's roots are all simple. $\endgroup$J. M. isn't a mathematician– J. M. isn't a mathematician2010-12-09 13:44:08 +00:00Commented Dec 9, 2010 at 13:44
Use Sturm sequences. Or form a matrix that has eigenvalues equal to the roots of your polynomial and apply the QR iteration.
-  $\begingroup$ FWIW, if your input polynomial has all its roots real, one can use a Sturmian scheme to construct a symmetric tridiagonal companion matrix, which can then be subjected to the QR algorithm (which is more efficient in the symmetric case than in the unsymmetric case). $\endgroup$J. M. isn't a mathematician– J. M. isn't a mathematician2010-12-09 09:27:38 +00:00Commented Dec 9, 2010 at 9:27
There is another method based on circles on the complex plane named splitting circle method http://en.wikipedia.org/wiki/Splitting_circle_method.

