Newtonβs method
Let π be a differentiable function from βn to βn.
Newtonβs method consists of starting at an ππ for the equation π(π±)=π. Then the function is linearized at ππ by replacing the increment π(π±)-π(ππ) by a linear function of the increment [π(ππ)](π±-ππ).
Now we can solve the linear equation π(ππ)+[π(ππ)](π±-ππ)=π. Since this is a system of n linear equations in n unknowns, [π(ππ)](π±-ππ)=-π(ππ) can be likened to the general linear system Aπ±=π.
Therefore, if [π(ππ)] is invertible, then π±=ππ-[π(ππ)]-1π(ππ).
By renaming π± to ππ, you can reiterate Newtonβs method to get an ππ. Thus, Newtonβs method states
πn+1=πn-[ππ(πn)]-1π(πn) |
Thus we get a series of πβs that we hope will converge to π± such that π(π±)=π. When we solve an equation of the form π(π±)=π, we call the solution a root of the equation. Thus, Newtonβs method is used to find roots of nonlinear equations.
Unfortunately, Newtonβs method does not always converge. There are tests for
neighborhoods of ππβs where Newtonβs method will converge however. One such test is Kantorovitchβs theorem
, which combines what is needed into a concise mathematical equation.
Corollary 1: Newtonβs Method in one dimension - The above equation is simplified in one dimension to the well-used
a1=a0-f(a0)fβ²(a0) |
This intuitively cute equation is pretty much the equation of first year calculus. :)
Corollary 2: Finding a square root - So now that you know the equation,
you need to know how to use it, as it is an algorithm. The construction of the primary equation, of course is the important part. Letβs see how you do it if you want to find a square root of a number b.
We want to find a number x (x for unknown), such that x2=b. You might think βwhy not find a number such that x=βb ?β Well, the problem with that approach is that we donβt have a value for βb, so weβd be right back where we started. However, squaring both sides of the equation to get x2=b lets us work with the number we do know, b.) Back to x2=b. With some manipulation, we see this means that x2-b=0 ! Thus we have our f(x)=0 scenario.
We can see that fβ²(x)=2x thus, fβ²(a0)=2a0 and f(a0)=a20-b. Now we have all we need to carry out Newtonβs method. By renaming x to be a1, we have
a1=a0-12a0(a20-b)=12(a0+ba0) |
.
The equation on the far right is also known as the divide and average method, for those who have not learned the full Newtonβs method, and just want a fast way to find square roots. Letβs see how this works out to find the square root of a number like 2:
Let x2=2
x2-2=0=f(x) |
Thus, by Newtonβs method,β¦
a1=a0-a20-22a0 |
All we did was plug in the expressions f(a0) and fβ²(a0) where Newtonβs method asks for them. Now we have to pick an a0. Hmm, since
β1<β2<β4 |
1<β2<2 |
letβs pick a reasonable number between 1 and 2 like 1.5
a1=1.5-1.52-22(1.5) |
a1=1.41Λ6 |
Looks like our guess was too high. Letβs see what the next iteration says
a2=1.41Λ6-1.41Λ62-22(1.41Λ6) |
a2=1.414215686β¦ |
getting better =) You can use your calculator to find that
β2=1.414213562β¦ |
Not bad for only two iterations! Of course, the more you iterate, the more decimal places your an will be accurate to. By the way, this is also how your calculator/computer finds square roots!
Geometric Interpretation: Consider an arbitrary function f:βββ such as f(x)=x2-b. Say you wanted to find a root of this function. You know that in the neighborhood of x=a0, there is a root (Maybe you used Kantorovitchβs theorem or tested and saw that the function changed signs in this neighborhood). We want to use our knowledge a0 to find an a1 that is a better approximation to x0 (in this case, closer to it on the x-axis).
So we know that x0β€a1β€a0 or in another case a0β€a1β€x0. What is an efficient algorithm to bridge the gap between a0 and x0 ? Letβs look at a tangent line to the graph.
Note that the line intercepts the x-axis between a0 and x0, which is exactly what we want. The slope of this tangent line is fβ²(a0) by definition of the derivative at a0, and we know one point on the line is (a1,0), since that is the x-intercept. That is all we need to find the formula
of the line, and solve for a1.
y-y1 | =m(x-x1) | |||
f(a0)-0 | =fβ²(a0)(a0-a1) | Substituting | ||
f(a0)fβ²(a0) | =a0-a1 | |||
-a1 | =-a0+f(a0)fβ²(a0) | Aesthetic change. Flipped the equation around. | ||
a1 | =a0-f(a0)fβ²(a0) | Newtonβs method! |
Title | Newtonβs method |
Canonical name | NewtonsMethod |
Date of creation | 2013-03-22 11:58:05 |
Last modified on | 2013-03-22 11:58:05 |
Owner | alozano (2414) |
Last modified by | alozano (2414) |
Numerical id | 28 |
Author | alozano (2414) |
Entry type | Algorithm |
Classification | msc 49M15 |
Synonym | Newton-Raphson method |
Related topic | LipschitzCondition |
Related topic | KantorovitchsTheorem |
Related topic | Derivative |
Related topic | Superconvergence |
Related topic | FixedPointsOfNormalFunctions |