conjugate gradient algorithm
The conjugate gradient algorithm is used to solve the quadratic minimization problem:
$$\min_{x} \left( \tfrac{1}{2} x^T Q x - b^T x \right)$$
or equivalently to solve the linear system $Qx - b = 0$, where $Q$ is a given $n$ by $n$ symmetric positive definite matrix and $b$ is a given $n$-vector.
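To see why the minimization problem and the linear system are equivalent, one can differentiate the objective. The short derivation below writes the matrix as $Q$ and the vector as $b$, and calls the objective $f$ (a name introduced here only for illustration):

```latex
f(x) = \tfrac{1}{2} x^T Q x - b^T x, \qquad
\nabla f(x) = \tfrac{1}{2} (Q + Q^T) x - b = Q x - b, \qquad
\nabla^2 f(x) = Q \succ 0 .
```

The gradient simplifies because $Q$ is symmetric. Since the Hessian $Q$ is positive definite, $f$ is strictly convex, so its unique minimizer is the point where the gradient vanishes, that is, the solution of $Qx = b$.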
The algorithm requires at most $n$ iterations (in exact arithmetic), starting from an arbitrary initial guess $x_0$ (often $x_0 = 0$ is used). We will use the following notation:
• $k$: iteration number;
• $x_k$: solution approximation;
• $d_k$: search direction;
• $g_k$: residual, which is defined as $g_k = Qx_k - b$.
Algorithm
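1. Initialization. Let $x_0 = 0$ (or other starting point). Let $d_0 = -g_0 = b - Qx_0$. (The initial search direction is set to minus the gradient of the quadratic function being minimized, evaluated at the starting point.)
2. For $k = 0$ to $n-1$ compute
$$\alpha_k = -\frac{g_k^T d_k}{d_k^T Q d_k}$$
$$x_{k+1} = x_k + \alpha_k d_k$$
$$g_{k+1} = Q x_{k+1} - b$$
If $g_{k+1} = 0$ or $k = n-1$, stop: the solution has been found. Otherwise, continue:
$$\beta_k = \frac{g_{k+1}^T Q d_k}{d_k^T Q d_k}$$
$$d_{k+1} = -g_{k+1} + \beta_k d_k$$
(In each iteration the solution estimate is set to the previous estimate plus a multiple of the previous search direction. The next search direction is then set to the negative gradient plus a multiple of the previous search direction.)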
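The steps above can be sketched in plain Python. This is a minimal illustration, not a tuned implementation: it assumes the standard update formulas for the step length and the direction multiplier, written with $Q$ for the matrix, $b$ for the right-hand side and $g = Qx - b$ for the residual. The helper names `mat_vec`, `dot` and `conjugate_gradient`, the tolerance `tol`, and the small test system are all choices made here for illustration.

```python
def mat_vec(Q, v):
    """Return the product Q v for a matrix stored as a list of rows."""
    return [sum(q * x for q, x in zip(row, v)) for row in Q]


def dot(u, v):
    """Return the inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


def conjugate_gradient(Q, b, x0=None, tol=1e-12):
    """Sketch of conjugate gradients for a symmetric positive definite Q."""
    n = len(b)
    x = list(x0) if x0 is not None else [0.0] * n
    g = [qx - bi for qx, bi in zip(mat_vec(Q, x), b)]  # residual g = Q x - b
    d = [-gi for gi in g]                              # first direction: -gradient
    for _ in range(n):                                 # at most n iterations
        if dot(g, g) ** 0.5 <= tol:                    # residual is (numerically) zero
            break
        Qd = mat_vec(Q, d)
        dQd = dot(d, Qd)
        alpha = -dot(g, d) / dQd                       # exact minimizer along d
        x = [xi + alpha * di for xi, di in zip(x, d)]
        g = [qx - bi for qx, bi in zip(mat_vec(Q, x), b)]
        beta = dot(g, Qd) / dQd                        # keeps the next d conjugate to d
        d = [-gi + beta * di for gi, di in zip(g, d)]
    return x


# Hypothetical 2x2 test system; its exact solution is (1/11, 7/11).
x = conjugate_gradient([[4.0, 1.0], [1.0, 3.0]], [1.0, 2.0])
```

The residual is recomputed from its definition in every iteration to stay close to the description above; practical codes usually update it recursively to save one matrix-vector product per step.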
Discussion
The conjugate gradient method was developed in 1952 by Hestenes and Stiefel as an improvement to the steepest descent method. Whereas steepest descent approaches the solution only asymptotically, the conjugate gradient method finds the solution in at most $n$ iterations (assuming no roundoff error).
Why the name? The search directions are conjugate in the sense that $d_i^T Q d_j = 0$ for $i \neq j$. In addition, these directions are computed from (but are not equal to) the gradient.
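The conjugacy property can be checked numerically. The sketch below assumes the standard conjugate gradient updates with matrix $Q$, right-hand side $b$ and starting point zero; it records every search direction and leaves it to the caller to form the products $d_i^T Q d_j$. The function name `cg_directions` and the 3 by 3 test matrix are hypothetical choices made for illustration.

```python
def mat_vec(Q, v):
    """Return the product Q v for a matrix stored as a list of rows."""
    return [sum(q * x for q, x in zip(row, v)) for row in Q]


def dot(u, v):
    """Return the inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


def cg_directions(Q, b):
    """Run conjugate gradients from x = 0 and return the search directions used."""
    n = len(b)
    x = [0.0] * n
    g = [-bi for bi in b]                 # residual Q x - b at x = 0
    d = [-gi for gi in g]
    directions = []
    for _ in range(n):
        if dot(g, g) == 0.0:              # already solved: no further direction
            break
        directions.append(d)
        Qd = mat_vec(Q, d)
        dQd = dot(d, Qd)
        alpha = -dot(g, d) / dQd
        x = [xi + alpha * di for xi, di in zip(x, d)]
        g = [qx - bi for qx, bi in zip(mat_vec(Q, x), b)]
        beta = dot(g, Qd) / dQd
        d = [-gi + beta * di for gi, di in zip(g, d)]
    return directions


# Hypothetical symmetric positive definite test matrix and right-hand side.
Q = [[4.0, 1.0, 0.0], [1.0, 3.0, 1.0], [0.0, 1.0, 2.0]]
dirs = cg_directions(Q, [1.0, 2.0, 3.0])
# Off-diagonal products d_i^T Q d_j; each should vanish up to roundoff.
cross = [dot(dirs[i], mat_vec(Q, dirs[j]))
         for i in range(len(dirs)) for j in range(i + 1, len(dirs))]
```

In floating-point arithmetic the entries of `cross` are not exactly zero, which is one reason the finite-termination property holds only approximately in practice.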
The conjugate gradient method has been generalized to the case where the function being minimized is only approximately quadratic. In that case the explicit formula for $\alpha_k$ given above is replaced by a line-search procedure; this is a trial and error method in which various values of $\alpha_k$ are tried, and the value that leads to the smallest value of the objective is chosen. Well-known generalized conjugate gradient methods include the Fletcher-Reeves method and the Polak-Ribière method.
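A rough sketch of such a generalized method follows. It uses the Fletcher-Reeves multiplier (the ratio of the new to the old squared gradient norm) together with a deliberately crude trial-and-error line search that tries step sizes 1, 1/2, 1/4, ... and keeps the best one. The periodic restart every $n$ iterations is a common safeguard added here as an assumption, not something stated above. All names (`grid_line_search`, `fletcher_reeves`, `f`, `grad_f`) and the test objective are hypothetical.

```python
def dot(u, v):
    """Return the inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


def grid_line_search(f, x, d, trials=40):
    """Try steps 1, 1/2, 1/4, ... along d; return the one with the lowest f.

    Returns 0.0 if no trial step improves on the current point.
    """
    best_alpha, best_value = 0.0, f(x)
    alpha = 1.0
    for _ in range(trials):
        value = f([xi + alpha * di for xi, di in zip(x, d)])
        if value < best_value:
            best_alpha, best_value = alpha, value
        alpha *= 0.5
    return best_alpha


def fletcher_reeves(f, grad, x0, max_iter=200, tol=1e-8):
    """Sketch of nonlinear conjugate gradients with the Fletcher-Reeves beta."""
    x = list(x0)
    n = len(x)
    g = grad(x)
    d = [-gi for gi in g]
    for k in range(max_iter):
        if dot(g, g) ** 0.5 <= tol:
            break
        alpha = grid_line_search(f, x, d)
        x = [xi + alpha * di for xi, di in zip(x, d)]
        g_new = grad(x)
        if (k + 1) % n == 0:
            d = [-gi for gi in g_new]                # restart with steepest descent
        else:
            beta = dot(g_new, g_new) / dot(g, g)     # Fletcher-Reeves multiplier
            d = [-gi + beta * di for gi, di in zip(g_new, d)]
        g = g_new
    return x


# Hypothetical, mildly non-quadratic objective with its minimum at (1, -2).
def f(x):
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2 + 0.1 * (x[0] - 1.0) ** 4


def grad_f(x):
    return [2.0 * (x[0] - 1.0) + 0.4 * (x[0] - 1.0) ** 3, 2.0 * (x[1] + 2.0)]


x_min = fletcher_reeves(f, grad_f, [0.0, 0.0])
```

The Polak-Ribière variant differs only in the formula for `beta`; production codes also replace the grid search with a line search that enforces sufficient-decrease conditions.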
Example
Solve
We have
then
and finally
which is the solution.
References
Luenberger: Introduction to Linear and Nonlinear Programming, Addison-Wesley, 1973
Jonathan Richard Shewchuk: An Introduction to the Conjugate Gradient Method Without the Agonizing Pain, August 1994. http://www-2.cs.cmu.edu/~jrs/jrspapers.html [A detailed derivation of the method from first principles].
Press, et al.: Numerical Recipes in C, Cambridge University Press, 1995 [Chapter 10.6 contains an implementation of the generalized conjugate gradient method of Polak and Ribiere].
Title | conjugate gradient algorithm |
---|---|
Canonical name | ConjugateGradientAlgorithm |
Date of creation | 2013-03-22 14:58:54 |
Last modified on | 2013-03-22 14:58:54 |
Owner | aplant (12431) |
Last modified by | aplant (12431) |
Numerical id | 16 |
Author | aplant (12431) |
Entry type | Algorithm |
Classification | msc 15A06 |
Classification | msc 90C20 |
Synonym | method of conjugate gradients |