The general problem to be solved by the least squares method is this: given some direct measurements $y$ of random variables, and knowing a set of equations $f$ which have to be satisfied by these measurements (possibly involving unknown parameters $x$), find the set of $x$ which comes closest to satisfying
$$f(x, y) = 0,$$
where “closest” is defined by a correction $\Delta y$ such that
$$f(x, y + \Delta y) = 0 \quad \text{and} \quad \Delta y^2 = \text{minimum}.$$
The sum of squares of elements of a vector can be written in different ways:
$$\Delta y^2 = \Delta y^T \Delta y = \|\Delta y\|^2 = \sum_i \Delta y_i^2.$$
The assumption has been made here that the elements of $y$ are statistically uncorrelated and have equal variance. For this case, the above solution results in the most efficient estimators for $x$, $\Delta y$. If the $y$ are correlated, correlations and variances are defined by a covariance matrix $C$, and the above minimum condition becomes
$$\Delta y^T C^{-1} \Delta y = \text{minimum}.$$
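The two minimum conditions can be compared numerically. The following is a minimal sketch, not part of the original entry: the correction vector `dy`, the covariance matrix `C`, and both function names are illustrative assumptions, and the numbers are made up.

```python
import numpy as np

def sum_of_squares(dy):
    """Unweighted criterion: sum of squared elements of dy."""
    dy = np.asarray(dy, dtype=float)
    return float(dy @ dy)

def weighted_sum_of_squares(dy, C):
    """Weighted criterion dy^T C^{-1} dy for a covariance matrix C."""
    dy = np.asarray(dy, dtype=float)
    C = np.asarray(C, dtype=float)
    # Solve C z = dy rather than forming the inverse of C explicitly.
    z = np.linalg.solve(C, dy)
    return float(dy @ z)

# Hypothetical corrections and a hypothetical (positive definite) covariance.
dy = np.array([0.5, -1.0, 2.0])
C = np.array([[1.0, 0.2, 0.0],
              [0.2, 2.0, 0.3],
              [0.0, 0.3, 0.5]])

q_unweighted = sum_of_squares(dy)
q_weighted = weighted_sum_of_squares(dy, C)
# With uncorrelated, unit-variance measurements the two criteria coincide.
q_identity = weighted_sum_of_squares(dy, np.eye(3))
```

Solving the linear system instead of inverting `C` is a common numerical choice; it gives the same quadratic form.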
Least squares solutions can be more or less simple, depending on the constraint equations $f$. If there is exactly one equation for each measurement, and the functions $f$ are linear in the elements of the measurements $y$ and the parameters $x$, the solution is discussed under linear regression. For other linear models, see Linear Least Squares. Least squares methods applied to few parameters can lend themselves to very efficient algorithms (e.g. in real-time processing), as they reduce to simple matrix operations.
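As an illustration of the "simple matrix operations" case, here is a minimal sketch of a straight-line fit with one linear equation per measurement, solved through the normal equations. The model, the data values, and the names `fit_line`, `t`, `x_line`, and `dy_line` are assumptions made for this example only.

```python
import numpy as np

def fit_line(t, y):
    """Fit y ~ a + b*t by least squares; return parameters and corrections."""
    t = np.asarray(t, dtype=float)
    y = np.asarray(y, dtype=float)
    # Design matrix: one row (one linear equation) per measurement.
    A = np.column_stack([np.ones_like(t), t])
    # Normal equations (A^T A) x = A^T y, a 2x2 system for two parameters.
    x = np.linalg.solve(A.T @ A, A.T @ y)
    # Corrections that make the corrected measurements satisfy the model exactly.
    dy = A @ x - y
    return x, dy

# Hypothetical measurements at five abscissa values.
t = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = np.array([1.1, 2.9, 5.2, 6.8, 9.1])
x_line, dy_line = fit_line(t, y)
```

For ill-conditioned design matrices, a QR- or SVD-based solver such as `np.linalg.lstsq` is usually preferred over the normal equations; the version above is chosen only because it makes the matrix operations explicit.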
If the constraint equations are non-linear, one typically solves by linearization and in iterations, using approximate values of the parameters $x$ and the corrections $\Delta y$ in every step, and linearizing by forming the matrix of derivatives $\partial f/\partial x$ (the Jacobian matrix) and possibly also $\partial f/\partial y$ at the last point of approximation.
Note that as the iterative improvements $\delta x$, $\delta y$ tend towards zero (if the process converges), $\Delta y$ converges towards a final value which enters the minimum equation above.
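The iteration can be sketched for the special case of one equation per measurement, where the constraint reads "measurement minus model equals zero". The sketch below is a plain Gauss-Newton iteration under assumptions that are not in the original entry: an exponential-decay model, noise-free synthetic data, and the names `model`, `jacobian`, and `gauss_newton`.

```python
import numpy as np

def model(x, t):
    """Hypothetical non-linear model a * exp(-b * t) with parameters x = (a, b)."""
    a, b = x
    return a * np.exp(-b * t)

def jacobian(x, t):
    """Matrix of derivatives of the model with respect to a and b."""
    a, b = x
    e = np.exp(-b * t)
    return np.column_stack([e, -a * t * e])

def gauss_newton(t, y, x0, tol=1e-12, max_iter=50):
    """Iterate linearized least squares steps until the improvement is tiny."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        r = y - model(x, t)          # residuals at the current approximation
        J = jacobian(x, t)           # linearize at the last point
        # Solve the linearized least squares problem J dx ~ r.
        dx, *_ = np.linalg.lstsq(J, r, rcond=None)
        x = x + dx
        if np.linalg.norm(dx) < tol: # improvements tend towards zero
            break
    dy = model(x, t) - y             # final corrections to the measurements
    return x, dy

# Synthetic, noise-free data generated with a = 2.0, b = 0.5.
t = np.linspace(0.0, 4.0, 9)
y = 2.0 * np.exp(-0.5 * t)
x_hat, dy_hat = gauss_newton(t, y, x0=[1.5, 0.3])
```

This sketch has no safeguard against divergence; with a poor starting point a damped step or a general-purpose minimizer may be needed.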
Algorithms avoiding the explicit calculation of the derivatives $\partial f/\partial x$ and $\partial f/\partial y$ have also been investigated, e.g. [1]; for a discussion, see [2]. Where convergence (or control over convergence) is problematic, use of a general package for minimization may be indicated.
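One simple way to avoid coding analytic derivatives is to approximate the Jacobian by forward differences. The sketch below does this; it is not the Dud algorithm of [1], which uses a secant-type approximation. The function names, the step size `h`, and the test model are all assumptions made for illustration.

```python
import numpy as np

def numerical_jacobian(g, x, h=1e-6):
    """Forward-difference approximation of the derivatives of g at x."""
    x = np.asarray(x, dtype=float)
    g0 = g(x)
    J = np.empty((g0.size, x.size))
    for j in range(x.size):
        step = np.zeros_like(x)
        step[j] = h * max(1.0, abs(x[j]))  # scale the step with the parameter
        J[:, j] = (g(x + step) - g0) / step[j]
    return J

def fit_without_derivatives(g, y, x0, tol=1e-10, max_iter=100):
    """Gauss-Newton style iteration using only evaluations of the model g."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        r = y - g(x)
        J = numerical_jacobian(g, x)
        dx, *_ = np.linalg.lstsq(J, r, rcond=None)
        x = x + dx
        if np.linalg.norm(dx) < tol:
            break
    return x

# Synthetic, noise-free data from a hypothetical model a * exp(-b * t).
t = np.linspace(0.0, 4.0, 9)
y = 2.0 * np.exp(-0.5 * t)
x_fd = fit_without_derivatives(lambda x: x[0] * np.exp(-x[1] * t), y, [1.5, 0.3])
```

Each iteration costs one extra model evaluation per parameter, which is the usual trade-off against hand-coded derivatives.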
- 1 M.L. Ralston and R.I. Jennrich, Dud, a Derivative-free Algorithm for Non-linear Least Squares, Technometrics 20-1 (1978) 7.
- 2 W.H. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery, Numerical Recipes in C, Second edition, Cambridge University Press, 1995.
Note: This entry is based on content from The Data Analysis Briefbook (http://rkb.home.cern.ch/rkb/titleA.html).
|Date of creation||2013-03-22 12:05:47|
|Last modified on||2013-03-22 12:05:47|
|Last modified by||akrowne (2)|
|Synonym||least squares problem|