|
|
|
|
characteristic values and vectors (of a matrix)
|
(Topic)
|
|
|
Over the spectrum $\sigma(A)$ of a matrix $A$ , its eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_s$ possess multiplicities $n_1, n_2, \ldots, n_s$ , respectively, with $\sum_{k=1}^s n_k=n$ . Its associated characteristic polynomial is then factored as \begin{equation} \Delta(\lambda)\equiv |\lambda I-A|=\Pi_{k=1}^s (\lambda-\lambda_k)^{n_k}. \end{equation} Let us set $\mathrm{mult}(\lambda_k)=n_k$ for
multiplicity of $\lambda_k$ ($k=1,\ldots,s$ ). We will now prove the following theorem.
Theorem If $\sigma(A)=\{\lambda_k\}_{k=1}^s$ , $\mathrm{mult}(\lambda_k)=n_k$ , and $g(\mu)$ is a scalar polynomial, then $\sigma(g(A))=\{g(\lambda_k)\}_{k=1}^s$ , $\mathrm{mult}(g(\lambda_k))=n_k$ .
Proof. Let $g(\mu)$ be an arbitrary scalar polynomial. We want to find the characteristic values of $g(A)$ . For this purpose we split $g(\mu)$ into linear factors \begin{equation} g(\mu)=a_0\Pi_{i=1}^t(\mu-\mu_i)^{l_i}, \qquad a_0\neq 0, \qquad \sum_{i=1}^t l_i=l. \end{equation}On substitution $\mu \mapsto A$ , we have \begin{equation} g(A)=a_0\Pi_{i=1}^t(A-\mu_i I)^{l_i}, \end{equation}being $I$ the identity
matrix. Let us compute the determinant of $g(A)$ . ( Coefficient $a_0$ will be powered to $n$ , the order of the square matrix $A$ ).
because on substitution $\lambda \mapsto \mu_i$ in (1). Next we commute the binomial by introducing $(-1)^{nl}$ into the product signs and also we note that $a_0^n=a_0^{\sum_{k=1}^s n_k}=\Pi_{k=1}^s a_0^{n_k}$ , so that \begin{equation*} |g(A)|=\Pi_{k=1}^s[a_0\Pi_{i=1}^t(\lambda_k-\mu_i)^{l_i}]^{n_k}, \end{equation*}and we may use (2) for $\mu=\lambda_k$ to obtain \begin{equation} |g(A)|=\Pi_{k=1}^s g(\lambda_k)^{n_k}. \end{equation}Finally we substitute the polynomial $g(\mu)$ by $\lambda-g(\mu)$ , where $\lambda$ is an arbitrary parameter, getting for (4) \begin{equation} \Delta(g(A))\equiv|\lambda I-g(A)|=\Pi_{k=1}^s[\lambda-g(\lambda_k)]^{n_k}. \end{equation}This proves the theorem. 
As an important particular case we have: $\sigma(A^m)=\{\lambda_k^m\}_{k=1}^s$ , ($m=0, 1, \cdots $ ), $\mathrm{mult}(\lambda_k)=n_k$ .
Connection between the characteristic polynomial $\Delta(\lambda)$ and the adjugate matrix $B(\lambda)$ of $A$ .
As it is well known, the adjugate matrix $B$ of a matrix $A$ there corresponds to the algebraic complement or cofactor matrix of the transpose of $A$ . From this definition we have \begin{equation} B(\lambda)(\lambda I-A)=\Delta(\lambda)I\qquad\mathrm{and}\qquad (\lambda I-A)B(\lambda)=\Delta(\lambda)I. \end{equation}Let us suppose $\Delta(\lambda)$ is given by \begin{equation} \Delta(\lambda)=\lambda^n-\sum_{k=1}^n c_k\lambda^{n-k}. \end{equation}It is clear that the difference $\Delta(\lambda)-\Delta(\mu)$ is divisible by $\lambda-\mu$ without remainder, hence \begin{equation} \delta(\lambda,\mu)\equiv \frac{\Delta(\lambda)-\Delta(\mu)}{\lambda-\mu}=\lambda^{n-1}+(\mu-c_1)\lambda^{n-2}+(\mu^2-c_1\mu-c_2)\lambda^{n-3}+\cdots \end{equation}is a polynomial in $\lambda, \mu$ . If we replace in (8) $(\lambda, \mu)$ by the permutable matrices $(\lambda I, A)$ and recalling that
from Cayley-Hamilton theorem $\Delta(A)=0$ , then \begin{equation} \delta(\lambda I,A)(\lambda I-A)= \Delta(\lambda)I, \end{equation}which by comparing it with (6)${}_1$ we conclude that \begin{equation} B(\lambda)=\delta(\lambda I, A) \end{equation}is the desired formula by virtue of the uniqueness of the quotient. Therefore (10) and (8) let to write the adjugate $B(\lambda)$ as the matrix polynomial \begin{equation} B(\lambda)=I\lambda^{n-1}+\sum_{k=1}^{n-1}B_k\lambda^{n-k-1}, \end{equation}where ($\mu\mapsto A$
in (8)) \begin{equation} B_k=A^k-\sum_{i=1}^k c_iA^{k-i}, \qquad (k=1,\ldots, n-1), \end{equation}which can also be obtained from the recurrence equation \begin{equation} B_k=AB_{k-1}-c_kI, \qquad (k=1, \ldots, n-1; \quad B_0=I). \end{equation}What is more, \begin{equation} AB_{n-1}-c_nI=0\equiv B_n. \end{equation}(13) as well as (14) follow inmediately from (6)${}_2$ if we equate the coefficients of equal powers of $\lambda$ on both sides. Also, if we substitute
$B_{n-1}$ from (12), into (14), we get $\Delta(A)=0$ (Cayley-Hamilton), an implicit consequence of generalized Bézout theorem. On the other hand, by setting $\lambda=0$ in (7) we obtain $c_n=\Delta(0)/(-1)=|-A|/(-1)=(-1)^{n-1}|A|\neq 0$ , whenever $A$ be non- singular. From this and from (14) follow that \begin{equation} A^{-1}=\frac{1}{c_n}B_{n-1}. \end{equation}Let now $\lambda_c$ be a characteristic value of $A$ , then $\Delta(\lambda_c)=0$ and (6)${}_2$ becomes \begin{equation} (\lambda_c
I-A)B(\lambda_c)=0. \end{equation}Let us assume that $B(\lambda_c)\neq 0$ and denote by $\mathbf{b}$ an arbitrary non-zero column of this matrix. From (16) we have $(\lambda_cI-A)\mathbf{b}=\mathbf{0}$ . That is, \begin{equation} A\mathbf{b}=\lambda_c\mathbf{b}. \end{equation}Therefore every non-zero column of $B(\lambda_c)$ determines a characteristic vector corresponding to the characteristic value $\lambda_c$ . Moreover, if to the characteristic value $\lambda_c$ there correspond $l$ linearly independent characteristic vectors, $n-l$ will be the rank of $\lambda_cI-A$ and so the rank of $B(\lambda_c)$ does not exceed $l$ . In particular, if only one characteristic vector there corresponds to $\lambda_c$ , then in $B(\lambda_c)$ the elements of any two columns will be proportional (In such a case $l=1$ , hence the rank of $\lambda_cI-A$ will be $n-1$ ).
In conclusion: If the coefficients of the characteristic polynomial are known, then the adjugate matrix may be found by (10). In addition, if the given matrix $A$ is non-singular, then the inverse matrix $A^{-1}$ can be found from (15). Also if $\lambda_c$ is a characteristic value of $A$ , the non-zero columns of $B(\lambda_c)$ are characteristc vectors of A for $\lambda=\lambda_c$ .
Example. We find out the characteristic values and vectors from the matrix \begin{equation*} A= \begin{bmatrix} 3 & -3 & 2 \\ -1 & 5 & -2 \\ -1 & 3 & 0 \end{bmatrix} . \end{equation*}From (1), \begin{equation*} \Delta(\lambda)=|\lambda I-A|= \begin{vmatrix} \lambda-3 & 3 & -2 \\ 1 & \lambda-5 & 2 \\ 1 & -3 & \lambda \end{vmatrix} =\lambda^3-8\lambda^2+20\lambda-16. \end{equation*}Comparing with (7), we have \begin{equation*} c_1=8, \qquad c_2=-20, \qquad c_3=16. \end{equation*}Next we use (8), \begin{equation*} \delta(\lambda,\mu)=\frac{\Delta(\lambda)-\Delta(\mu)}{\lambda-\mu}=\lambda^2+(\mu-8)\lambda+\mu^2-8\mu+20, \end{equation*}so that from (11) \begin{equation*} B(\lambda)=\delta(\lambda I,A)=\lambda^2 I+(\underbrace{A-8I}_{B_1})\lambda+\underbrace{A^2-8A+20I}_{B_2}. \end{equation*}We
will now evaluate $B_1$ and $B_2$ by using (12) and (13), respectively. \begin{equation*} B_1=A-8I= \begin{bmatrix} -5 & -3 & 2 \\ -1 & -3 & -2 \\ -1 & 3 & -8 \end{bmatrix} , \qquad B_2=AB_1+20I= \begin{bmatrix} 6 & 6 & -4 \\ 2 & 2 & 4 \\ 2 & -6 & 12 \end{bmatrix} , \end{equation*}thus $B(\lambda)$ is \begin{equation*} B(\lambda)= \begin{bmatrix} \lambda^2-5\lambda+6 & -3\lambda+6 & 2\lambda-4 \\ -\lambda+2 & \lambda^2-3\lambda+2 & -2\lambda+4 \\ -\lambda+2 & 3\lambda-6 & \lambda^2-8\lambda+12 \end{bmatrix} . \end{equation*}Also $|A|=16$ and $A^{-1}$ is obtained from (15), i.e. \begin{equation*} A^{-1}=\frac{1}{16}B_2=\frac{1}{8} \begin{bmatrix} 3 & 3 & -2 \\ 1 &1 & 2 \\ 1 & -3 & 6 \end{bmatrix} .
\end{equation*}Furthermore, \begin{equation*} \Delta(\lambda)=(\lambda-2)^2(\lambda-4). \end{equation*}We notice the eigenvalue $\lambda=2$ possesses multiplicity $2$ and also that all the entries of the adjugate $B(\lambda)$ are divisible by the binomial $\lambda-2$ ($|B(2)|=0$ , i.e. $\lambda=2$ annihilates it), therefore it can be reduced which makes instructive this problem. Thus, \begin{equation*} C(\lambda)= \begin{bmatrix} \lambda-3 & -3 & 2 \\ -1 & \lambda-1 & -2 \\ -1 & 3 & \lambda-6 \end{bmatrix} , \end{equation*}which for $\lambda=2$ it becomes \begin{equation*} C(2)= \begin{bmatrix} -1 & -3
& 2 \\ -1 & 1 & -2 \\ -1 & 3 & -4 \end{bmatrix} . \end{equation*}From this we get the charactreristic vectors $(1,1,1)$ by multiplying the first colum by $-1$ , and also $(-3,1,3)$ , both correponding to $\lambda=2$ . Third column is a linear combination of the first two (subtract it). Likewise we find for the another characteristic value $\lambda=4$ \begin{equation*} C(4)= \begin{bmatrix} 1 & -3 & 2 \\ -1 & 3 & -2 \\ -1 & 3 & -2 \end{bmatrix} , \end{equation*}whence we get the eigenvector $(1,-1,-1)$ , being the remaining two columns clearly proportional to the first one.
|
"characteristic values and vectors (of a matrix)" is owned by perucho.
|
|
(view preamble | get metadata)
| Other names: |
eigenvalues, eigenvectors |
|
|
Cross-references: linear combination, reduced, inverse, non-singular, addition, conclusion, elements, rank, linearly independent, vector, column, singular, consequence, sides, powers, equate, equation, quotient, formula, Cayley-Hamilton theorem, permutable, remainder, divisible, difference, clear, transpose, cofactor, algebraic complement, adjugate, connection, parameter, product, binomial, square matrix, order, coefficient, determinant, identity matrix, substitution, factors, characteristic, polynomial, scalar, theorem, characteristic polynomial, multiplicities, matrix, spectrum
There are 67 references to this entry.
This is version 3 of characteristic values and vectors (of a matrix), born on 2008-01-10, modified 2008-01-22.
Object id is 10182, canonical name is CharacteristicValuesAndVectorsOfAMatrix.
Accessed 5345 times total.
Classification:
| AMS MSC: | 15A18 (Linear and multilinear algebra; matrix theory :: Eigenvalues, singular values, and eigenvectors) |
|
|
|
|
|
|
Pending Errata and Addenda
|
|
|
|
|
|
|
|
|
|
|