Diagonalizable matrix
Encyclopedia : D : DI : DIA : Diagonalizable matrix
In linear algebra, a square matrix A is called diagonalizable if it is similar to a diagonal matrix, i.e. if there exists an invertible matrix P such that P -1AP is a diagonal matrix. If V is a finite-dimensional vector space, then a linear map T : V → V is called diagonalizable if there exists a basis of V with respect to which T is represented by a diagonal matrix. Diagonalization is the process of finding a corresponding diagonal matrix for a diagonalizable matrix or linear map.
Diagonalizable matrices and maps are of interest because diagonal matrices are especially easy to handle: their eigenvalues and eigenvectors are known and one can raise a diagonal matrix to a power by simply raising the diagonal entries to that same power.
The fundamental fact about diagonalizable maps and matrices is expressed by the following:
- An n-by-n matrix A over the field F is diagonalizable if and only if the sum of the dimensions of its eigenspaces is equal to n, which is the case if and only if there exists a basis of Fn consisting of eigenvectors of A. If such a basis has been found, one can form the matrix P having these basis vectors as columns, and P -1AP will be a diagonal matrix. The diagonal entries of this matrix are the eigenvalues of A.
- A linear map T : V → V is diagonalizable if and only if the sum of the dimensions of its eigenspaces is equal to dim(V), which is the case if and only if there exists a basis of V consisting of eigenvectors of T. With respect to such a basis, T will be represented by a diagonal matrix. The diagonal entries of this matrix are the eigenvalues of T.
The following sufficient (but not necessary) condition is often useful.
- An n-by-n matrix A is diagonalizable over the field F if it has n distinct eigenvalues in F, i.e. if its characteristic polynomial has n distinct roots in F.
- A linear map T : V → V with n=dim(V) is diagonalizable if it has n distinct eigenvalues, i.e. if its characteristic polynomial has n distinct roots in F.
The same is not true over R. As n increases, it becomes (in some sense) less and less likely that a randomly selected real matrix is diagonalizable over R.
Examples
How to diagonalize a matrix
Consider a matrix
- [A=\begin1 & 2 & 0 \\0 & 3 & 0 \\2 & -4 & 2 \end.]
- [ \lambda_1 = 3, \quad \lambda_2 = 2, \quad \lambda_3= 1. ]
If we want to diagonalize A, we need to compute the corresponding eigenvectors. They are
- [ v_1 = \begin -1 \\ -1 \\ 2 \end, \quad v_2 = \begin 0 \\ 0 \\ 1 \end, \quad v_3 = \begin -1 \\ 0 \\ 2 \end. ]
Now, let P be the matrix with these eigenvectors as its columns:
- [P=\begin-1 & 0 & -1 \\-1 & 0 & 0 \\2 & 1 & 2 \end.]
- [P^AP =\begin0 & -1 & 0 \\2 & 0 & 1 \\-1 & 1 & 0 \end\begin1 & 2 & 0 \\0 & 3 & 0 \\2 & -4 & 2 \end\begin-1 & 0 & -1 \\-1 & 0 & 0 \\2 & 1 & 2 \end =\begin3 & 0 & 0 \\0 & 2 & 0 \\0 & 0 & 1\end.]
Matrices that are not diagonalizable
Some real matrices are not diagonalizable over the reals. Consider for instance the matrix
- [ B = \begin 0 & 1 \\ -1 & 0 \end. ]
- [ Q = \begin 1 & \textrm \\ \textrm & 1 \end, ]
However, there are also matrices that are not diagonalizable, even if complex numbers are used. This happens if the geometric and algebraic multiplicities of an eigenvalues do not coincide. For instance, consider
- [ C = \begin 0 & 1 \\ 0 & 0 \end. ]
An application
Diagonalization can be used to compute the powers of a matrix A efficiently, provided the matrix is diagonalizable. Suppose we have found that
- [P^AP = D]
- [A^k = (PDP^)^k = PD^kP^]
For example, consider the following matrix:
- [M =\begina & b-a \\ 0 &b \end.]
- [M^2 = \begina^2 & b^2-a^2 \\ 0 &b^2 \end,\quadM^3 = \begina^3 & b^3-a^3 \\ 0 &b^3 \end,\quadM^4 = \begina^4 & b^4-a^4 \\ 0 &b^4 \end,\quad \ldots]
- [\mathbf=\begin 1 \\ 0 \end=\mathbf_1,\quad \mathbf=\begin 1 \\ 1 \end=\mathbf_1+\mathbf_2,]
- [ \mathbf_1 = \mathbf,\qquad \mathbf_2 = \mathbf-\mathbf.]
- [M\mathbf = a\mathbf,\qquad M\mathbf=b\mathbf.]
- [ M^n \mathbf = a^n\, \mathbf,\qquad M^n \mathbf=b^n\,\mathbf.]
- [ M^n \mathbf_1 = M^n \mathbf = a^n \mathbf_1,]
- [ M^n \mathbf_2 = M^n (\mathbf-\mathbf) = b^n \mathbf - a^n\mathbf = (b^n-a^n) \mathbf_1+b^n\mathbf_2.]
- [M^n = \begina^n & b^n-a^n \\ 0 &b^n \end,]
See also
External links
References
- Roger A. Horn and Charles R. Johnson, Matrix Analysis, Chapter 1, Cambridge University Press, 1985. ISBN 0-521-30586-1 (hardback), ISBN 0-521-38632-2 (paperback).
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
