Skip to main content

Section 6.2 The Characteristic Equation

In the previous section we were able to check whether or not a given vector (scalar) was an eigenvector (eigenvalue) for a matrix \(A\text{.}\) What was missing was a process for finding the eigenvalues and eigenvectors for a matrix from scratch. This section will outline such a process. (Once we find the eigenvalues, we can use the method of ExampleΒ 6.1.9 to find bases for the eigenspaces.)

Subsection 6.2.1 How to Find Eigenvalues

We recall from SectionΒ 6.1 that a scalar \(\lambda \in \ff\) is an eigenvalue for a matrix \(A \in M_n(\ff)\) if there is a nontrivial solution to the equation \((A- \lambda I)\bfx = \bfo\text{.}\) Since this happens precisely when \(A - \lambda I\) is not invertible, by TheoremΒ 5.4.13 we can use the determinant to find the eigenvalues for \(A\text{.}\)
Specifically, \(\lambda\) is an eigenvalue for \(A\) if and only if \(\det(A - \lambda I) = 0\text{.}\) If we consider \(\lambda\) as a variable, we can begin the process of finding the eigenvalues for square matrices. We will illustrate this method in the following example before making all of the steps explicit (with the appropriate notation and terminology).

Example 6.2.1.

Let \(A \in M_2(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 2 \amp -2 \\ 2 \amp 7 \end{bmatrix}\text{.} \end{equation*}
A scalar \(\lambda\) is an eigenvalue for \(A\) if and only if \(\det(A - \lambda I) = 0\text{.}\) Since \(A - \lambda I\) is \(2\times 2\text{,}\) we can find the determinant with ease:
\begin{equation*} \det (A - \lambda I) = \begin{vmatrix} 2-\lambda \amp -2 \\ 2 \amp 7-\lambda \end{vmatrix} = (2-\lambda)(7-\lambda) + 4\text{.} \end{equation*}
With some algebra, we can see that
\begin{equation*} \det(A - \lambda I) = \lambda^2 - 9\lambda + 18 = (\lambda - 6)(\lambda - 3)\text{.} \end{equation*}
Since \(\lambda\) is an eigenvalue when \(\det(A - \lambda I)=0\text{,}\) we see that the eigenvalues of \(A\) are \(\lambda = 6\) and \(\lambda = 3\text{.}\)
Let’s verify this method by showing that \(\lambda=6\) and \(\lambda=3\) are eigenvalues for \(A\) and finding bases for their eigenspaces. First, if \(\lambda = 6\text{,}\) then
\begin{equation*} A - 6 I = \begin{bmatrix} -4 \amp -2 \\ 2 \amp 1 \end{bmatrix} \sim \begin{bmatrix} 1 \amp \frac{1}{2} \\ 0 \amp 0 \end{bmatrix}\text{.} \end{equation*}
From this we see that \(\mathrm{eig}_6(A)\) is one-dimensional with basis \(\left\{ \begin{bmatrix} -1 \\ 2 \end{bmatrix} \right\}\text{.}\)
Now, when \(\lambda = 3\text{,}\) we have
\begin{equation*} A - 3 I = \begin{bmatrix} -1 \amp -2 \\ 2 \amp 4 \end{bmatrix} \sim \begin{bmatrix} 1 \amp 2 \\ 0 \amp 0 \end{bmatrix}\text{.} \end{equation*}
Thus \(\mathrm{eig}_3(A)\) is also one-dimensional with basis \(\left\{ \begin{bmatrix} -2 \\ 1 \end{bmatrix} \right\}\text{.}\)
We now give some names and notation to the elements of this method to find the eigenvalues of a matrix.

Definition 6.2.2.

If \(A \in M_n(\ff)\text{,}\) then \(\det(A - \lambda I)=0\) is called the characteristic equation of \(A\text{.}\)
Based on our previous discussion, we can say that \(\lambda\) is an eigenvalue of \(A\) if and only if \(\lambda\) satisfies the characteristic equation of \(A\text{.}\)
It is occasionally useful to have a term and notation for just one piece of the characteristic equation.

Definition 6.2.3.

For a matrix \(A \in M_n(\ff)\text{,}\) \(\det(A - \lambda I)\) is a degree \(n\) polynomial referred to as the characteristic polynomial of \(A\text{.}\) Sometimes the notation \(p_A(\lambda)\) is used for this polynomial.
We will see that the characteristic equation can get quite difficult to solve as the size of the matrix increases. The next example provides a glimpse of this for a \(3\times 3\) matrix.

Example 6.2.4.

Let \(A \in M_3(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 1 \amp -3 \amp -1 \\ 2 \amp 6 \amp 1 \\ -4 \amp -10 \amp -1 \end{bmatrix} \text{.} \end{equation*}
To find the characteristic polynomial of \(A\text{,}\) we will use cofactor expansion along the first row of \(A - \lambda I\text{:}\)
\begin{align*} \det(A - \lambda I) \amp = (1-\lambda)\begin{vmatrix} 6-\lambda \amp 1 \\ -10 \amp -1-\lambda \end{vmatrix} + 3 \begin{vmatrix} 2 \amp 1 \\ -4 \amp -1-\lambda \end{vmatrix} - 1 \begin{vmatrix} 2 \amp 6-\lambda \\ -4 \amp -10 \end{vmatrix} \\ \amp = (1-\lambda)(\lambda^2 - 5\lambda + 4) + 3(2-2\lambda) - (4-4\lambda)\\ \amp = -\lambda^3 + 6\lambda^2 - 11\lambda + 6\text{.} \end{align*}
For the matrix \(A\) given above, we have \(p_A(\lambda) = -\lambda^3 + 6\lambda^2 - 11\lambda + 6\text{.}\)
Since one can solve a quadratic equation easilyβ€”either by factoring or using the quadratic formulaβ€”some mistakenly think that all polynomial equations are easy to solve. Not so. As we consider the characteristic polynomial for the matrix in the previous example, we find the task of finding the eigenvalues quite challenging.
While there is a formula for solving a cubic polynomial, it is not nearly as nice as the quadratic formula. It is, in fact, something that should rarely see the light of day. It is ugly.
In practice, the eigenvalues of an \(n\times n\) matrix are usually found with a computer algebra system when \(n \ge 3\text{.}\) However, there is one way that we could describe an eigenvalue problem and still have a reasonable expectation that the reader could find a solution.

Example 6.2.5.

Consider the following matrix \(A \in M_3(\rr)\) (from the previous example):
\begin{equation*} A = \begin{bmatrix} 1 \amp -3 \amp -1 \\ 2 \amp 6 \amp 1 \\ -4 \amp -10 \amp -1 \end{bmatrix} \text{.} \end{equation*}
If we know that \(\lambda = 1\) is an eigenvalue for \(A\text{,}\) we can find all of the eigenvalues.
Our previous work showed that \(p_A(\lambda) = -\lambda^3 + 6\lambda^2 - 11\lambda + 6\text{.}\) If we know that \(\lambda=1\) is an eigenvalue, that means that \((\lambda - 1)\) must be a factor of this polynomial. Using long division of polynomials, we can complete the factorization:
\begin{equation*} p_A(\lambda) = (\lambda - 1)(-\lambda^2 + 5\lambda - 6) = -(\lambda - 1)(\lambda - 2)(\lambda - 3)\text{.} \end{equation*}
This shows that the eigenvalues for \(A\) are \(\lambda = 1, 2, 3\text{.}\)
When a characteristic polynomial is factored to reveal the eigenvalues, we might see some of those eigenvalues repeated. We now introduce a term to describe this repetition.

Definition 6.2.6.

The algebraic multiplicity of an eigenvalue is the number of times it appears as a root of the characteristic polynomial for a matrix. We will often refer to this number as an eigenvalue’s multiplicity.

Example 6.2.7.

We consider the matrix from ExampleΒ 6.1.9 in the previous section:
\begin{equation*} A = \begin{bmatrix} 4.5 \amp -2.5 \amp -2.5 \\ 2.5 \amp -0.5 \amp -2.5 \\ 5 \amp -5 \amp -3 \end{bmatrix}\text{.} \end{equation*}
The characteristic polynomial for this matrix is
\begin{equation*} p_A(\lambda) = -\lambda^3 + \lambda^2 + 8\lambda - 12 = -(\lambda - 2)^2(\lambda + 3)\text{.} \end{equation*}
From this we see that \(A\) has an eigenvalue of 2 with multiplicity 2 and an eigenvalue of \(-3\) with multiplicity 1.
Before we move to the second half of this section, we point out that not every matrix over \(\ff\) has eigenvalues in \(\ff\text{.}\) The following example illustrates this, but we can understand this intuitively by recognizing that (for example) not every polynomial over \(\rr\) can be completely factored into linear terms.

Example 6.2.8.

Let \(A \in M_2(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} -5 \amp -4 \\ 5 \amp -6 \end{bmatrix}\text{.} \end{equation*}
We can calculate the characteristic polynomial for \(A\text{:}\)
\begin{align*} \det(A - \lambda I) \amp = \begin{vmatrix} -5-\lambda \amp -4 \\ 5 \amp -6-\lambda \end{vmatrix} \\ \amp = (-5-\lambda)(-6-\lambda) + 20 \\ \amp = \lambda^2+11\lambda+50\text{.} \end{align*}
This quadratic polynomial has no real roots, as the reader can check by verifying that the discriminant (the expression under the square root in the quadratic formula) is negative.
If \(A\) is considered as a matrix over \(\cc\text{,}\) then it has two complex eigenvalues. But it has no real eigenvalues.
One of the major differences between \(\rr\) and \(\cc\) is that \(\cc\) is an algebraically closed field, while \(\rr\) is not. The fact that \(\cc\) is algebraically closed is another way to restate the famous Fundamental Theorem of Algebra, which says that every polynomial with coefficients in \(\cc\) can be completely reduced to linear factors. There are interesting, deep waters to explore here, but perhaps not in the present text.

Subsection 6.2.2 Similarity

Earlier in this book we discussed one way matrices can be relatedβ€”row equivalence. We now introduce another relationship which has connections to the characteristic equation.

Definition 6.2.9.

Suppose that \(A, B \in M_n(\ff)\text{.}\) Then \(A\) is similar to \(B\) if there exists an invertible matrix \(P \in M_n(\ff)\) such that \(P^{-1}AP = B\) or \(A = PBP^{-1}\text{.}\)

Note 6.2.10.

Since we could write \(Q = P^{-1}\) in the previous definition and have \(Q^{-1}BQ = A\) when \(P^{-1}AP = B\text{,}\) we see that \(B\) is similar to \(A\) when \(A\) is similar to \(B\text{.}\) For this reason, we can say that \(A\) and \(B\) are similar.
The next theorem presents one of the most compelling reasons for studying similarity in matrices.

Proof.

If \(A\) and \(B\) are similar, then there exists an invertible matrix \(P\) such that \(B = P^{-1}AP\text{.}\) When forming \(B - \lambda I\text{,}\) we have
\begin{equation*} B - \lambda I = P^{-1}AP - \lambda P^{-1}P = P^{-1}(AP - \lambda P) = P^{-1}(A - \lambda I) P\text{.} \end{equation*}
Since the determinant is multiplicative (TheoremΒ 4.2.13), we have
\begin{align*} \det(B - \lambda I) \amp = \det(P^{-1})\det(A - \lambda I) \det(P) \\ \amp = \det(P^{-1})\det(P)\det(A - \lambda I)\\ \amp = \det(P^{-1}P)\det(A - \lambda I) \\ \amp = \det(I)\det(A - \lambda I)\\ \amp = \det(A - \lambda I)\text{.} \end{align*}
This shows that \(A\) and \(B\) have the same characteristic polynomials.
The next example showcases two similar matrices.

Example 6.2.12.

Let \(A\) and \(B\) be the following matrices over \(\rr\text{:}\)
\begin{equation*} A = \begin{bmatrix} -10 \amp -5 \\ 24 \amp 13 \end{bmatrix}, \hspace{12pt} B = \begin{bmatrix} 1 \amp -4 \\ -3 \amp 2 \end{bmatrix}\text{.} \end{equation*}
We will show that these are similar matrices by considering the matrix \(P\text{:}\)
\begin{equation*} P = \begin{bmatrix} -4 \amp -1 \\ -5 \amp -2 \end{bmatrix}\text{.} \end{equation*}
The reader can check that \(P\) is invertible and that \(A = P^{-1}BP\text{.}\) This shows that \(A\) and \(B\) are similar.
Further calculation shows that \(p_B(\lambda) = \lambda^2-3\lambda-10\) and that this is the same as \(p_A(\lambda)\text{.}\)
It is important to note the logical implication that TheoremΒ 6.2.11 provides with care. If two matrices are similar, then they have the same eigenvalues. However, we may have two matrices with the same eigenvalues which are not similar.

Example 6.2.13.

Let \(A, B \in M_2(\rr)\) be the following matrices:
\begin{equation*} A = \begin{bmatrix} 5 \amp 0 \\ 0 \amp 5 \end{bmatrix}, \hspace{12pt} B = \begin{bmatrix} 5 \amp -2 \\ 0 \amp 5 \end{bmatrix}\text{.} \end{equation*}
Since both \(A\) and \(B\) are triangular matrices, we can see that they have the same eigenvalue (with the same multiplicity). However, these matrices are not similar. In fact, we will show in the exercises that a matrix \(kI\text{,}\) for \(k \in \ff\text{,}\) is only similar to itself. In this example, \(A = 5I\text{.}\)
We end this section with a small note of caution. Similarity and row equivalence are not the same thing, and the distinction between these two relations is important to keep in mind. In particular, elementary row operations usually alter the eigenvalues of a matrix, while we have seen that a similarity transformation does not.

Reading Questions 6.2.3 Reading Questions

1.

Let \(A\) be the following matrix over \(\rr\text{:}\)
\begin{equation*} A = \begin{bmatrix} 2 \amp -1 \\ -6 \amp -3 \end{bmatrix}\text{.} \end{equation*}
  1. Write down the characteristic equation for \(A\text{.}\) (Feel free to use \(t\) instead of \(\lambda\) if typing \(\lambda\) is difficult.)
  2. Determine the two eigenvalues for \(A\text{.}\)

2.

Consider the matrix \(A\) from the previous reading question. Write down two separate matrices which are similar to \(A\text{.}\) Explain how you obtained these matrices. (There are many, many correct answers!)

Exercises 6.2.4 Exercises

1.

Let \(A \in M_2(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} -7 \amp 8 \\ -6 \amp 7 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial and the eigenvalues of \(A\text{.}\)

2.

Let \(A \in M_2(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 7 \amp -3 \\ 6 \amp -2 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial and the eigenvalues of \(A\text{.}\)
Answer.
The characteristic polynomial is \(p_A(\lambda) = \lambda^2 - 5\lambda + 4\text{.}\) The eigenvalues are \(\lambda = 1\) and \(\lambda = 4\text{.}\)

3.

Let \(A \in M_2(\ff_5)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 0 \amp 2 \\ 1 \amp 1 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial and the eigenvalues of \(A\text{.}\)

4.

Let \(A \in M_2(\ff_5)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 2 \amp 4 \\ 4 \amp 2 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial and the eigenvalues of \(A\text{.}\)
Answer.
The characteristic polynomial is \(p_A(\lambda) = \lambda^2 + \lambda + 3\text{,}\) and the eigenvalues are \(\lambda=1\) and \(\lambda=3\text{.}\)

5.

Let \(A \in M_3(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 3 \amp 0 \amp 0 \\ 5 \amp -1 \amp 1 \\ 3 \amp -1 \amp 1 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial and the eigenvalues of \(A\) by hand.

6.

Let \(A \in M_3(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 2 \amp 6 \amp 6 \\ 0 \amp -1 \amp 0 \\ -3 \amp -6 \amp -7 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial and the eigenvalues of \(A\) by hand.
Answer.
The characteristic polynomial is \(p_A(\lambda) = -\lambda^3 - 6\lambda^2 - 9\lambda - 4\text{,}\) but it is easier in this problem NOT to expand the calculation to this form. The characteristic polynomial as calculated is \(p_A(\lambda) = (-1-\lambda)(\lambda^2 + 5\lambda + 4)\text{.}\) The eigenvalues are \(\lambda=-4\) and \(\lambda = -1\text{.}\)

7.

Let \(A \in M_3(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 5 \amp 8 \amp -8 \\ -4 \amp -11 \amp 16 \\ 0 \amp -4 \amp 9 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial of \(A\) by hand. If \(\lambda = 1\) is an eigenvalue of \(A\text{,}\) find the other eigenvalues of \(A\) by hand.

8.

Let \(A \in M_3(\rr)\) be the following matrix:
\begin{equation*} A = \begin{bmatrix} 1 \amp -4 \amp -4 \\ -2 \amp -1 \amp -4 \\ 4 \amp 4 \amp 9 \end{bmatrix}\text{.} \end{equation*}
Find the characteristic polynomial of \(A\) by hand. If \(\lambda = 3\) is an eigenvalue of \(A\text{,}\) find the other eigenvalues of \(A\) by hand.
Answer.
The characteristic polynomial is \(p_A(\lambda) = -\lambda^3+9\lambda^2-23\lambda+15\text{.}\) The eigenvalues are \(\lambda = 1\text{,}\) \(\lambda = 3\text{,}\) and \(\lambda = 5\text{.}\)

Writing Exercises

9.
Let \(A \in M_n(\rr)\) have \(n\) real eigenvalues, \(\lambda_1, \ldots, \lambda_n\text{,}\) repeated according to their multiplicities, so that
\begin{equation*} \det(A - \lambda I) = (\lambda_1 - \lambda) \cdots (\lambda_n - \lambda)\text{.} \end{equation*}
Prove that \(\det(A)\) is the product of the \(n\) eigenvalues of \(A\text{.}\)
10.
Prove that similarity is an equivalence relation on the set \(M_n(\ff)\text{.}\)
Solution.
For the purposes of notation, we will say that \(A \sim B\) if \(A\) is similar to \(B\text{.}\) We first note that a matrix \(A \in M_n(\ff)\) is similar to itself since \(A = IPI^{-1}\text{,}\) where \(I\) is the \(n\times n\) identity matrix. This proves that the relation is reflexive.
We now suppose that \(A, B \in M_n(\ff)\) with \(A \sim B\text{.}\) This means that there is an invertible matrix \(P\) such that \(A = PBP^{-1}\text{.}\) We can manipulate this equation to give \(B = P^{-1}AP\text{,}\) and since \(P^{-1}\) is invertible (and since \((P^{-1})^{-1}=P\)), this proves that \(B \sim A\text{.}\) This proves that the relation is symmetric.
We now suppose that \(A, B, C \in M_n(\ff)\) with \(A \sim B\) and \(B \sim C\text{.}\) This means that we have invertible matrices \(P\) and \(Q\) such that \(A = PBP^{-1}\) and \(B = QCQ^{-1}\text{.}\) Putting these equations together, we have
\begin{equation*} A = P(QCQ^{-1})P^{-1} = (PQ)C(Q^{-1}P^{-1}) = (PQ)C(PQ)^{-1}\text{.} \end{equation*}
In this derivation we have used PropositionΒ 3.3.3. Since \(PQ\) is invertible (again by PropositionΒ 3.3.3), this proves that \(A \sim C\text{,}\) which shows that the relation is transitive.
We have shown that the relation of similarity on the set \(M_n(\ff)\) is reflexive, symmetric, and transitive, so it is an equivalence relation.
11.
Let \(I_n \in M_n(\ff)\) be the identity matrix and let \(k \in \ff\) be a scalar. If \(A = kI_n\text{,}\) prove that the only matrix \(A\) is similar to is itself.
12.
Prove that if \(A\) and \(B\) are similar, then \(\det(A) = \det(B)\text{.}\)
13.
Let \(A \in M_n(\ff)\text{.}\) Prove that \(\mathrm{rank}(A) = n\) if and only if \(p_A(0) \neq 0\text{.}\)