Skip to content

Eigenvalues and Eigenvectors

5.1 Definitions

Let AMn×n(F)A \in \mathcal{M}_{n \times n}(F). A scalar λF\lambda \in F is an eigenvalue of AA if there Exists a non-zero vector vFn\mathbf{v} \in F^n such that

Av=λvA\mathbf{v} = \lambda \mathbf{v}

The vector v\mathbf{v} is called an eigenvector corresponding to λ\lambda.

The eigenspace corresponding to λ\lambda is Eλ=ker(AλI)E_\lambda = \ker(A - \lambda I). Its dimension is The geometric multiplicity of λ\lambda.

5.2 Characteristic Polynomial

Theorem 5.1. λ\lambda is an eigenvalue of AA if and only if det(AλI)=0\det(A - \lambda I) = 0.

The polynomial p(λ)=det(AλI)p(\lambda) = \det(A - \lambda I) is called the characteristic polynomial of AA. Its roots (in the algebraic closure of FF) are the eigenvalues of AA.

If p(λ)=(lambdalambda1)m1(lambdalambda2)m2(lambdalambdak)mkp(\lambda) = (\\lambda - \\lambda_1)^{m_1}(\\lambda - \\lambda_2)^{m_2}\cdots(\\lambda - \\lambda_k)^{m_k} With lambda1,,lambdak\\lambda_1, \ldots, \\lambda_k distinct, then mim_i is the algebraic multiplicity of lambdai\\lambda_i.

Proposition 5.2. For each eigenvalue λ\lambda, 1dim(Eλ)mλ1 \leq \mathrm{dim}(E_\lambda) \leq m_\lambda (geometric multiplicity does not exceed algebraic multiplicity).

5.3 Diagonalisation

Definition. AA is diagonalisable if there exists an invertible matrix PP and a diagonal Matrix DD such that

A=PDP1A = PDP^{-1}

Theorem 5.3. AMn×n(F)A \in \mathcal{M}_{n \times n}(F) is diagonalisable (over FF) if and only if AA has nn linearly independent Eigenvectors (over FF). Equivalently, the sum of the geometric multiplicities equals nn.

Corollary 5.4. If AA has nn distinct eigenvalues, then AA is diagonalisable.

Proof. Eigenvectors corresponding to distinct eigenvalues are linearly independent. With nn distinct Eigenvalues, we obtain nn linearly independent eigenvectors, which form a basis of FnF^n. \blacksquare

5.4 Cayley—Hamilton Theorem

Theorem 5.5 (Cayley—Hamilton). Every square matrix satisfies its own characteristic polynomial: If p(λ)=det(λIA)p(\lambda) = \det(\lambda I - A)Then p(A)=0p(A) = 0 (the zero matrix).

Proof sketch. Let p(λ)=λn+cn1λn1++c1λ+c0p(\lambda) = \lambda^n + c_{n-1}\lambda^{n-1} + \cdots + c_1\lambda + c_0. By the adjugate formula (Theorem 3.5), (λIA)adj(λIA)=p(λ)I(\lambda I - A) \cdot \mathrm{adj}(\lambda I - A) = p(\lambda) \cdot I. Each entry of adj(λIA)\mathrm{adj}(\lambda I - A) is a polynomial in λ\lambda of degree at most n1n - 1 So we can write adj(λIA)=Bn1λn1++B1λ+B0\mathrm{adj}(\lambda I - A) = B_{n-1}\lambda^{n-1} + \cdots + B_1\lambda + B_0 for Matrices BiB_i. Multiplying out and comparing coefficients of λk\lambda^k:

Bn1=I,Bn2ABn1=cn1I,,AB0=c0IB_{n-1} = I, \quad B_{n-2} - AB_{n-1} = c_{n-1}I, \quad \ldots, \quad -AB_0 = c_0 I

Multiplying the kk-th equation on the left by AkA^k and summing over kk:

AnBn1+An1(Bn2ABn1)++A0(AB0)=An+cn1An1++c0I=p(A)A^n B_{n-1} + A^{n-1}(B_{n-2} - AB_{n-1}) + \cdots + A^0(-AB_0) = A^n + c_{n-1}A^{n-1} + \cdots + c_0 I = p(A)

But the left side telescopes to zero, so p(A)=0p(A) = 0. \blacksquare

5.5 Jordan Normal Form

When a matrix is not diagonalisable, the Jordan normal form provides the next-best canonical Representation.

Theorem 5.6. Let AMn×n(C)A \in \mathcal{M}_{n \times n}(\mathbb{C}). Then AA is similar to a block-diagonal Matrix

J=(J1Jk)J = \begin{pmatrix} J_1 & & \\ & \ddots & \\ & & J_k \end{pmatrix}

Where each Jordan block has the form

Ji=(λi1λi1λi)J_i = \begin{pmatrix} \lambda_i & 1 & & \\ & \lambda_i & \ddots & \\ & & \ddots & 1 \\ & & & \lambda_i \end{pmatrix}

The Jordan form is unique up to permutation of the blocks.

Intuition. Each Jordan block corresponds to one eigenvalue. The size of the block equals the Number of steps in the chain v,(AλI)v,(AλI)2v,\mathbf{v}, (A - \lambda I)\mathbf{v}, (A - \lambda I)^2\mathbf{v}, \ldots Of generalised eigenvectors. A diagonalisable matrix has all Jordan blocks of size 1×11 \times 1.

Problem. Find the Jordan normal form of

A=(3103)A = \begin{pmatrix} 3 & 1 \\ 0 & 3 \end{pmatrix}

Solution

The characteristic polynomial is det(AλI)=(3λ)2\det(A - \lambda I) = (3 - \lambda)^2So λ=3\lambda = 3 is the Only eigenvalue with algebraic multiplicity 2.

A3I=(0100)A - 3I = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}Which has rank 1, so the geometric Multiplicity is dim(ker(A3I))=21=1\dim(\ker(A - 3I)) = 2 - 1 = 1.

Since the geometric multiplicity (1) is less than the algebraic multiplicity (2), AA is not Diagonalisable. The Jordan form has one block of size 2:

J=(3103)J = \begin{pmatrix} 3 & 1 \\ 0 & 3 \end{pmatrix}

(In this case, AA is already in Jordan form.) \blacksquare

5.6 Spectral Theorem for Real Symmetric Matrices

Theorem 5.7 (Spectral Theorem). If AMn×n(R)A \in \mathcal{M}_{n \times n}(\mathbb{R}) is symmetric (A=ATA = A^T), then:

  1. All eigenvalues of AA are real.
  2. AA has nn linearly independent orthonormal eigenvectors.
  3. AA is orthogonally diagonalisable: A=QDQTA = QDQ^T where QQ is orthogonal (QTQ=IQ^TQ = I).

Proof. We prove (1) and then (2) and (3) by induction on nn.

(1) Let λC\lambda \in \mathbb{C} be an eigenvalue with eigenvector vCn\mathbf{v} \in \mathbb{C}^n v0\mathbf{v} \neq \mathbf{0}. Then

vTAv=vT(λv)=λvTv\overline{\mathbf{v}}^T A \mathbf{v} = \overline{\mathbf{v}}^T (\lambda \mathbf{v}) = \lambda \overline{\mathbf{v}}^T \mathbf{v}

Since A=ATA = A^T and AA has real entries, A=A=AT\overline{A} = A = A^TSo

vTAv=(Av)Tv=(Av)Tv=(λv)Tv=λvTv\overline{\mathbf{v}}^T A \mathbf{v} = (A\overline{\mathbf{v}})^T \mathbf{v} = (\overline{A\mathbf{v}})^T \mathbf{v} = (\overline{\lambda}\,\overline{\mathbf{v}})^T \mathbf{v} = \overline{\lambda}\,\overline{\mathbf{v}}^T \mathbf{v}

Therefore (λλ)vTv=0(\lambda - \overline{\lambda})\overline{\mathbf{v}}^T\mathbf{v} = 0. Since vTv>0\overline{\mathbf{v}}^T\mathbf{v} \gt 0 We have λ=λ\lambda = \overline{\lambda}So λR\lambda \in \mathbb{R}.

(2) and (3) By induction. For n=1n = 1 the result is trivial. Assume it holds for (n1)×(n1)(n-1) \times (n-1) Symmetric matrices. Since all eigenvalues are real, AA has a real eigenvalue λ1\lambda_1 with real Eigenvector v1\mathbf{v}_1. Normalise: q1=v1/v1\mathbf{q}_1 = \mathbf{v}_1 / \lVert \mathbf{v}_1 \rVert.

Let W=q1={wRn:q1Tw=0}W = \mathbf{q}_1^\perp = \{\mathbf{w} \in \mathbb{R}^n : \mathbf{q}_1^T \mathbf{w} = 0\}. For any wW\mathbf{w} \in W:

q1T(Aw)=(Aq1)Tw=(λ1q1)Tw=λ10=0\mathbf{q}_1^T (A\mathbf{w}) = (A\mathbf{q}_1)^T \mathbf{w} = (\lambda_1 \mathbf{q}_1)^T \mathbf{w} = \lambda_1 \cdot 0 = 0

So AwWA\mathbf{w} \in W. Therefore AA restricts to a symmetric linear map AW:WWA|_W : W \to W on an (n1)(n-1)-dimensional space. By the inductive hypothesis, WW has an orthonormal basis {q2,,qn}\{\mathbf{q}_2, \ldots, \mathbf{q}_n\} of eigenvectors of AWA|_W.

Then {q1,q2,,qn}\{\mathbf{q}_1, \mathbf{q}_2, \ldots, \mathbf{q}_n\} is an orthonormal eigenbasis for Rn\mathbb{R}^n And A=QDQTA = QDQ^T with Q=[q1qn]Q = [\mathbf{q}_1 \mid \cdots \mid \mathbf{q}_n]. \blacksquare

5.7 Worked Example: Full Diagonalisation

Problem. Find the eigenvalues, eigenvectors, and diagonalise

A=(4123)A = \begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix}

Solution

The characteristic polynomial is

det(AλI)=det(4λ123λ)=(4λ)(3λ)2\det(A - \lambda I) = \det\begin{pmatrix} 4 - \lambda & 1 \\ 2 & 3 - \lambda \end{pmatrix} = (4 - \lambda)(3 - \lambda) - 2

=λ27λ+10=(λ5)(λ2)= \lambda^2 - 7\lambda + 10 = (\lambda - 5)(\lambda - 2)

So the eigenvalues are λ1=5\lambda_1 = 5 and λ2=2\lambda_2 = 2.

For λ1=5\lambda_1 = 5: Solve (A5I)v=0(A - 5I)\mathbf{v} = \mathbf{0}.

(1122)v=0    v1+v2=0    v=t(11)\begin{pmatrix} -1 & 1 \\ 2 & -2 \end{pmatrix}\mathbf{v} = \mathbf{0} \implies -v_1 + v_2 = 0 \implies \mathbf{v} = t\begin{pmatrix} 1 \\ 1 \end{pmatrix}

For λ2=2\lambda_2 = 2: Solve (A2I)v=0(A - 2I)\mathbf{v} = \mathbf{0}.

(2121)v=0    2v1+v2=0    v=t(12)\begin{pmatrix} 2 & 1 \\ 2 & 1 \end{pmatrix}\mathbf{v} = \mathbf{0} \implies 2v_1 + v_2 = 0 \implies \mathbf{v} = t\begin{pmatrix} 1 \\ -2 \end{pmatrix}

Therefore A=PDP1A = PDP^{-1} with

P=(1112),D=(5002)P = \begin{pmatrix} 1 & 1 \\ 1 & -2 \end{pmatrix}, \quad D = \begin{pmatrix} 5 & 0 \\ 0 & 2 \end{pmatrix}

P1=13(2111)=(2/31/31/31/3)P^{-1} = \frac{1}{-3}\begin{pmatrix} -2 & -1 \\ -1 & 1 \end{pmatrix} = \begin{pmatrix} 2/3 & 1/3 \\ 1/3 & -1/3 \end{pmatrix}

Verification: PDP1=(1112)(5002)(2/31/31/31/3)PDP^{-1} = \begin{pmatrix} 1 & 1 \\ 1 & -2 \end{pmatrix}\begin{pmatrix} 5 & 0 \\ 0 & 2 \end{pmatrix}\begin{pmatrix} 2/3 & 1/3 \\ 1/3 & -1/3 \end{pmatrix}

=(5254)(2/31/31/31/3)=(10/3+2/35/32/310/34/35/3+4/3)=(4123)=A= \begin{pmatrix} 5 & 2 \\ 5 & -4 \end{pmatrix}\begin{pmatrix} 2/3 & 1/3 \\ 1/3 & -1/3 \end{pmatrix} = \begin{pmatrix} 10/3 + 2/3 & 5/3 - 2/3 \\ 10/3 - 4/3 & 5/3 + 4/3 \end{pmatrix} = \begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix} = A. \blacksquare

Problem. Use the Cayley—Hamilton theorem to compute A10A^{10} for the same matrix AA above.

Solution

The characteristic polynomial is p(λ)=λ27λ+10p(\lambda) = \lambda^2 - 7\lambda + 10So by Cayley—Hamilton, A2=7A10IA^2 = 7A - 10I.

To find A10A^{10}Divide λ10\lambda^{10} by p(λ)p(\lambda):

λ10=q(λ)(λ27λ+10)+r(λ)\lambda^{10} = q(\lambda)(\lambda^2 - 7\lambda + 10) + r(\lambda)

Where r(λ)=aλ+br(\lambda) = a\lambda + b has degree less than 2. Then A10=r(A)=aA+bIA^{10} = r(A) = aA + bI.

To find aa and bbEvaluate at the eigenvalues:

λ10λ=5=510=9765625=5a+b\lambda^{10}\big|_{\lambda=5} = 5^{10} = 9765625 = 5a + b

λ10λ=2=210=1024=2a+b\lambda^{10}\big|_{\lambda=2} = 2^{10} = 1024 = 2a + b

Subtracting: 3a=97656251024=97646013a = 9765625 - 1024 = 9764601So a=3254867a = 3254867.

b=102423254867=10246509734=6508710b = 1024 - 2 \cdot 3254867 = 1024 - 6509734 = -6508710.

Therefore A10=3254867A6508710IA^{10} = 3254867 \cdot A - 6508710 \cdot I. \blacksquare

:::caution Common Pitfall Not every matrix is diagonalisable. For example, A=(1101)A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} has Eigenvalue λ=1\lambda = 1 with algebraic multiplicity 2 but geometric multiplicity 1. It has only one Linearly independent eigenvector and is not diagonalisable.

5.8 Worked Example: Spectral Decomposition of a Symmetric Matrix

Problem. Orthogonally diagonalise the symmetric matrix

A=(211121112)A = \begin{pmatrix} 2 & 1 & 1 \\ 1 & 2 & 1 \\ 1 & 1 & 2 \end{pmatrix}

Solution

det(AλI)=det(2λ1112λ1112λ)\det(A - \lambda I) = \det\begin{pmatrix} 2-\lambda & 1 & 1 \\ 1 & 2-\lambda & 1 \\ 1 & 1 & 2-\lambda \end{pmatrix}

=(2λ)[(2λ)21]1[(2λ)1]+1[1(2λ)]= (2-\lambda)[(2-\lambda)^2 - 1] - 1[(2-\lambda) - 1] + 1[1 - (2-\lambda)] =(2λ)(λ24λ+3)(1λ)+(λ1)= (2-\lambda)(\lambda^2 - 4\lambda + 3) - (1-\lambda) + (\lambda - 1) =(2λ)(λ1)(λ3)= (2-\lambda)(\lambda-1)(\lambda-3)

Eigenvalues: λ1=1\lambda_1 = 1, λ2=2\lambda_2 = 2, λ3=3\lambda_3 = 3.

For λ1=1\lambda_1 = 1: AI=(111111111)(111000000)A - I = \begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{pmatrix} \to \begin{pmatrix} 1 & 1 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}. Eigenspace: {(s,t,st):s,tR}\{(s, t, -s-t) : s, t \in \mathbb{R}\}. An orthonormal basis: q1=12(1,1,0)\mathbf{q}_1 = \frac{1}{\sqrt{2}}(1, -1, 0), q2=16(1,1,2)\mathbf{q}_2 = \frac{1}{\sqrt{6}}(1, 1, -2).

For λ2=2\lambda_2 = 2: A2I=(011101110)(101011000)A - 2I = \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix} \to \begin{pmatrix} 1 & 0 & 1 \\ 0 & 1 & 1 \\ 0 & 0 & 0 \end{pmatrix}. Eigenspace: {(t,t,t):tR}\{(-t, -t, t) : t \in \mathbb{R}\}. Normalised: q3=13(1,1,1)\mathbf{q}_3 = \frac{1}{\sqrt{3}}(-1, -1, 1).

For λ3=3\lambda_3 = 3: A3I=(111111111)(111000000)A - 3I = \begin{pmatrix} -1 & 1 & 1 \\ 1 & -1 & 1 \\ 1 & 1 & -1 \end{pmatrix} \to \begin{pmatrix} 1 & -1 & -1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}. Eigenspace: {(s+t,s,t):s,tR}\{(s+t, s, t) : s, t \in \mathbb{R}\}. Normalised: q4=13(1,1,1)\mathbf{q}_4 = \frac{1}{\sqrt{3}}(1, 1, 1).

A=QDQTA = QDQ^T where Q=16(312231220222)Q = \frac{1}{\sqrt{6}}\begin{pmatrix} \sqrt{3} & 1 & -\sqrt{2} & \sqrt{2} \\ -\sqrt{3} & 1 & -\sqrt{2} & \sqrt{2} \\ 0 & -2 & \sqrt{2} & \sqrt{2} \end{pmatrix} and D=diag(1,2,3)D = \mathrm{diag}(1, 2, 3).

\blacksquare

5.9 Common Pitfalls

  • Algebraic multiplicity \geq geometric multiplicity, not the reverse. A matrix is diagonalisable if and only if equality holds for every eigenvalue.
  • The characteristic polynomial depends on the choice of eigenvalue variable convention. det(AλI)\det(A - \lambda I) and det(λIA)\det(\lambda I - A) differ by a factor of (1)n(-1)^n but have the same roots.
  • Similarity preserves eigenvalues but not eigenvectors. If A=PBP1A = PBP^{-1} and Bv=λvB\mathbf{v} = \lambda\mathbf{v} then A(Pv)=λ(Pv)A(P\mathbf{v}) = \lambda(P\mathbf{v}); the eigenvectors transform by PP.

:::