Let A∈Mn×n(F). A scalar λ∈F is an eigenvalue of A if there Exists a non-zero vector v∈Fn such that
Av=λv
The vector v is called an eigenvector corresponding to λ.
The eigenspace corresponding to λ is Eλ=ker(A−λI). Its dimension is The geometric multiplicity of λ.
5.2 Characteristic Polynomial
Theorem 5.1.λ is an eigenvalue of A if and only if det(A−λI)=0.
The polynomial p(λ)=det(A−λI) is called the characteristic polynomial of A. Its roots (in the algebraic closure of F) are the eigenvalues of A.
If p(λ)=(lambda−lambda1)m1(lambda−lambda2)m2⋯(lambda−lambdak)mk With lambda1,…,lambdak distinct, then mi is the algebraic multiplicity of lambdai.
Proposition 5.2. For each eigenvalue λ, 1≤dim(Eλ)≤mλ (geometric multiplicity does not exceed algebraic multiplicity).
5.3 Diagonalisation
Definition.A is diagonalisable if there exists an invertible matrix P and a diagonal Matrix D such that
A=PDP−1
Theorem 5.3.A∈Mn×n(F) is diagonalisable (over F) if and only if A has n linearly independent Eigenvectors (over F). Equivalently, the sum of the geometric multiplicities equals n.
Corollary 5.4. If A has n distinct eigenvalues, then A is diagonalisable.
Proof. Eigenvectors corresponding to distinct eigenvalues are linearly independent. With n distinct Eigenvalues, we obtain n linearly independent eigenvectors, which form a basis of Fn. ■
5.4 Cayley—Hamilton Theorem
Theorem 5.5 (Cayley—Hamilton). Every square matrix satisfies its own characteristic polynomial: If p(λ)=det(λI−A)Then p(A)=0 (the zero matrix).
Proof sketch. Let p(λ)=λn+cn−1λn−1+⋯+c1λ+c0. By the adjugate formula (Theorem 3.5), (λI−A)⋅adj(λI−A)=p(λ)⋅I. Each entry of adj(λI−A) is a polynomial in λ of degree at most n−1 So we can write adj(λI−A)=Bn−1λn−1+⋯+B1λ+B0 for Matrices Bi. Multiplying out and comparing coefficients of λk:
Bn−1=I,Bn−2−ABn−1=cn−1I,…,−AB0=c0I
Multiplying the k-th equation on the left by Ak and summing over k:
But the left side telescopes to zero, so p(A)=0. ■
5.5 Jordan Normal Form
When a matrix is not diagonalisable, the Jordan normal form provides the next-best canonical Representation.
Theorem 5.6. Let A∈Mn×n(C). Then A is similar to a block-diagonal Matrix
J=J1⋱Jk
Where each Jordan block has the form
Ji=λi1λi⋱⋱1λi
The Jordan form is unique up to permutation of the blocks.
Intuition. Each Jordan block corresponds to one eigenvalue. The size of the block equals the Number of steps in the chain v,(A−λI)v,(A−λI)2v,… Of generalised eigenvectors. A diagonalisable matrix has all Jordan blocks of size 1×1.
Problem. Find the Jordan normal form of
A=(3013)
Solution
The characteristic polynomial is det(A−λI)=(3−λ)2So λ=3 is the Only eigenvalue with algebraic multiplicity 2.
A−3I=(0010)Which has rank 1, so the geometric Multiplicity is dim(ker(A−3I))=2−1=1.
Since the geometric multiplicity (1) is less than the algebraic multiplicity (2), A is not Diagonalisable. The Jordan form has one block of size 2:
J=(3013)
(In this case, A is already in Jordan form.) ■
5.6 Spectral Theorem for Real Symmetric Matrices
Theorem 5.7 (Spectral Theorem). If A∈Mn×n(R) is symmetric (A=AT), then:
All eigenvalues of A are real.
A has n linearly independent orthonormal eigenvectors.
A is orthogonally diagonalisable: A=QDQT where Q is orthogonal (QTQ=I).
Proof. We prove (1) and then (2) and (3) by induction on n.
(1) Let λ∈C be an eigenvalue with eigenvector v∈Cnv=0. Then
vTAv=vT(λv)=λvTv
Since A=AT and A has real entries, A=A=ATSo
vTAv=(Av)Tv=(Av)Tv=(λv)Tv=λvTv
Therefore (λ−λ)vTv=0. Since vTv>0 We have λ=λSo λ∈R.
(2) and (3) By induction. For n=1 the result is trivial. Assume it holds for (n−1)×(n−1) Symmetric matrices. Since all eigenvalues are real, A has a real eigenvalue λ1 with real Eigenvector v1. Normalise: q1=v1/∥v1∥.
Let W=q1⊥={w∈Rn:q1Tw=0}. For any w∈W:
q1T(Aw)=(Aq1)Tw=(λ1q1)Tw=λ1⋅0=0
So Aw∈W. Therefore A restricts to a symmetric linear map A∣W:W→W on an (n−1)-dimensional space. By the inductive hypothesis, W has an orthonormal basis {q2,…,qn} of eigenvectors of A∣W.
Then {q1,q2,…,qn} is an orthonormal eigenbasis for Rn And A=QDQT with Q=[q1∣⋯∣qn]. ■
5.7 Worked Example: Full Diagonalisation
Problem. Find the eigenvalues, eigenvectors, and diagonalise
Problem. Use the Cayley—Hamilton theorem to compute A10 for the same matrix A above.
Solution
The characteristic polynomial is p(λ)=λ2−7λ+10So by Cayley—Hamilton, A2=7A−10I.
To find A10Divide λ10 by p(λ):
λ10=q(λ)(λ2−7λ+10)+r(λ)
Where r(λ)=aλ+b has degree less than 2. Then A10=r(A)=aA+bI.
To find a and bEvaluate at the eigenvalues:
λ10λ=5=510=9765625=5a+b
λ10λ=2=210=1024=2a+b
Subtracting: 3a=9765625−1024=9764601So a=3254867.
b=1024−2⋅3254867=1024−6509734=−6508710.
Therefore A10=3254867⋅A−6508710⋅I. ■
:::caution Common Pitfall Not every matrix is diagonalisable. For example, A=(1011) has Eigenvalue λ=1 with algebraic multiplicity 2 but geometric multiplicity 1. It has only one Linearly independent eigenvector and is not diagonalisable.
5.8 Worked Example: Spectral Decomposition of a Symmetric Matrix
Problem. Orthogonally diagonalise the symmetric matrix
For λ1=1: A−I=111111111→100100100. Eigenspace: {(s,t,−s−t):s,t∈R}. An orthonormal basis: q1=21(1,−1,0), q2=61(1,1,−2).
For λ2=2: A−2I=011101110→100010110. Eigenspace: {(−t,−t,t):t∈R}. Normalised: q3=31(−1,−1,1).
For λ3=3: A−3I=−1111−1111−1→100−100−100. Eigenspace: {(s+t,s,t):s,t∈R}. Normalised: q4=31(1,1,1).
A=QDQT where Q=613−3011−2−2−22222 and D=diag(1,2,3).
■
5.9 Common Pitfalls
Algebraic multiplicity ≥ geometric multiplicity, not the reverse. A matrix is diagonalisable if and only if equality holds for every eigenvalue.
The characteristic polynomial depends on the choice of eigenvalue variable convention.det(A−λI) and det(λI−A) differ by a factor of (−1)n but have the same roots.
Similarity preserves eigenvalues but not eigenvectors. If A=PBP−1 and Bv=λv then A(Pv)=λ(Pv); the eigenvectors transform by P.