For a
linear mapping
φ
{\displaystyle {}\varphi }
on a
K
{\displaystyle {}K}
-vector space
V
{\displaystyle {}V}
and an
eigenvalue
λ
∈
K
{\displaystyle {}\lambda \in K}
,
GeEig
λ
(
φ
)
=
⋃
n
∈
N
kern
(
φ
−
λ
Id
)
n
{\displaystyle {}\operatorname {GeEig} _{\lambda }(\varphi )=\bigcup _{n\in \mathbb {N} }\operatorname {kern} {\left(\varphi -\lambda \operatorname {Id} \right)}^{n}\,}
is called
generalized eigenspace
of
φ
{\displaystyle {}\varphi }
for this eigenvalue.
If
V
{\displaystyle {}V}
is finite-dimensional, then the chain
kern
(
φ
−
λ
Id
)
⊆
kern
(
φ
−
λ
Id
)
2
⊆
kern
(
φ
−
λ
Id
)
3
⊆
.
.
.
{\displaystyle {}\operatorname {kern} {\left(\varphi -\lambda \operatorname {Id} \right)}\subseteq \operatorname {kern} {\left(\varphi -\lambda \operatorname {Id} \right)}^{2}\subseteq \operatorname {kern} {\left(\varphi -\lambda \operatorname {Id} \right)}^{3}\subseteq ...\,}
becomes stationary, that is, there exists some
r
∈
N
{\displaystyle {}r\in \mathbb {N} }
such that
GeEig
λ
(
φ
)
=
kern
(
φ
−
λ
Id
)
r
.
{\displaystyle {}\operatorname {GeEig} _{\lambda }(\varphi )=\operatorname {kern} {\left(\varphi -\lambda \operatorname {Id} \right)}^{r}\,.}
Generalized eigenspaces are, due to
exercise ,
invariant under the linear mapping. By definition, we have
Eig
λ
(
φ
)
⊆
GeEig
λ
(
φ
)
,
{\displaystyle {}\operatorname {Eig} _{\lambda }{\left(\varphi \right)}\subseteq \operatorname {GeEig} _{\lambda }(\varphi )\,,}
and for
φ
{\displaystyle {}\varphi }
diagonalizable we have equality, see
exercise .
We want to understand trigonalizable mappings via their generalized eigenspaces.
Due to
the Lemma of Bezout ,
there exist polynomials
S
,
T
∈
K
[
T
]
{\displaystyle {}S,T\in K[T]}
such that
S
P
+
T
Q
=
1
.
{\displaystyle {}SP+TQ=1\,.}
Set
U
=
kern
P
(
φ
)
{\displaystyle {}U=\operatorname {kern} P(\varphi )}
and
W
=
kern
Q
(
φ
)
{\displaystyle {}W=\operatorname {kern} Q(\varphi )}
.
Let
v
∈
V
{\displaystyle {}v\in V}
.
Due to
the Theorem of Cayley-Hamilton
,
we have
0
=
χ
φ
(
φ
)
=
(
P
(
φ
)
∘
Q
(
φ
)
)
(
v
)
=
P
(
φ
)
(
Q
(
φ
)
(
v
)
)
.
{\displaystyle {}0=\chi _{\varphi }(\varphi )=(P(\varphi )\circ Q(\varphi ))(v)=P(\varphi )(Q(\varphi )(v))\,.}
Therefore, the image of
Q
(
φ
)
{\displaystyle {}Q(\varphi )}
belongs to the kernel of
P
(
φ
)
{\displaystyle {}P(\varphi )}
and vice versa. From
v
=
Id
V
(
v
)
=
(
S
P
+
T
Q
)
(
φ
)
(
v
)
=
S
(
φ
)
(
P
(
φ
)
(
v
)
)
+
T
(
φ
)
(
Q
(
φ
)
(
v
)
)
=
P
(
φ
)
(
S
(
φ
)
(
v
)
)
+
Q
(
φ
)
(
T
(
φ
)
(
v
)
)
{\displaystyle {}{\begin{aligned}v&=\operatorname {Id} _{V}(v)\\&=(SP+TQ)(\varphi )(v)\\&=S(\varphi )(P(\varphi )(v))+T(\varphi )(Q(\varphi )(v))\\&=P(\varphi )(S(\varphi )(v))+Q(\varphi )(T(\varphi )(v))\end{aligned}}}
we can read off that the left-hand summand belongs to
Im
P
(
φ
)
⊆
kern
Q
(
φ
)
{\displaystyle {}\operatorname {Im} P(\varphi )\subseteq \operatorname {kern} Q(\varphi )}
and the right-hand summand belongs to
Im
Q
(
φ
)
⊆
kern
P
(
φ
)
{\displaystyle {}\operatorname {Im} Q(\varphi )\subseteq \operatorname {kern} P(\varphi )}
.
Therefore, we have a sum decomposition, which is direct, since
P
(
φ
)
(
v
)
=
Q
(
φ
)
(
v
)
=
0
{\displaystyle {}P(\varphi )(v)=Q(\varphi )(v)=0}
implies
v
=
0
{\displaystyle {}v=0}
.
For the
φ
{\displaystyle {}\varphi }
-invariance
of these spaces, see
exercise .
For
v
∈
kern
Q
(
φ
)
{\displaystyle {}v\in \operatorname {kern} Q(\varphi )}
,
we have
v
=
S
(
φ
)
(
P
(
φ
)
(
v
)
)
+
T
(
φ
)
(
Q
(
φ
)
(
v
)
)
=
S
(
φ
)
(
P
(
φ
)
(
v
)
)
=
P
(
φ
)
(
S
(
φ
)
(
v
)
)
,
{\displaystyle {}v=S(\varphi )(P(\varphi )(v))+T(\varphi )(Q(\varphi )(v))=S(\varphi )(P(\varphi )(v))=P(\varphi )(S(\varphi )(v))\,,}
that is, we have
Im
P
(
φ
)
=
kern
Q
(
φ
)
{\displaystyle {}\operatorname {Im} P(\varphi )=\operatorname {kern} Q(\varphi )}
.
Therefore, the restriction of
P
(
φ
)
{\displaystyle {}P(\varphi )}
to the kernel of
Q
(
φ
)
{\displaystyle {}Q(\varphi )}
is surjective, thus bijective.
◻
{\displaystyle \Box }
Let
φ
:
V
⟶
V
{\displaystyle \varphi \colon V\longrightarrow V}
be an
endomorphism
on the
finite-dimensional
K
{\displaystyle {}K}
-vector space
V
{\displaystyle {}V}
, and let
λ
∈
K
{\displaystyle {}\lambda \in K}
. Then the
dimension
of the
generalized eigenspace
GeEig
λ
(
φ
)
{\displaystyle {}\operatorname {GeEig} _{\lambda }(\varphi )}
equals the
algebraic multiplicity
of
λ
{\displaystyle {}\lambda }
.
We write the
characteristic polynomial
of
φ
{\displaystyle {}\varphi }
as
χ
φ
=
(
X
−
λ
)
k
Q
,
{\displaystyle {}\chi _{\varphi }=(X-\lambda )^{k}Q\,,}
where
(
X
−
λ
)
{\displaystyle {}(X-\lambda )}
does not occur in
Q
{\displaystyle {}Q}
as a linear factor, that is,
k
{\displaystyle {}k}
is the algebraic multiplicity of
λ
{\displaystyle {}\lambda }
. Then,
P
=
(
X
−
λ
)
k
{\displaystyle {}P=(X-\lambda )^{k}}
and
Q
{\displaystyle {}Q}
are
coprime ,
and, due to
fact ,
we have the decompositon
V
=
kern
P
(
φ
)
⊕
kern
Q
(
φ
)
,
{\displaystyle {}V=\operatorname {kern} P(\varphi )\oplus \operatorname {kern} Q(\varphi )\,,}
and
P
(
φ
)
=
(
φ
−
λ
Id
)
k
:
kern
Q
(
φ
)
⟶
kern
Q
(
φ
)
{\displaystyle P(\varphi )={\left(\varphi -\lambda \operatorname {Id} \right)}^{k}\colon \operatorname {kern} Q(\varphi )\longrightarrow \operatorname {kern} Q(\varphi )}
is a bijection. Moreover,
H
:=
GeEig
λ
(
φ
)
=
kern
P
(
φ
)
,
{\displaystyle {}H:=\operatorname {GeEig} _{\lambda }(\varphi )=\operatorname {kern} P(\varphi )\,,}
where the inclusion
⊇
{\displaystyle {}\supseteq }
is clear, and the other inclusion follows from the fact that higher powerse of
X
−
λ
{\displaystyle {}X-\lambda }
do not annihilate further elements, by the bijectivity on
kern
Q
(
φ
)
{\displaystyle {}\operatorname {kern} Q(\varphi )}
just mentioned. For the characteristic polynomial, we have, due to the direct sum decomposition according to
fact ,
the relation
χ
φ
=
χ
1
⋅
χ
2
,
{\displaystyle {}\chi _{\varphi }=\chi _{1}\cdot \chi _{2}\,,}
where
χ
1
{\displaystyle {}\chi _{1}}
is the characteristic polynomial of
φ
|
H
{\displaystyle {}\varphi {|}_{H}}
and
χ
2
{\displaystyle {}\chi _{2}}
is the characteristic polynomial of
φ
|
kern
Q
(
φ
)
{\displaystyle {}\varphi {|}_{\operatorname {kern} Q(\varphi )}}
. Since
(
φ
−
λ
)
k
{\displaystyle {}(\varphi -\lambda )^{k}}
restricted to
H
{\displaystyle {}H}
is the zero mapping, the minimal polynomial of
φ
|
H
{\displaystyle {}\varphi {|}_{H}}
and, hence, also the characteristic polynomial
χ
1
{\displaystyle {}\chi _{1}}
are some power of
(
X
−
λ
)
{\displaystyle {}(X-\lambda )}
, say
χ
1
=
(
X
−
λ
)
d
,
{\displaystyle {}\chi _{1}=(X-\lambda )^{d}\,,}
where
d
=
dim
K
(
H
)
.
{\displaystyle {}d=\dim _{K}{\left(H\right)}\,.}
In particular,
d
≤
k
{\displaystyle {}d\leq k}
,
as
χ
1
{\displaystyle {}\chi _{1}}
is a divisor of
χ
φ
{\displaystyle {}\chi _{\varphi }}
. Assume that
d
<
k
{\displaystyle {}d<k}
.
Then
λ
{\displaystyle {}\lambda }
is a zero of
χ
2
{\displaystyle {}\chi _{2}}
and
λ
{\displaystyle {}\lambda }
is an eigenvalue of
φ
|
kern
Q
(
φ
)
{\displaystyle {}\varphi {|}_{\operatorname {kern} Q(\varphi )}}
. But this is a contradiction to the fact that
P
(
φ
)
{\displaystyle {}P(\varphi )}
is a bijection on this space.
◻
{\displaystyle \Box }
◻
{\displaystyle \Box }
Let
φ
:
V
⟶
V
{\displaystyle \varphi \colon V\longrightarrow V}
be a
trigonalizable
K
{\displaystyle {}K}
-endomorphism
on the
finite-dimensional
K
{\displaystyle {}K}
-vector space
V
{\displaystyle {}V}
. Then
V
{\displaystyle {}V}
is the
direct sum
of the
generalized eigenspaces ,
that is,
V
=
GeEig
λ
1
(
φ
)
⊕
⋯
⊕
GeEig
λ
m
(
φ
)
,
{\displaystyle {}V=\operatorname {GeEig} _{\lambda _{1}}(\varphi )\oplus \cdots \oplus \operatorname {GeEig} _{\lambda _{m}}(\varphi )\,,}
where
λ
1
,
…
,
λ
m
{\displaystyle {}\lambda _{1},\ldots ,\lambda _{m}}
are the different
eigenvalues
of
φ
{\displaystyle {}\varphi }
, and
φ
{\displaystyle {}\varphi }
is the
direct sum
of the restrictions
φ
i
=
φ
|
H
i
:
H
i
⟶
H
i
{\displaystyle \varphi _{i}=\varphi {|}_{H_{i}}\colon H_{i}\longrightarrow H_{i}}
on the generalized eigenspaces.
Let
χ
φ
=
(
X
−
λ
1
)
k
1
⋯
(
X
−
λ
m
)
k
m
{\displaystyle {}\chi _{\varphi }=(X-\lambda _{1})^{k_{1}}\cdots (X-\lambda _{m})^{k_{m}}\,}
be the
characteristic polynomial ,
which splits into linear factors according to
fact ,
where the
λ
i
{\displaystyle {}\lambda _{i}}
are different. We do induction over
m
{\displaystyle {}m}
. For
m
=
1
{\displaystyle {}m=1}
,
there is only one eigenvalue
λ
{\displaystyle {}\lambda }
and only one generalized eigenspace. Due to
fact ,
the minimal polynomial is of the form
(
X
−
λ
)
s
{\displaystyle {}(X-\lambda )^{s}}
and thus
V
=
GeEig
λ
(
φ
)
{\displaystyle {}V=\operatorname {GeEig} _{\lambda }(\varphi )}
.
Suppose that the statement is already proven for smaller
m
{\displaystyle {}m}
. We set
P
=
(
X
−
λ
1
)
k
1
{\displaystyle {}P=(X-\lambda _{1})^{k_{1}}}
and
Q
=
(
X
−
λ
2
)
k
2
⋯
(
X
−
λ
m
)
k
m
{\displaystyle {}Q=(X-\lambda _{2})^{k_{2}}\cdots (X-\lambda _{m})^{k_{m}}}
.
We are then in the situation of
fact
and
fact .
Therefore, we have a direct sum decomposition into
φ
{\displaystyle {}\varphi }
-invariant
linear subspaces
V
=
GeEig
λ
1
(
φ
)
⊕
kern
Q
(
φ
)
.
{\displaystyle {}V=\operatorname {GeEig} _{\lambda _{1}}(\varphi )\oplus \operatorname {kern} Q(\varphi )\,.}
The characteristic polynomial is, according to
fact ,
the product of the characteristic polynomials of the restrictions to the spaces. Because of
fact ,
the polynomial
(
X
−
λ
1
)
k
1
{\displaystyle {}(X-\lambda _{1})^{k_{1}}}
is the characteristic polynomial of the restriction to the first generalized eigenspace, hence,
Q
{\displaystyle {}Q}
is the characteristic polynomial of the restriction to
kern
Q
(
P
)
{\displaystyle {}\operatorname {kern} Q(P)}
. In particular, this restriction is also trigonalizable. By the induction hypothesis,
kern
Q
(
P
)
{\displaystyle {}\operatorname {kern} Q(P)}
is the direct sum of the generalized eigenspaces for
λ
2
,
…
,
λ
m
{\displaystyle {}\lambda _{2},\ldots ,\lambda _{m}}
. Altogether, this implies the direct sum decomposition of
V
{\displaystyle {}V}
and of
φ
{\displaystyle {}\varphi }
.
◻
{\displaystyle \Box }