* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Normal Matrices
System of linear equations wikipedia , lookup
Capelli's identity wikipedia , lookup
Linear least squares (mathematics) wikipedia , lookup
Symmetric cone wikipedia , lookup
Principal component analysis wikipedia , lookup
Rotation matrix wikipedia , lookup
Determinant wikipedia , lookup
Gaussian elimination wikipedia , lookup
Four-vector wikipedia , lookup
Non-negative matrix factorization wikipedia , lookup
Matrix (mathematics) wikipedia , lookup
Singular-value decomposition wikipedia , lookup
Matrix calculus wikipedia , lookup
Eigenvalues and eigenvectors wikipedia , lookup
Orthogonal matrix wikipedia , lookup
Jordan normal form wikipedia , lookup
Matrix multiplication wikipedia , lookup
Chapter 6 Normal Matrices Normal matrices are matrices that include Hermitian matrices and enjoy several of the same properties as Hermitian matrices. Indeed, while we proved that Hermitian matrices are unitarily diagonalizable, we did not establish any converse. That is, if a matrix is unitarily diagonalizable, then does it have any special property involving for example its spectrum or its adjoint? As we shall see normal matrices are unitarily diagonalizable. 6.1 Introduction to Normal matrices Definition 6.1.1. A matrix A ∈ Mn is called normal if A∗ A = AA∗ . Proposition 6.1.1. A ∈ Mn is normal if and only if every matrix unitarily equivalent to A is normal. Proof. Suppose A is normal and B = U ∗ AU , where U is unitary. Then B ∗ B = U ∗ A∗ AU = U ∗ AA∗ U = U ∗ AU U ∗ A∗ U = BB ∗ . If U ∗ AU is normal then it is easy to see that U ∗ AA∗ U = U ∗ A∗ AU . Multiply this equation on the right by U ∗ and on the left by U to obtain AA∗ = A∗ A. Examples. (1) Unitary matrices are normal (U ∗ U = I = U U ∗ ). (2) Hermitian matrices are normal (AA∗ = A2 = A∗ A). (3) If A∗ = −A, we have A∗ A = AA∗ = −A2 . Hence matrices for which A∗ = −A, called skew-Hermitian, are normal. 197 198 CHAPTER 6. NORMAL MATRICES Example 6.1.1. Consider the arbitrary matrix N ∈ M2 (R) , written as · ¸ a b N= . If we suppose that N is normal then c d ∗ N N NN ∗ ¸ · a b = = c d · ¸· ¸T · a b a b = = c d c d · a b c d ¸T · a2 + c2 ab + cd ab + cd b2 + d2 a2 + b2 ac + bd ac + bd c2 + d2 ¸ ¸ From this we conclude that b2 = c2 , or b = ±c. Consider the cases in turn. (i) If c = b, then N is Hermitian and thus normal. (ii) If c = −b 6= 0, then (N ∗ N )12 = ab + cd = b (a − d). On the other hand (N N ∗ )12 = ac + bd = (d − a) b. For b (a − d) = (d − a) b, we must have a = d. This gives that real 2 × 2 normal matrices are either symmetric or have the form · ¸ a b N= −b a Note this form includes both rotations and skew-symmetric matrices. Recall the definition of a unitarily diagonalizable matrix: A matrix A ∈ Mn is called unitarily diagonalizable if there is a unitary matrix U for which U ∗ AU is diagonal. A simple consequence of this is that if U ∗ AU = D (where D = diagonal and U = unitary), then AU = U D and hence A has n orthonormal eigenvectors. This is just a part of the spectral theorem for normal matrices. Theorem 6.1.1 (Spectral theorem for normal matrices). If A ∈ Mn has eigenvalues λ1 . . . λn , counted according to multiplicity, the following statements are equivalent. (a) A is normal. (b) A is unitarily diagonalizable. Pn Pn Pn 2 2 (c) i=1 j=1 |aij | = j=1 |λj | . (d) There is an orthonormal set of n eigenvectors of A. 6.1. INTRODUCTION TO NORMAL MATRICES 199 Proof. (a) ⇒ (b). If A is normal, then AA∗ is Hermitian and therefore unitarily diagonalizable. Thus U ∗ A∗ AU = D = U ∗ AA∗ U . Also, A, A∗ , A∗ A = AA∗ form a commuting family. This implies that eigenvectors of A∗ A are also eigenvectors of A. Since A∗ A has a complete orthonormal set we know that U ∗ AU is also diagonal. It is easy to see also that (b) ⇒ (a) We also note that (a) ⇒ (d). (b) ⇒ (c). Suppose U ∗ AU = D. Then U ∗ A∗ U = D∗ and U ∗ A∗ AU = D∗ D. By Corollary P 3.5.2 P similarly preserves the trace. We know trace of A∗ A P P P P n n n n n n is tr (A∗ A) = j=1 k=1 a∗jk akj = j=1 k=1 ākj akj = j=1 k=1 |akj |2 . Since the trace of D∗ D is Σ|λj |2 , the result follows. (c) ⇒ (b). We know that A is unitarily equivalent to a upper P triangular matrix T . We also know that if A ∼ B are unitarily equivalent |aij |2 = ij P |bij |2 . Application of this equality to the upper triangular matrix T yields ij X i,j |aij |2 = X |λj |2 + X j>i |tij |2 = X |λj |2 . Thus tij = 0 for j > i. Thus A is uniformly diagonalizable. (d) ⇒ (b). Trivial. Corollary 6.1.1. Let A ∈ Mn and A is normal. If U is unitary and if U ∗ AU is upper triangular then U ∗ AU is diagonal. Theorem 6.1.2. Let N ∈ Mn (R). Then N is normal if and only if there is a real orthogonal matrix Q ∈ Mn (R) such that A1 A2 ° (1) QT N Q = .. . ° An where Ai is 1 × 1 (real) or Ai is 2 × 2 (real) of the form ¸ · αi βj . Ai = −βj αi Proof. First of all, any matrix A of the form given by (1) is normal, and therefore so also is any matrix unitarily similar (real orthogonally similar in this case) to it. 200 CHAPTER 6. NORMAL MATRICES To prove the converse we assume that N ∈ Mn (R) is normal. We know that N is unitarily diagonalizable. That is, there is a unitary matrix U such that U ∗ N U = D, the diagonal matrix of its eigenvalues. Because N is real, all complex eigenvalues occur in complex conjugate pairs. Arrange them as successive diagonal entries in D. If λ is a real eigenvalue, we can assume without loss of generality that the corresponding eigenvector is real. For complex eigenvalues, the corresponding eigenvectors also occur in conjugate pairs. Thus if α + iβ is an eigenvector of N with corresponding eigenvector written in real and complex parts u = ur + ius . Since N is real we have that α − iβ is also an eigenvector of N with corresponding eigenvector ū = ur − ius . By the fact that N is unitarily diagonalible, these vectors are orthogonal. This means hur , us i = 0. Replace the eigenvectors ur ± ius by the real an imaginary parts in U . This gives the matrix Q. Now compute QT NQ. It is easy to see that compute N ur = αur −βvs and N us = αus +βvr . When the first of these vectors (αur −βvs ) is multiplied by QT we obtain the vector [0, . . . , α, −β, 0, . . . 0]T . T Multiplication by the ·second gives ¸ the vector [0, . . . , β, α, 0, . . . 0] . In this α β way the components arise. −β α Corollary 6.1.2. (a) A ∈ Mn is symmetric if and only if (1) holds with all blocks 1 × 1 (and real). (b) AAT = I if and only if (1) has the form λ 1 ... λp ∗ A1 .. . ∗ Ak where λj = ±1 and Aj = 6.2 h cos θj − sin θj sin θj cos θj i θj ∈ R. Exercises 1. If A and B commute and if A is normal, then A∗ and B commute.