Download Lecture notes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Non-negative matrix factorization wikipedia , lookup

Elementary algebra wikipedia , lookup

Eigenvalues and eigenvectors wikipedia , lookup

Cartesian tensor wikipedia , lookup

Homogeneous coordinates wikipedia , lookup

Bra–ket notation wikipedia , lookup

Signal-flow graph wikipedia , lookup

Linear algebra wikipedia , lookup

Dimension wikipedia , lookup

Basis (linear algebra) wikipedia , lookup

Matrix multiplication wikipedia , lookup

Cayley–Hamilton theorem wikipedia , lookup

System of polynomial equations wikipedia , lookup

History of algebra wikipedia , lookup

Gaussian elimination wikipedia , lookup

Matrix calculus wikipedia , lookup

Four-vector wikipedia , lookup

Equation wikipedia , lookup

System of linear equations wikipedia , lookup

Transcript
Theodore Voronov.
§2
Differentiable Manifolds. Spring 2011
Constructions of manifolds
Last updated: April 14, 2011.
2.1
Product of manifolds.
Let X and Y are manifolds of dimensions n and m, respectively. Then the
Cartesian product X × Y has a natural structure of a manifold of dimension
n + m.
Construction: if ϕ1α : V1α → U1α are charts for X and ϕ2µ : V2µ → U2µ
are charts for Y (here V1α ⊂ Rn , V2µ ⊂ Rm ), then
ϕ1α × ϕ2µ : V1α × V2µ → U1α × U2µ ⊂ X × Y
are charts for X × Y . Here
ϕ1α × ϕ2µ : (x1 , . . . , xn , y 1 , . . . , y m ) 7→ (x, y) ∈ X × Y ,
x = ϕ1α (x1 , . . . , xn ), y = ϕ2µ (y 1 , . . . , y m ) .
If (V1α , ϕ1α ) and (V2µ , ϕ2µ ) are smooth atlases, then a direct check shows that
the atlas (V1α × V2µ , ϕ1α × ϕ2µ ) is also smooth. Changes of coordinates on
the product X × Y have the typical “diagonal” form:
0
0
xa = xa (x1 , . . . , xn ) ,
0
0
y i = y i (y 1 , . . . , y m ) .
Here a = 1, . . . , n and i = 1, . . . , m. In the first line we see transformations
of coordinates from X, in the second line, from Y .
In the same way one can consider products of three or more manifolds:
X1 × . . . × X k .
Example 2.1. The product S 1 × S 1 is called the torus or more precisely
2-torus, and is diffeomorphic with the “surface of a bagel” in R3 (check!).
(The surface of a bagel is the surface of revolution of a circle of radius r and
center (R + r, 0) in the xy-plane, where R > 0, r > 0, about the z-axis.)
Notation: T 2 .
Example 2.2. For any n, the n-fold product S 1 ×. . .×S 1 is called the n-torus.
Notation: T n . (Using induction, T n can be visualized as a ‘multidimensional
surface of revolution’ of T n−1 in Rn+1 in the same way as T 2 is obtained by
revolving T 1 = S 1 in R3 .)
1
Theodore Voronov.
Differentiable Manifolds. Spring 2011
Example 2.3. For any manifold M , the product M ×R is called an (infinite)
cylinder over M .
Example 2.4. If we replace R by the open interval (0, 1), we obtain a finite
cylinder M × (0, 1). (Sketch a picture for M = S 1 .)
Remark 2.1. The product of a manifold with a closed segment [0, 1] is NOT
a manifold. (Explain, why.) Actually, it gives an example of a “manifold with
boundary”, a notion to be discussed later.
2.2
2.2.1
Submanifolds. Specifying manifolds by equations
Open submanifolds
Consider a manifold M = M n . Let A = (ϕα : Vα → Uα ) be an atlas for M .
Which subsets S ⊂ M are naturally manifolds?
Definition 2.1. A subset S ⊂ M is called an open submanifold if for all α,
the subsets ϕ−1
α (S ∩ Uα ) ⊂ Vα are open.
Any open submanifold S ⊂ M has a natural structure of a smooth manifold of the same dimension as M .
Indeed, consider Ūα := S ∩ Uα , V̄α := ϕ−1
α (S ∩ Uα ) and ϕ̄α := ϕα |V̄α . By
the definition, the maps ϕ̄α give bijections V̄α → Ūα , and, since V̄α ⊂ Vα are
n
open subsets
maps ϕ̄α are n-dimensional charts for the set
S of Vα ⊂ R , the S
S. Since Uα = M , we have Ūα = S, so Ā = (ϕ̄α : V̄α → Ūα ) is an atlas
for S. The changes of coordinates for the atlas Ā are simply the restrictions
to open subsets of the changes of coordinates for the atlas A. Therefore they
are smooth. So the set S considered together with the atlas Ā is a smooth
manifold of dimension n.
Example 2.5. Every open set U ⊂ Rn is an open submanifold. The manifold
structure is given by the restriction of the standard coordinates x1 , . . . , xn
onto U .
Example 2.6. Consider the subset of all invertible matrices GL(n) ⊂ Mat(n) .
2
Since GL(n) = {A ∈ Mat(n) | det A 6= 0} is an open subset of Mat(n) ∼
= Rn ,
it is therefore an open submanifold and dim GL(n) = n2 .
2
Theodore Voronov.
Differentiable Manifolds. Spring 2011
Remark 2.2. The manifold GL(n) is called the general linear group. It is a
group w.r.t. the matrix multiplication and all the group operations are given
by smooth maps. Groups that are also manifolds and satisfy this condition
are known as Lie groups. They make a very important class of manifolds.
Example 2.7. For an arbitrary function f ∈ C ∞ (M ), the subset
Mf = {x ∈ M | f (x) 6= 0}
is an open submanifold. Indeed, for each coordinate chart, the set ϕ−1
α (Mf ∩
Uα ) it open because it is the preimage of the open set R \ {0} under the
smooth (hence, continuous) function fα = f ◦ ϕα : Vα → R.
Later, when we introduce topology on manifolds, we shall see that open
submanifolds of M are, in particular, open sets in M .
2.2.2
Closed submanifolds
Consider now a different kind of submanifolds.
Definition 2.2. A subset S ⊂ M of a manifold M = M n is called a closed
submanifold of dimension k if in some atlas A on M , for all α,
k+1
ϕ−1
= . . . = xn = 0} ∩ Vα .
α (S ∩ Uα ) = {x
Denote Ūα := S ∩ Uα , V̄α := ϕ−1
α (S ∩ Uα ) and ϕ̄α := ϕα |V̄α . We obtain
an atlas for S. The changes of coordinates have the form:
xi(α) = ϕiαβ (x1(β) , . . . , xk(β) , 0 . . . , 0) ,
i = 1, . . . , k ,
where ϕiαβ for i = 1, . . . , n, are the changes of coordinates for the atlas A.
Therefore S is a smooth manifold of dimension k.
The number n − k is often referred to as the codimension of S k in M n .
Definition 2.2 may look too restrictive because of a very special appearance of a closed submanifold in coordinate charts. In fact, it is not. We shall
see that closed submanifolds can be equivalently described in a less restrictive
way. For this, we need to analyze systems of equations in Rn .
Consider a system of p equations for n variables where p 6 n :
 1 1
n

 f (x , . . . , x ) = 0
...
(1)

 f p (x1 , . . . , xn ) = 0
3
Theodore Voronov.
Differentiable Manifolds. Spring 2011
where the LHS’s are smooth functions on Rn . Let S ⊂ Rn be the solution set
of (1), i.e., the set of all points of Rn satisfying equations (1). We say that
equations (1) are independent if the rows of the matrix of partial derivatives
 ∂f 1 ∂f 1

∂f 1
.
.
.
1
2
n
∂x
∂x
∂x
∂f
,
= . . .
(2)
∂x
∂f p
∂f p
∂f p
. . . ∂xn
∂x1
∂x2
which is a function of a point in Rn , are linearly independent atall points
∂f i
x ∈ Rn belonging to the subset S ⊂ Rn . That means that rank ∂x
=p
j
at all points x ∈ S. If p = 1, i.e., for a single equation
f (x1 , . . . , xn ) = 0 ,
(3)
we use ‘non-degenerate’ instead of ‘independent’, so we say that equation (3)
is non-degenerate if df (x) 6= 0 for all points of the solution set.
Recall that the rank of a matrix is the dimension of the vector space
spanned by its rows — the row space — or the dimension of the vector space
spanned by its columns —the column space. These two numbers coincide
and also coincide with the maximal size of an invertible square submatrix,
i.e., the maximal size of a non-zero minor.
We may consider equations such as (1) or (3) defined on open domains
of Rn and apply the same notions of independence or non-degeneracy.
Lemma 2.1. Consider an open domain V ⊂ Rn . Let Γ ⊂ V be specified by
independent equations f i (x) = 0, where i = 1, . . . , p. Let k = n − p. Then for
every point x0 ∈ Γ there is an open set W ⊂ V , x0 ∈ W , and new curvilinear
coordinates y 1 , . . . , y n on W such that Γ ∩ W is specified by the equations
y k+1 = . . . = y n = 0.
= p means that at each x0 ∈ Γ there is an
Proof. The condition rank ∂f
∂x
invertible p × p submatrix of the rectangular matrix ∂f
. Fix some x0 ∈ Γ.
∂x
Without loss of generality, we may assume that it is the submatrix
i
∂f
∂xj i,j=k+1,...,n
consisting of the last n − k = p columns that is invertible on some open
neighborhood W 0 of x0 . (Otherwise renumber the coordinates.) By the
4
Theodore Voronov.
Differentiable Manifolds. Spring 2011
implicit function theorem, we can resolve the equations w.r.t. xi , i > k + 1,
on a smaller neighborhood W ⊂ W 0 of x0 . So there are smooth functions
g i (x1 , . . . , xk ) on W such that the equations f i (x) = 0 are equivalent to
xi = g i (x1 , . . . , xk ) for i > k. Define new coordinates on W :
(
xi
if i = 1 . . . , k ,
yi =
i
i 1
k
x − g (x , . . . , x )
if i = k + 1 . . . , n .
This an invertible change of coordinates, and the subset Γ ∩ W is specified
by y j = 0 for all j > k + 1 as desired.
Corollary 2.1. A system of p independent equations on Rn specifies a closed
submanifold Γk ⊂ Rn of dimension k = n − p.
Proof. Use the notation of the lemma. We can consider a smooth atlas on Rn
constructed as follows. As coordinate domains Uα we take all neighborhoods
W for all points x0 ∈ Γ, with the corresponding coordinate systems (y i ),
as constructed in the lemma, and, in addition, the open set Rn \ Γ with
the standard coordinates (xi ). It is clear that Definition 2.2 is satisfied for
Γ ⊂ Rn if we use this atlas.
Corollary 2.2. For an arbitrary manifold M n , a subset S ⊂ M n is closed kdimensional submanifold if and only if for some atlas A = (ϕα : Vα → Uα ),
n
the subsets ϕ−1
α (S ∩ Uα ) ⊂ Vα ⊂ R (for all α) can be specified by n − k
independent equations.
Proof. Consider a subset S ⊂ M . If it satisfies Definition 2.2, it is obviously specified by independent equations in each chart, because the equations
xk+1 = 0 , . . . , xn = 0 are independent. Conversely, suppose S, for some atlas
A, is specified by n−k independent equations in each coordinate chart. Then
we can apply Lemma 2.1 to each such chart and introduce smaller open subsets of Rn and the corresponding smaller subsets of M making a new smooth
atlas of M so that for the new atlas, the appearance of S in each coordinate
chart will be as required by Definition 2.2.
(In the proof of Corollary 2.2 we have implicitly used the following argument: if the intersection of a subset S ⊂ M with each coordinate domain
U ⊂ M is a closed submanifold of U , then S is a closed submanifold of M .)
5
Theodore Voronov.
Differentiable Manifolds. Spring 2011
Remark 2.3. Considerations in this subsection are modeled on a linear
example. Let p 6 n. Consider a homogeneous linear system
Ax = 0
(4)
where x ∈ Rn and A is an p × n matrix. Equations (4) are independent if
the p rows of the matrix A are linearly independent. As we know, in this
case the solution set for (4) is a vector space of dimension n − p. Recall that
if we solve this system by the Gauss elimination, we obtain n − p variables
as ‘free variables’ (so that they can be set to any values), and the remaining
p variables expressed as linear functions of the free variables, and this gives
the general solution.
2.2.3
Specifying manifolds by equations in RN
We can use the above theory as a source of examples of manifolds.
Example 2.8. Consider once again the sphere S 2 defined as the unit sphere
in R3 . It is specified by the equation
x2 + y 2 + z 2 = 1 ,
where we used traditional notation for coordinates on R3 . In other words,
we have a single equation
f (x, y, z) = 0
where f (x, y, z) = x2 + y 2 + z 2 − 1. The set of solutions S is our sphere S 2 .
The matrix of partial derivatives is the row matrix
∂f ∂f ∂f
,
,
= (2x, 2y, 2z) .
∂x ∂y ∂z
We see that this row vector never vanishes for (x, y, z) 6= 0, in particular, it
does not vanish for all (x, y, z) ∈ S. Therefore our equation is non-degenerate
and Corollary 2.1 applies. Once again we obtain a manifold structure of S 2 ,
which, as one can easily see, is equivalent to that previously defined with the
help of stereographic projections. The point in Corollary 2.1 is the possibility
to solve the system in question w.r.t. some of the variables so that the
remaining variables are ‘free’ and can be taken as coordinates, thus giving
a manifold structure, which is deduced from the implicit function theorem.
For the equation
of the sphere we, of course, can do that directly, writing,
p
2
e.g., z = 1 − x − y 2 in a neighborhood of some point.
6
Theodore Voronov.
Differentiable Manifolds. Spring 2011
In practical situations it often happens that interesting objects are defined
by equations that are not independent, and constructing an equivalent system
of independent equations may be awkward, if at all possible. (It is always
possible locally, of course, in a neighborhood of each point, but this is of no
practical help.) The following generalization of the previous is helpful.
Consider once again a system of p simultaneous equations on Rn :
 1 1
n

 f (x , . . . , x ) = 0
...
(5)

 f p (x1 , . . . , xn ) = 0
where, unlike (1), we do not assume that p 6 n. By definition, the rank of
this system is the rank r = rank ∂f
of the matrix of partial derivatives. It is
∂x
a function of a point x. We shall consider it at the points of the solution set
S specified by (5). The rank obviously cannot exceed n (and p). Suppose
rmax is the maximal value of the rank, which is attained at some point x0 .
Then the rank equals rmax on a whole open neighborhood of x0 . (Indeed,
the rank of a matrix is the maximal size of the non-zero minors. Therefore
a small variation of the matrix can only increase the rank, so if it is already
maximal, it will remain constant.) Hence the rank of a system of equations
is a constant and equals to rmax on a dense open subset of S ⊂ Rn . It makes
sense to concentrate on the case when it is simply constant.
Definition 2.3. We say that a system of equations (5) has constant rank if
at all points of the solution set S specified by (5), the rank of the matrix of
partial derivatives (2) is a constant number r (the same for all points x ∈ S).
A particular case is of course the case of independent equations treated
in Lemma 2.1 and Corollary 2.1, where r = p.
Theorem 2.1 (Manifolds specified by equations of constant rank). The solution set S of a system of constant rank on Rn is a closed submanifold of
Rn of dimension k = n − r where r is the rank of the system.
Proof. Omitted. It is similar to that of Lemma 2.1, but more technically
complicated.
How do we practically know that the rank of a given system is constant?
We should look at the vector space of solutions of the following auxiliary
7
Differentiable Manifolds. Spring 2011
Theodore Voronov.
linear system:
X ∂f µ
i
∂xi
(x)ẋi = 0 .
(6)
Here x is a given point in the set of solutions. The unknowns are the variables
that we have denoted by ẋi . The matrix of the system is exactly the matrix
of partial derivatives for which we want to know whether its rank is constant
(independent of x) or not. From linear algebra we know that the dimension
of the space of solutions of (6) is the number of unknowns (which is n) minus
the rank of the matrix (which is r). Therefore the rank r of the matrix
∂f µ
(x) is constant for x ∈ S if and only if the dimension of the space of
∂xi
solutions of the auxiliary system (6) does not depend on x ∈ S.
Notice that the linear system (6) can be obtained by the formal differentiation of the system (5) w.r.t. a parameter t (“time”) so that the ‘dot’
stands for the “time derivative”. Later we shall see the geometrical meaning
of this.
Example 2.9. Recall that a matrix A is called orthogonal is AAT = E,
where E is the identity matrix. The set of all real orthogonal n × n matrices
is denoted O(n). One can check that O(n) is a group, called the orthogonal
group. We claim that O(n) has a natural manifold structure. Let us apply
Theorem 2.1. Notice that a single matrix equation such as AAT = E is in
fact a system of n2 equations for the n2 matrix entries. Obviously it cannot
be independent, otherwise it would specify a single point (at best). To see
that this system has constant rank we consider A as a function of a parameter
t, A = A(t), and differentiate the equation w.r.t. t. We obtain the auxiliary
linear system ȦAT + AT Ȧ = 0 or (ȦAT )T = −ȦAT . (Here A ∈ O(n) is a
fixed point of the solution set and the unknown is Ȧ.) The space of solutions
of the auxiliary linear system (for all A ∈ O(n)) is therefore isomorphic
to the space of all skew-symmetric n × n matrices. Hence it has constant
dimension n(n − 1)/2 independent of A and the rank of the original system
is constant. The dimension of the manifold is n(n − 1)/2. (The dimension of
the space of all skew-symmetric n × n matrices can be counted as the number
of independent entries for such a matrix: 21 (n2 − n), where n is the number
of the diagonal entries, which all have to be zero.)
Example 2.10. A particularly simple special case is the group O(2) of the
orthogonal 2 × 2 matrices. The equation AAT = E for n = 2 can be solved
8
Differentiable Manifolds. Spring 2011
Theodore Voronov.
explicitly (do this!). Any orthogonal 2 × 2 matrix is either
cos α − sin α
A=
sin α cos α
or
A=
− cos β sin β
sin β cos β
.
(Geometrically it is either the rotation through some angle α or a rotation
followed by the inversion in a coordinate axis.) As expected we obtain an
example of a one-dimensional manifold: 2(2 − 1)/2 = 1. Moreover, we see
that O(2) ∼
= S 1 ∪ S 1 (the disjoint union of two circles).
Slightly changing the notation, consider RN and closed submanifolds in
it. One can prove that every manifold1 M n of dimension n is diffeomorphic
to a closed submanifold of RN for a sufficiently large N n. This should
not be mistakenly understood as that every manifold can be specified by a
system of equations on RN given by globally defined functions. It is true
only for special manifolds such as the sphere S n or the Lie groups such as
O(n). Being a closed submanifold of RN only means that M n ⊂ RN can be
defined by a system of equations of rank N − n in a neighborhood of each
point x ∈ M n , i.e., only locally (and when we pass to a different point, the
system, generally, changes).
1
More precisely, certain general topological conditions should be satisfied by M n for
this to be true. These conditions are normally included in the definition of a manifold and
we discuss them in the next section.
9