Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
INVERSE FUNCTION THEOREM and SURFACES IN Rn Let f ∈ C k (U ; Rn ), with U ⊂ Rn open. Assume df (a) ∈ GL(Rn ), where a ∈ U . The Inverse Function Theorem says there is an open neighborhood V ⊂ U of a in Rn so that f|V : V → Rn is a homeomorphism onto its image W = f (V ), an open subset of Rn ; the inverse g = f −1 : W → V is a C k map. The inverse function theorem as an existence theorem. Given f : X → Y (X, Y Banach spaces) and y ∈ Y , we seek to solve the nonlinear equation: f (x) = y. The idea is that if f is close (in some sense) to a linear operator A ∈ L(X, Y ) for which the problem is uniquely solvable, and (for a given b ∈ Y ) we happen to know a solution a ∈ X to f (x) = b, then for values y close to b the equation should have a unique solution (close to a). For example, suppose f has the form f (x) = Ax + φ(x), where A ∈ L(X, Y ) is invertible (with bounded inverse) and φ : X → Y is Lipschitz, with a small Lipschitz constant Lip(φ). Then to solve Ax + φ(x) = y assuming the linear problem Av = w can be solved uniquely for any w ∈ Y (that is, A is invertible, and |v|X ≤ C|w|Y , where C = ||A−1 ||) we compute successive approximations, starting from an arbitrary a ∈ X and solving the sequence of linear problems with given ‘right-hand side’: Ax1 = y − φ(a), Ax2 = y − φ(x1 ), Ax3 = y − φ(x2 ), · · · Convergence of (xn ) is established by setting this up as a fixed point problem, for F (x) = A−1 [y − φ(x)]: x1 = F (a), x2 = F (x1 ), · · · Convergence to a fixed point is guaranteed provided F is a contraction of X: |F (x) − F (x̄)| ≤ λ|x − x̄|, where 0 < λ < 1. Since |F (x) − F (x̄)| = |A−1 [φ(x) − φ(x̄)]|, it suffices to require Lip(φ) < 1/||A−1 ||. Then the estimate (for two solutions of f (x) = y, f (x̄) = ȳ, y, ȳ ∈ Y : |x − x̄| = |A−1 [y − ȳ] − A−1 [φ(x) − φ(x̄)]| ≤ |A−1 |(|y − ȳ| + Lip(φ)|x − x̄|), 1 (1 − λ)|x − x̄| ≤ |A−1 ||y − ȳ|, λ = |A−1 |Lip(φ) < 1 shows that the inverse map g(y) = x is Lipschitz, with constant Lip(g) ≤ |A−1 |/(1 − λ). We recall here the standard fixed point theorem for contractions. Contractions have unique fixed points. Let (X, d) be a complete metric space, F : X → X a λ-contraction, where 0 < λ < 1: d(F (x), F (y)) ≤ λd(x, y), ∀x, y ∈ X. Then F has a unique fixed point p ∈ X, which is globally attracting (F n (x) → p, ∀x ∈ X.) Proof. Let x ∈ X. For the sequence of iterates xn = F n (x), the contraction property easily implies d(xm+1 , xm ) ≤ λm d(x1 , x) := λm δ, and then if n > m ≥ N: d(xn , xm ) ≤ d(xn , xn−1 ) + . . . d(xm+1 , xm ) ≤ (λn−1 + . . . λm )δ ≤ λN δ, 1−λ so (xn ) is a Cauchy sequence, and xn → p ∈ X. Then p is a fixed point: d(F (p), p) ≤ d(F (p), F (xn )) + d(xn+1 , p) ≤ λd(xn , p) + d(xn+1 , p) → 0. It follows easily from the contraction property that F can only have one fixed point. Typically the ‘perturbation’ φ of the invertible linear map A will satisfy a Lipschitz condition only in some open set U ⊂ X. Then given a point b ∈ Y for which the problem has a unique solution a ∈ U (that is, f (a) = b) and a nearby point y ∈ Y , we seek solutions x ∈ B̄r (a), (closed ball), where r > 0 is small enough that this ball is contained in U . The successive approximations scheme still works, provided we guarantee this ball is invariant under F . So we estimate, for x ∈ B̄r (a) (using a = A−1 [b − φ(a)]): |F (x) − a| = |A−1 [y − φ(x)] − a| = A−1 [y − b] − A−1 [φ(x) − φ(a)]| ≤ |A−1 |(|y − b| + Lip(φ)|x − a|) ≤ |A−1 |(s + Lip(φ)r), and this is bounded above by r provided we pick y ∈ Bs (b) (open ball in Y ), where s ≤ (1−λ)r (with λ = |A−1 |Lip(φ) < 1 as before). We summarize |A−1 | as follows: 2 Proposition 1. Perturbations of invertible linear maps are homeomorphisms. Let f : U → Y (U ⊂ X open) have the form f (x) = Ax + φ(x), where A ∈ L(X, Y ) is boundedly invertible and φ is Lipschitz with Lip(φ) < ||A−1 ||−1 . Then V = f (U ) is open in Y , and f : U → V is a homeomorphism with Lipschitz inverse. If U = X, f is a homeomorphism onto Y (with Lipschitz inverse). Turning to the case of f : U → Rn , where U ⊂ Rn is an open set, suppose f is differentiable at a ∈ U , with df (a) ∈ GL(Rn ) (that is, df (a) is an invertible linear map.) Then we have, for a function r : U → Rn : f (x) = f (a)+df (a)[x−a]+r(x) = df (a)[x]+φ(x), φ(x) = f (a)−df (a)[a]+r(x), so φ is Lipschitz in U if and only if r is (with the same Lipschitz constant.) Recall the condition ‘r is Lipschitz with s small constant, in a sufficiently small ball with center a’ corresponds exactly to strong differentiability at a. This motivates the hypothesis in the statement of the Inverse Function Theorem. Inverse Function Theorem. Let f : U → Rn (U ⊂ Rn open) be strongly differentiable at a ∈ U , and assume df (a) ∈ GL(Rn ) (that is, df (a) is an isomorphism.) Then there exists a neighborhood V ⊂ U of a so that: (i) The restriction f|V is a bi-Lipschitz homeomorphism onto its image W = f (V ), and W is open in Rn . (ii) The inverse map g = f −1 : W → V is strongly differentiable at b = f (a); (iii) If f ∈ C 1 (U ; Rn ), then V can be chosen so that g = f −1 is differentiable in W ; (iv) If f ∈ C k (U ; Rn ) (with k ≥ 1) then g ∈ C k (W ; Rn ). Proof. (i): Since f is strongly differentiable at a, we may find an open ball V = Br (a) so that for x, x̄ in V : f (x) = f (a)+df (a)[x−a]+r(x), where |r(x)−r(x̄)| ≤ λ|x−x̄| with λ|df (a)−1 | < 1. In other words (since f (a) − df (a)[a] is a constant) in this ball V , f is a perturbation of the isomorphism df (a). Conclusion (i) follows from the proposition. Prior to proving (ii) and (iii), we consider a general result on differentiability of the inverse (of a homeomorphism). Note that the inverse of the 3 homeomorphism f (x) = x3 of the real line is not differentiable at x = 0. This can only happen since f 0 (0) = 0. Differentiability of the inverse. Let f : U → V be a homeomorphism, where U, V are open in Rn , with inverse g : V → U . If f is differentiable at a ∈ U and df (a) ∈ GL(Rn ), then g is differentiable at b = f (a), with dg(b) = [df (a)]−1 . If f is strongly differentiable at a, then g is strongly differentiable at b. Proof. Defining s(w) (for w ∈ Rn with |w| < dist(b, ∂V )) by: g(b + w) = g(b) + [df (a)]−1 [w] + s(w), we need to show limw→0 s(w)/|w| = 0. Let v = g(b+w)−g(b), and compute: df (a)[v]+r(v) = f (a+v)−f (a) = f (g(b)+g(b+w)−g(b))−b = b+w−b = w. Thus: s(w) = v − [df (a)]−1 [df (a)[v] + r(v)] = −[df (a)]−1 [r(v)], so: |s(w)| |r(v)| |v| ≤ |df (a)−1 | , |w| |v| |f (a + v) − f (a)| and the claim follows from limv→0 r(v)/|v| = 0, the fact that v → 0 iff w → 0 (since f and g are both continuous), and the fact (proved earlier) that |f (a + v) − f (a)|/|v| is bounded below, for |v| sufficiently small (since df (a) is an isomorphism.) Turning to strong differentiability, if we set v = g(b+w)−g(b) (as before) and u = g(b + z) − g(b), it follows as above that we have the relation, for the first-order Taylor remainders (of f at a and of g at b): s(w) − s(z) = −[df (a)]−1 [r(v) − r(u)], w − z = df (a)[v − u] + r(v) − r(u). Recall f is strongly differentiable at a iff the remainder r satisfies a Lipschitz condition with arbitrarily small constant, in a sufficiently small ball with center a. As the following estimates show, this implies s has the same property, in a sufficiently small ball with center b; thus g is strongly differentiable at b. |w−z| ≥ |df (a)||v−u|−|v−u| ⇒ |v−u| ≤ (|df (a)|−)−1 |w−z| := M |w−z|. |s(w)−s(z)| ≤ |df (a)−1 ||r(v)−r(u)| ≤ |df (a)−1 ||v−u| ≤ M |df (a)−1 ||w−z| 4 Statement (ii) in the IFT follows directly from this. Proof (iii) in the IFT: If f ∈ C 1 (U, Rn ), since df : U → L(Rn ) is continuous, df (a) ∈ GL(Rn ) and GL(Rn ) is open in L(Rn ), we have df (x) ∈ GL(Rn ) for x in a neighborhood V1 ⊂ V of a. Then the lemma on differentiability of the inverse guarantees g = f −1 is differentiable at each point of W = f (V1 ). Before proving (iv) we must take a short detour. Lemma 1. The inversion map f (X) = X −1 is smooth from GL(Rn ) to L(Rn ) ∼ M (n) (n × n matrices.) Proof. First, f is continuous at any X ∈ GL(Rn ), since it is differentiable, with df (X)[V ] = −X −1 V X −1 . We know the directional derivative is ∂V f (X) = −X −1 V X −1 . Given V ∈ M (n), let BV : M (n) × M (n) → M (n) be the bilinear map BV (X, Y ) = XV Y . For fixed V ∈ M (n), the directional derivative map ∂V f : GL(Rn ) → M (n) is the composition: ∂V f = −B◦(f, f ), where (f, f ) : GL(n) → M (n)×M (n) maps X 7→ (X −1 , X −1 ). Thus ∂V f is continuous in GL(Rn ) for all V ∈ M (n), so f is C 1 . Then (f, f ) is C 1 , and since BV (being bilinear) is smooth, it follows the composition ∂V f is C 1 (for each V ∈ M (n)), so f is C 2 . Repeating this argument (or by induction) we see that f is a C k map, for each k ≥ 1. Corollary 1 (of lemma 1). Let f : U → V be a homeomorphism between open subsets U, V of Rn , of class C k (k ≥ 1). If g = f −1 is differentiable in V , then g is of class C k . Proof. It follows from the Chain Rule that, if g is differentiable in V , we have df (x) ∈ GL(Rn ) for all x ∈ U and dg(y) = [df (g(y))]−1 . Thus dg : V → L(Rn ) may be written as the composition: V → U → GL(Rn ) → L(Rn ) dg = Inv ◦ df ◦ g, where Inv : GL(Rn ) → L(Rn ) is the inversion map (which, according to the Lemma, is smooth.) Thus if f is C k (or df is C k−1 ), it follows dg is C k−1 (or g is C k .) Clearly this Corollary implies part (iv) of the IFT. Corollary 2 (of the proof of the IFT). (Differentiable perturbation of the identity.) Let U ⊂ Rn be open and convex, φ : U → Rn of class C 1 , with ||dφ(x)|| ≤ λ < 1 for all x ∈ U . Then f : U → Rn given by f (x) = x + φ(x) 5 is a diffeomorphism from U onto its image f (U ), an open subset of Rn . If U = Rn , then f (U ) = Rn . If f is C k (k ≥ 1), then g = f −1 is C k . Proof. Since U is convex, it follows φ is a λ-contraction. Thus f is a homeomorphism onto an open set f (U ). Since df (x) = In + dφ(x) and ||dφ(x)|| < 1, we know df (x) is an isomorphism, for each x ∈ U . Thus g = f −1 is differentiable in f (U ). (By definition, this says f is a diffeomorphism.) Exercise 1. Generalize this to f (x) = Ax + φ(x), where A ∈ GL(Rn ). (That is, state a precise theorem and prove it.) Implicit Function Theorem. Consider f : U → Rn differentiable, where U ⊂ Rm is open and m > n: m = n + p. Suppose we have a point a ∈ U where df (a) is surjective. We would like to use the Inverse Function Theorem to say something about the level set of f through a: M = {x ∈ U ; f (x) = b}, where b = f (a) ∈ Rn . The Implicit Function Theorem says that, locally in a neighborhood of a, M coincides with the graph of a map h from an open subset of Rp to Rn . To make this precise, let K = Ker(df (a)) be the nullspace of df (a), a p-dimensional subspace of Rm , and let E ⊂ Rm be a complementary (ndimensional) subspace, so that Rm = K ⊕ E is a direct sum decomposition and the restriction df (a)|E is a linear isomorphism from E to Rn . We may regard Rm as a cartesian product: Rm = K × E, and write x = (x1 , x2 ) with respect to this product decomposition. Theorem. Suppose f is strongly differentiable at a ∈ U , with df (a) ∈ L(Rm , Rn ) surjective (and m > n). There exists an open neighborhood V ⊂ U of a = (a1 , a2 ) in Rm , an open neighborhood W1 of a1 in K and a differentiable map h : W1 → E so that h(a1 ) = a2 and M ∩ V coincides with the graph of h: M ∩ V := {x ∈ V ; f (x) = b} = {(x1 , h(x1 )); x1 ∈ W1 }. If f is a C k map, then so is h. If f is C 1 in V , the differential of h at a point x ∈ W1 is: dh(x) = −[d2 f (x, h(x))]−1 ◦ d1 f (x, h(x)) ∈ L(K; E), where d1 f (x, h(x)) ∈ L(K; Rn ); 6 d2 f (x, h(x)) ∈ Iso(E; Rn ). Proof. Consider the map f¯ : U → K × Rn : f¯(x) = (π(x), f (x)), f¯(a) = (a1 , b), where π : Rm = K × E → K is projection onto the factor K. Then f¯ is strongly differentiable at a, with derivative: df¯(a)[v] = (π[v], df (a)[v]) ∈ K × Rn , v ∈ K × E ∼ Rm . This differential is an isomorphism, since df¯(a)[v] = 0 implies π[v] = 0, so v = (0, v2 ) ∈ {0} × E, hence df (a)[v] = 0 implies v = 0 (since the restriction of df (a) to {0} × E is an isomorphism to Rn .) Thus the Inverse Function Theorem implies the existence of neighborhoods V of a in Rm , W = W1 × W2 of (a1 , b) in K × Rn , so that f¯ : V → W is a diffeomorphism, with inverse g : W → V of class C k if f is C k . For y1 ∈ W1 , z ∈ W2 , g(y1 , z) = (g1 (y1 , z), g2 (y1 , z)) is mapped by f¯ to (g1 (y1 , z), f (g(y1 , z)), and since this must equal (y1 , z), it follows that g1 (y1 , z) = y1 , for all y1 ∈ W1 . Define: h : W1 → E, h(y1 ) = g2 (y1 , b). Clearly h(a1 ) = g2 (a1 , b) = a2 , since g(a1 , b) = (a1 , a2 ) (given f¯(a1 , a2 ) = (a1 , f (a2 )) = (a1 , b).) Since g ∈ C k (W, K × E), we have h ∈ C k (W1 , E). And we see that, for x = (x1 , x2 ) ∈ V ⊂ K × E: f (x) = b ⇔ f¯(x) = (x1 , b) ⇔ g(x1 , b) = x ⇔ g2 (x1 , b) = x2 ⇔ h(x1 ) = x2 . To compute the differential of h at x1 ∈ W1 , note that, with G : W1 → Rm , G(x1 ) = (x1 , h(x1 )): 0 = d(f ◦ G)(x1 ) = d1 f (x1 , h(x1 )) + d2 f (x1 , h(x1 )) ◦ dh(x1 ) ∈ L(K, Rn ), where d1 f (x) ∈ L(K, Rn ), d2 f (x) ∈ L(E, Rn ) are partial derivatives (with respect to the splitting Rm = K × E), with the second one an isomorphism for x ∈ V (taking a smaller V if needed), since d2 f (a) ∈ Iso(E, Rn ) and f is C 1 ; and dh(x1 ) ∈ L(K, E). Solving for dh(x1 ), we find: dh(x1 ) = −[d2 f (x1 , h(x1 ))]−1 ◦ d1 f (x1 , h(x1 )) ∈ L(K, E). Remark. In particular dh(a1 ) = 0, since d1 f (a) = df (a)|K×{0} = 0. Definition. For f : U → Rn of class C 1 (U ⊂ Rm open, m ≥ n), a point b ∈ Rn is a regular value of f if df (x) ∈ L(Rm , Rn ) is surjective, for each x in the level set Mb = {x ∈ U ; f (x) = b}. 7 Submersions are open maps. A C 1 map f : U → Rn (U ⊂ Rm open, m > n is a submersion if df (x) ∈ L(Rm , Rn ) is surjective, for each x ∈ U . Recall a map between topological spaces is open if it maps open subsets of X to open subsets of Y . Homeomorphisms are clearly open maps, and the composition of open maps is open. Proposition 2. (i) If f is strongly differentiable at a ∈ U , and df (a) is surjective, then there exists a neighborhood V of a so that the restriction f|V : V → Rn is an open map. (ii) If f is a C 1 submersion from U to Rn , then f is an open map. Proof. (i) In the proof of the Implicit function Theorem, we found a neighborhood V of a so that the map f¯(x) = (π(x), f (x)) ∈ K × Rn (where Rm = K × E is defined by choosing a complement E to the kernel K of df (a)) is a diffeomorphism from V to a neighborhood W1 × W2 of (a1 , b) in K × Rn (b = f (a), a = (a1 , a2 ) with a1 ∈ K, a2 ∈ E.) On V , f = π2 ◦ f¯, where π2 : W1 × W2 → W2 is projection onto the second factor W2 ⊂ Rn . Since π2 and f¯ are open, this shows f|V is open. (ii) From part (i), for each point x ∈ U there is a neighborhood Vx ⊂ U so that f|Vx is an open map. If A ⊂ U is open in U , we have A = ∪x∈U Vx ∩A, so f (A) = ∪x∈U f (Vx ∩ A), where Wx = f (Vx ∩ A) is open in Rn . Hence f (A) = ∪x∈U Wx is open in Rn . Remark. In particular, under the conditions of part(i), the point f (a) is in the interior of f (V ). Images of C k parametrizations. A C k map (k ≥ 1) f : U → Rn (U ⊂ Rn open, n > m) is an immersion if df (x) ∈ L(Rm , Rn ) is injective, for each x ∈ U . f is a C k parametrization of its image M = f (U ) ⊂ Rn if it is an injective immersion and defines a homeomorphism from U to M , where M ⊂ Rn has the induced topology. Proposition 3. Any image M ⊂ Rn of a C k parametrization φ : W0 → M = f (W0 ) (W0 ⊂ Rm open, m < n) is locally a graph. (That is, each point p ∈ M admits a neighborhood V ⊂ M in which M admits a parametrization as the graph of a C k map.) Proof. Let p ∈ M , p = φ(x0 ), x0 ∈ W0 . Consider the subspace Tp = dφ(x0 )[Rm ], and a complementary subspace Np ∈ Rm , which defines a direct sum decomposition Rm = Tp ⊕ Np ; let π : Rm → Tp be the associated 8 projection. (Note dim(Tp ) = m.)The composition π ◦ φ : W0 → Tp has differential at x0 given by: d(π ◦ φ)(x0 ) = dπ(p) ◦ dφ(x0 ). This is a linear isomorphism from Rm to Tp , since π is the identity on Tp and dφ(x0 ) is an isomorphism from Rm to Tp (φ is an immersion.) By the Inverse Function Theorem, we may find neighborhoods V0 ⊂ W0 of x0 and Dp ⊂ Tp of π(p) so that π ◦ φ is a diffeomorphism F : V0 → Dp (of class C k since φ is C k .) Let U = φ(V0 ) ⊂ M . This is an open neighborhood of p in M , since it coincides with π −1 (Dp ) ∩ M . Consider g : Dp → U given by g = φ|V0 ◦ F −1 . Clearly g is a homeomorphism (and of class C k , as the composition of C k maps). And π ◦ g = idDp shows that, with respect to the splitting Rm = Tp ⊕ Np , g has the form g(x) = (x, h(x)), for a C k map h : Dp → Np . Thus g is a C k graph parametrization of the open set U ⊂ M . Exercise 2. Let h : U → Rn be a C k map (k ≥ 1), U ⊂ Rm open. The graph of h is the subset of Rm × Rn : G = {(x, y) ∈ U × Rn ; y = h(x)} (i) Show that φ : U → Rm × Rn , φ(x) = (x, h(x)) is a C k parametrization of G. Hint: to show φ−1 is continuous on G, show that it is Lipschitz (by estimating from below |φ(x) − φ(y)|), or that it is the restriction to G of the projection from U × Rn to U . (ii) Show there exists F : U × Rn → Rn of class C k , with surjective differential at each point, so that G = F −1 (0) (the level set of 0 ∈ Rn .) m-dimensional surfaces in Rn . Definition. A non-empty subset M ⊂ Rn is an m-dimensional surface of class C k (also known as ‘m-dimensional submanifold of Rn , of class C k ’) if each p ∈ M admits a neighborhood U ⊂ M (in the topology induced from E) which is the image φ(U0 ) of a C k parametrization φ : U0 → E (U0 ⊂ Rm open.) Remark. It can be shown that there is no loss of generality in taking U0 connected in this definition; in fact we could assume U0 is homeomorphic to the open ball in Rm . 9 It follows from the work in the previous section that this definition is equivalent to either of the following two: (i) Each p ∈ M admits a neighborhood U ⊂ M (of the form U = W ∩ M, W ⊂ E open) so that for some C k submersion F : W → Rp (where p = dim(E) − m) we have U = F −1 (0). (ii) For each p ∈ M we may find a direct sum decomposition E = Tp ⊕Np , U1 ⊂ Tp open, a C k map h : U1 → Np and a neighborhood W of p in E so that U = M ∩ W = {(x, h(x)) : x ∈ U1 }. Colloquially, we speak of ‘M being given by local parametrizations’, ‘M being locally a regular level set’ or ‘M being locally a graph’. Remark. In the global sense, the three definitions describe different classes of subsets of Rn . For instance, no compact surface in Rn can be covered by a single parametrization (in particular, no compact surface can be globally a graph). There are also 2-surfaces in R3 (and compact 2-surfaces in R4 ) which are not globally level sets for a regular value (although this is always true for a simply-connected 2-surface in R3 .) The Implicit Function Theorem can be stated as saying that, if b ∈ Rn is a regular value for f : U → Rn (U ⊂ Rm open, m > n), then Mb is locally a graph, hence is a surface in Rn (of dimension m − n). Example 1. Consider the subset of R2 : 1 M = {(x, sin ); x > 0} ∪ {(x, y); x = 0, y ∈ (−1, 1)}. x This is not a one-dimensional submanifold of R2 , since it can be shown any submanifold is locally connected, and this set isn’t: any point on the vertical line segment has a neighborhood in M (in the induced topology) with infinitely many connected components. If we remove the vertical line segment, the resulting set is a submanifold of R2 (so we see that the disjoint union of two submanifolds of the same dimension is not always a submanifold of Rn .) Example 2. Consider the subset of R2 made up of a circle and one of its tangent lines: M = {(x, y) ∈ R2 ; y = 0 or x2 + (y − 1)2 = 1}. This is not a submanifold of R2 . Locally near any point, a submanifold must be expressible as a graph, with respect to some direct-sum splitting of Rn . 10 Thus it cannot have any “branching points” (as the origin is in this case). If we remove the origin, the resulting set is a submanifold. Tangent spaces. Let M ⊂ Rn be a C k m-surface (k ≥ 1, 1 ≤ m < n.) The tangent space to M at p is the set of all velocity vectors at p of C 1 curves contained in M . Precisely: Tp M = {v ∈ Rn ; v = α0 (0), where α : I → Rn , α(t) ∈ M ∀t, α(0) = p}. (Here I ⊂ R is any fixed open interval containing 0 and α is assumed to be C 1 .) Indeterminacy. Clearly many curves α yield the same v. For instance, let φ : I → I be a strictly increasing differentiable diffeomorphism of I fixing 0: t = φ(s), φ0 (s) > 0, φ(0) = 0. Then if φ0 (0) = 1, we have (α ◦ φ)0 (0) = α0 (0). A more serious problem is that it is not at all clear that this is a vector space! (At least it is easy to show it is a cone in Rn : v ∈ Tp M, λ ∈ R ⇒ λv ∈ Tp M.) We need the following proposition: Proposition 4. Let M be a C 1 m-surface in Rn , let p ∈ M . (i) If M is given near p by a parametrization φ : U0 → Rn , φ(x0 ) = p, φ(U0 ) = U ⊂ M , then: Tp M = dφ(x0 )[Rm ] = {v ∈ Rn ; v = dφ(x0 )[w] for some w ∈ Rm .} (This is clearly a subspace of Rn , of dimension m). (ii) If M is given near p by a graph parametrization (there is a neighborhood W of p in Rn = E ⊕ F and a map h : V → F , V ⊂ E open) so that M ∩ W = {(x, h(x)); x ∈ V } and p = (x0 , h(x0 )), x0 ∈ V , then: Tp M = {v ⊕ dh(x0 )[v], v ∈ E.} (iii) If M is given near p as the level set M ∩ W = {x ∈ W ; F (x) = b}, for some C 1 submersion F : W → Rn−m ,W a neighborhood of p in Rn , b = F (p), then: Tp M = Ker(dF (p)), a subspace of Rn . Remark: indeterminacy. The space in (i) might depend on φ (replacing φ by φ ◦ ψ, where ψ is a diffeomorphism of V0 fixing x0 gives another parametrization with domain V0 ). Similarly changing the splitting of Rn used in (ii) changes the map h, so we might get a different subspace. And the submersion F in (iii) is not uniquely defined, either. 11 The fact that these subspaces of Rn coincide is a consequence of the proposition, which shows they all coincide with the set given in the geometric definition (as the set of velocity vectors of curves through p.) But the following can be shown directly. If φ : U0 → U ⊂ M is a local parametrization of a neighborhood of p ∈ M with φ(x0 ) = p and F : U0 → U0 is a diffeomorphism of U0 fixing x0 , then (evidently) ψ = φ ◦ F is again a parametrization of U , with ψ(x0 ) = p. Then dψ(x0 )(Rm ) = dφ(x0 )(Rm ), since dF (x0 ) is an isomorphism of Rm . Exercise 3. (i) Show that if φ : U0 → U ⊂ M, ψ : U0 → U ⊂ M are parametrizations (U0 ⊂ Rm open) satisfying φ(x0 ) = ψ(x0 ) = p ∈ M , then dφ(x0 )(Rm ) = dψ(x0 )(Rm ). Hint: use the proof of proposition 5 at the end of this section to find a diffeomorphism F of U0 fixing x0 so that ψ = φ ◦ F , then the observation in the previous paragraph. (ii) Show that if p ∈ Rn is a regular value of f : U → Rn (U ⊂ Rm open, m > n) and φ : Rn → Rn is a diffeomorphism fixing p (φ(p) = p), then p is a regular value for g = φ ◦ f , the level sets {x|f (x) = p} and {x|g(x) = p} are equal (call them Mp ) and Ker(df (x)) = Ker(dg(x)) (as subspaces of Rm ), for each x ∈ Mp . Proof (of proposition 4). That the vector spaces in (i) and (ii) are contained in the set of velocity vectors of curves on M through p is clear. To prove that any velocity vector of a curve on M through p is in the vector space in (i) we need proposition 5 below. This shows that if α : I → Rn is a C 1 curve with image in U ⊂ M , then γ = φ−1 ◦ α : I → U0 is a C 1 curve. Let v0 = γ 0 (0) ∈ Rm . Then α0 (0) = dφ(x0 )[v0 ], as we wanted to show. Note that (ii) is a special case of (i). Having verified (i), we know the set of velocity vectors of curves on M through p is a vector space of dimension m. Under the hypothesis of part (iii), any such velocity vector is clearly contained in Ker(dF (p)); since both are m-dimensional vector spaces, they must coincide. Proposition 5. Let M ⊂ Rn be an m-dimensional surface of class C k , f : I ⊂ Rn be a C k map (I ⊂ Rp open). If f (I) ⊂ U , where U ⊂ M is the image of a C k parametrization φ : U0 → U (U0 ⊂ Rm open), then the composition φ−1 ◦ f : I → Rm is of class C k . Proof. It is enough to show φ−1 ◦ f is of class C k in a neighborhood of an arbitrary point t0 ∈ I. This would follow easily if we could show that, in some neighborhood V = W ∩ M ⊂ U of p = f (t0 ) = φ(x0 ), the inverse 12 φ−1 : V → U0 extends to a C k map g : W → Rm . To see this, consider the splitting Rn = E ⊕ F , where E = dφ(x0 )(Rm ) and F is any complementary subspace of Rn . Let {v1 , . . . , vn−m } be a basis of F , and extend φ to Φ : U × Rn−m → Rn , as follows: Φ(x, y) = φ(x) + n−m X yi v i , (x, y) ∈ U × Rn−m . i=1 The differential of Φ at (x0 , 0) ∈ U × Rn−m is: X dΦ(x0 , 0)[(v, w)] = dφ(x0 )[v] + wi vi ∈ E ⊕ F, (v, w) ∈ Rm × Rn−m . It is clear that this is an isomorphism of Rn . By the Inverse Function Theorem, there are neighborhoods V0 ⊂ U0 of x0 in Rm , W0 of 0 in Rn−m and W of p in Rn so that Φ is a C k diffeomorphism from V0 × W0 to W . Let g = Φ−1 : W → V0 × W0 . Since Φ(x, 0) = φ(x) ∈ W ∩ M := V ⊂ U for x ∈ V0 , we see that the restriction of g to V coincides with (φ−1 )|V . Observing that in the neighborhood f −1 (V ) ⊂ I of t0 we have φ−1 ◦ f = g ◦ f and that the latter composition is C k concludes the proof of the proposition. Problems from [Fleming]. 1. (7, p.147). Let g : U → Rn be a C 1 map (U ⊂ Rn open), with dg(x) ∈ GL(Rn ) for all x ∈ U . Suppose y0 6∈ g(U ), and let ψ(x) = |g(x)−y0 |2 . Show that dψ(x) 6= 0, for any x ∈ U . (This means: for any x ∈ U , we may find v ∈ Rn so that dψ(x)[v] 6= 0). Hint: Use fact that if A ∈ GL(Rn ), we have AT [w] 6= 0 if w 6= 0, so v = AT w satisfies hw, A[v]i = 6 0. Here the superscript T denotes ‘transpose’. 2. (8, p.147). Let g : Rn → Rn be a C 1 map. Suppose there exists c > 0 so that |g(x) − g(y)| ≥ c|x − y|, for all x, y ∈ Rn . Show that: (i) g is injective; (ii) dg(x) ∈ GL(Rn ) for all x ∈ Rn ; (iii) g(Rn ) = Rn . Hint: For (ii), assume dg(x)[v] = 0 for some 0 6= v ∈ Rn , and use the definition of differential to contradict the condition given. For (iii): By contradiction, if y0 6∈ g(Rn ), consider the function ψ(x) = |g(x) − y0 |2 . By the previous problem, this function has no critical points. On the other hand, ψ attains its infimum (call it M ) over Rn : let (xn ) be 13 a sequence in Rn so that ψ(xn ) = |g(xn ) − y0 |2 → M . Then (g(xn )) is a bounded sequence. Show this implies (xn ) is bounded, and therefore has a convergent subsequence. Conclude. 3. (11. p. 160). Let C = (cij ) ∈ GL(Rn ) be a symmetric matrix, and consider M = {x ∈ Rn ; hx, Cxi = 1} (for the usual inner product in Rn .) (i) Show that M is a compact submanifold of Rn , of dimension n − 1. Hint: M = {x ∈ Rn |F (x) = 1} for a submersion F : Rn → R. Don’t forget to show M is compact! (ii) Show that the equation of the tangent space Tx0 M , x0 ∈ M , is: Tx0 M = {x ∈ Rn ; hx, Cx0 i = 0}. 14