* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Scalar-valued Functions of a Vector
Jordan normal form wikipedia , lookup
Exterior algebra wikipedia , lookup
Matrix multiplication wikipedia , lookup
Cayley–Hamilton theorem wikipedia , lookup
Eigenvalues and eigenvectors wikipedia , lookup
Euclidean vector wikipedia , lookup
Laplace–Runge–Lenz vector wikipedia , lookup
System of linear equations wikipedia , lookup
Covariance and contravariance of vectors wikipedia , lookup
Vector space wikipedia , lookup
Four-vector wikipedia , lookup
Vector field wikipedia , lookup
Neither Div nor Curl nor Both Constitute the Derivative Charles S. Barnett Adjunct Mathematics Instructor Las Positas College Presented at the 41st annual fall conference California Mathematics Council, Community Colleges Monterey, California December 13 and 14, 2013 1 2 Abstract The Div and Curl operators are derivative-like but are only aspects of the derivative of a map from 3-space to 3-space. The derivative of such a given map is a linear map from 3-space to 3-space. By employing the Frechet difference quotient at the outset, we can construct that derivative without use of the heavy machinery of advanced analysis. The process, results, and properties parallel those of the one-dimensional case. Attribution 1. H. K. Nickerson, D. C. Spencer, and N. E. Steenrod, Advanced Calculus, Dover Publications, 2011. Unabridged republication of the work originally published in 1959 by the D. Van Nostrand Company. 2. Spivak, Michael, Calculus on Manifolds: A Modern Approach to Classical Theorems of Advanced Calculus,” W. A. Benjamin, Inc., New York, Amsterdam, 1965. 3 3. H. M. Schey, Div, Grad, Curl, and All That: An Informal Text on Vector Calculus, W.W. Norton & Company, Inc., New York, 1973. 4 Outline Alternate definition of the derivative of a real-valued function of a real variable (the 𝑓: ℝ → ℝ case) The Frechet derivative of a scalar-valued function of a vector (the 𝑓: ℝ3 → ℝ case) The Frechet derivative of a vector-valued function of a vector (the 𝐹: ℝ3 → ℝ3 case) The divergence and curl of a vector field An application: the divergence and curl of a steady flow Generalizations and the chain rule 5 Scalar-valued Functions of a Scalar (1) Situation: ℝ represents the real numbers. 𝑓: ℝ → ℝ . Goal: Define the derivative of 𝑓 at 𝑥, 𝑓 ′ (𝑥 ) . For (𝑥, 𝑦) ∈ ℝ2 consider the following limit: 1 [𝑓 (𝑥 + ℎ𝑦) − 𝑓 (𝑥 )] ℎ⟶0 ℎ 𝒻 ′ (𝑥, 𝑦) = lim If this limit exists, then 𝒻 ′ (𝑥, 𝑦) is called “the derivative of 𝑓 at 𝑥 with respect to 𝑦.” 6 Scalar-valued Functions of a Scalar (2) Claim: 𝒻 ′ (𝑥, 𝑦) is linear in the variable 𝑦. Pick and fix 𝒶 in ℝ and consider 𝒻 ′ (𝑥, 𝒶𝑦). Show first that 𝒻 ′ (𝑥, 𝒶𝑦) = 𝒶𝒻 ′ (𝑥, 𝑦). Well, 1 ) 𝒻 𝑥, 𝒶𝑦 = lim [𝑓 (𝑥 + ℎ𝑎𝑦) − 𝑓 (𝑥 )] ℎ→0 ℎ ′( 𝑎 [𝑓(𝑥 + ℎ𝑎𝑦) − 𝑓 (𝑥 )] ℎ→0 ℎ𝑎 = lim 7 1 = 𝑎 lim [𝑓(𝑥 + 𝑡𝑦) − 𝑓 (𝑥 )] 𝑡→0 𝑡 (𝑡 = ℎ𝑎) = 𝑎𝒻 ′ (𝑥, 𝑦) . Therefore, 𝒻 ′ (𝑥, 𝑎𝑦) = 𝑎𝒻 ′ (𝑥, 𝑦) Scalar-valued Functions of a Scalar (3) Now let 𝑦 and 𝑧 denote two non-zero elements of ℝ and consider 𝒻 ′ (𝑥, 𝑦 + 𝑧). 1 𝒻 ′ (𝑥, 𝑦 + 𝑧) = lim [𝑓(𝑥 + ℎ𝑦 + ℎ𝑧) − 𝑓 (𝑥 )] ℎ→0 ℎ 1 = lim [𝑓(𝑥 + ℎ𝑦 + ℎ𝑘𝑦) − 𝑓 (𝑥 )] ℎ→0 ℎ for some 𝑘 ∈ ℝ 1 = lim [𝑓(𝑥 + ℎ(1 + 𝑘)𝑦) − 𝑓(𝑥 )] ℎ→0 ℎ 8 = (1 + 𝑘)𝒻 ′ (𝑥, 𝑦) = 𝒻 ′ (𝑥, 𝑦) + 𝑘𝒻 ′ (𝑥, 𝑦) = 𝒻 ′ (𝑥, 𝑦) + 𝒻 ′ (𝑥, 𝑘𝑦) = 𝒻 ′ (𝑥, 𝑦) + 𝒻 ′ (𝑥, 𝑧). So, 𝒻 ′ (𝑥, 𝑦) is linear in the second variable. Scalar-valued Functions of a Scalar (4) Definition: The derivative of 𝑓 at 𝑥, denoted by 𝑓 ′ (𝑥 ), is the linear map from ℝ → ℝ defined by (𝑓 ′ (𝑥 ))(𝑦) = 𝒻 ′ (𝑥, 𝑦). Examples (old chestnuts in new shells): 9 1. 𝑓 is a constant function, 𝑓 (𝑥 ) = 𝑘, where 𝑘 is a constant. Then (𝑓 ′ (𝑥 ))(𝑦) = 0, i.e. 𝑓 ′ (𝑥 ), sends all 𝑦 to zero for every 𝑥 ∈ ℝ. Argument: 𝑓(𝑥 + ℎ𝑦) − 𝑓 (𝑥 ) = 𝑘 − 𝑘 = 0. 2. 𝑓 is a linear map, i.e., 𝑓 (𝑥 ) = 𝑎𝑥, where 𝑎 ∈ ℝ is an arbitrary but fixed number. Then 𝑓 ′ (𝑥 ) is a constant map, i.e., (𝑓 ′ (𝑥 ))(𝑦) = 𝑎𝑦 for every 𝑥 ∈ ℝ, 𝑦 ∈ ℝ. Argument: 𝑓(𝑥 + ℎ𝑦) − 𝑓 (𝑥 ) = 𝑎(𝑥 + ℎ𝑦) − 𝑎𝑥 = ℎ𝑎𝑦. Scalar-valued Functions of a Scalar (5) Examples (old chestnuts in new shells), cont’d: 3. 𝑓 (𝑥 ) = 𝑥 2 implies that 𝑓 ′ (𝑥 ) = (2𝑥 ), i.e., 𝑓 ′ (𝑥 )(𝑦) = (2𝑥 )𝑦 all (𝑥, 𝑦) ∈ ℝ2 10 Argument: 𝑓(𝑥 + ℎ𝑦) − 𝑓 (𝑥 ) = (𝑥 + ℎ𝑦)2 − 𝑥 2 = 𝑥 2 + 2ℎ𝑥𝑦 + ℎ2 𝑦 2 − 𝑥 2 = 2ℎ𝑥𝑦 + ℎ2 𝑦 2 Now, divide by ℎ and pass to the limit to see that 𝒻 ′ (𝑥, 𝑦) = (2𝑥 )𝑦 ⟹ 𝑓 ′ (𝑥 ) = 2𝑥. Scalar-valued Functions of a Scalar (6) Best affine approximation to 𝒇 Pick and fix 𝑥0 ∈ ℝ. Then the mean value theorem says that 11 𝑓 (𝑥 ) = 𝑓(𝑥0 ) + 𝑓 ′ (𝑥0 )(𝑥 − 𝑥0 ) + 𝑒𝑟𝑟𝑜𝑟 . You will see direct parallels to this equation as we work our way up to functions from ℝ3 ⟶ ℝ3 . Scalar-valued Functions of a Vector (1) Situation: 𝑓: ℝ3 → ℝ 12 Goal: Define the derivative of 𝑓 at 𝑋, 𝑓 ′ (𝑋). For (𝑋, 𝑌) ∈ ℝ3 × ℝ3 , consider the following limit that defines function 𝒻 ′ : ℝ3 × ℝ3 → ℝ. ′( 1 [𝑓(𝑋 + ℎ𝑌) − 𝑓(𝑋)] ℎ⟶0 ℎ 𝒻 𝑋, 𝑌) = lim If this limit exists, then 𝒻 ′ (𝑋, 𝑌) is called “the derivative of 𝑓 at 𝑋 with respect to 𝑌”. Scalar-valued Functions of a Vector (2) 13 Claim: 𝒻 ′ (𝑋, 𝑌) is linear in the variable 𝑌. First, show that 𝒻 ′ (𝑋, 𝑎𝑌) = 𝑎𝒻 ′ (𝑋, 𝑌) for 𝑎 ∈ ℝ. Pick and fix 𝑎 in ℝ. If 𝑎 = 0, the difference quotient is 0. If 𝑎 ≠ 0, then 1 𝒻 𝑋, 𝑎𝑌) = lim [𝑓(𝑋 + ℎ𝑎𝑌) − 𝑓 (𝑋)] ℎ→0 ℎ 𝑎 [𝑓(𝑋 + ℎ𝑎𝑌) − 𝑓 (𝑋)] = lim ℎ→0 ℎ𝑎 ′( 1 = 𝑎 lim [𝑓(𝑋 + 𝑡𝑌) − 𝑓 (𝑋)] 𝑡→0 𝑡 = 𝑎𝒻 ′ (𝑋, 𝑌) 14 Scalar-valued Functions of a Vector (3) Now, consider 𝒻 ′ (𝑋, 𝑌 + 𝑍). We must show that 𝒻 ′ (𝑋, 𝑌 + 𝑍) = 𝒻 ′ (𝑋, 𝑌) + 𝒻 ′ (𝑋, 𝑍), where 1 𝒻 ′ (𝑋, 𝑌 + 𝑍) = lim ℎ [𝑓(𝑋 + ℎ𝑌 + ℎ𝑍) − 𝑓(𝑋)]. ℎ→0 There exists a version of the mean value theorem that we will need: If 𝒻 ′ (𝑋 + 𝑡𝑌, 𝑌) exists for 0 ≤ 𝑡 ≤ 1, then there exists 𝜃, 0 < 𝜃 < 1 such that 𝑓(𝑋 + 𝑌) − 𝑓 (𝑋) = 𝒻 ′ (𝑋 + 𝜃𝑌, 𝑌) (∗) This version can be established fairly easily from the standard (ℝ → ℝ) version. In the proof of linearity for the (ℝ → ℝ) case I used the fact that for any two nonzero real numbers 𝑥, 𝑦 there exists a third, 𝑘, say, such that 𝑥 + 𝑦 = 𝑥 + 𝑘𝑥. The above (ℝ3 → ℝ) version (∗) of the mean value theorem is used similarly to complete the proof of linearity for the present case. The details, somewhat involved, appear on page 5 of the handout. 15 So, 𝒻 ′ (𝑋, 𝑌) is linear in 𝑌 for every 𝑋. Scalar-valued Functions of a Vector (4) Definition: The derivative 𝑓 ′ (𝑋) of 𝑓 at 𝑋 is the linear map 𝑓 ′ (𝑋): ℝ3 → ℝ defined by (𝑓 ′ (𝑋))(𝑌) = 𝒻 ′ (𝑋, 𝑌). 16 Scalar-valued Functions of a Vector (5) The Gradient Let 𝑌 ∈ ℝ3 Expand 𝑌 in the standard basis. Invoke the fact that 𝒻 ′ (𝑋, 𝑌) is linear in 𝑌 and expand 𝒻 ′ (𝑋, 𝑌) via 𝑌’s components 𝑦1 , 𝑦2 , 𝑦3 . Arrive at 3 ′( 𝒻 𝑋, 𝑌) = ∑ 𝑦𝑖 𝑖=1 𝜕𝑓 . 𝜕𝑥𝑖 A theorem from linear algebra says that if 𝑇: ℝ3 → ℝ is linear, then there exists a unique vector 𝐵 such that 𝑇(𝑌) = 𝐵 ∙ 𝑌. 17 So 𝒻 ′ (𝑋, 𝑌) = 𝐵 ∙ 𝑌, which implies that 𝐵 = ∇𝑓 (𝑋). Hence, 𝒻 ′ (𝑋, 𝑌) = ∇𝑓(𝑋 ) ∙ 𝑌. Scalar-valued Functions of a Vector (6) Observe that knowing ∇𝑓 (𝑋) is equivalent to knowing 𝑓 ′ (𝑋), but ∇𝑓 (𝑋) ∈ ℝ3 and 𝑓 ′ (𝑋): ℝ3 → ℝ. Best affine approximation to 𝒇 Pick and fix 𝑋0 ∈ ℝ3 . Then 𝑓(𝑋) = 𝑓(𝑋0 ) + (𝑓 ′ (𝑋0 ))(𝑋 − 𝑋0 ) + 𝑒𝑟𝑟𝑜𝑟. Or, in terms of the gradient, 𝑓(𝑋) = 𝑓(𝑋0 ) + ∇𝑓(𝑋0 ) ∙ (𝑋 − 𝑋0 ) + 𝑒𝑟𝑟𝑜𝑟. 18 Vector-valued Functions of a Vector (1) Situation: 𝐹: ℝ3 → ℝ3 For (𝑋, 𝑌) ∈ ℝ3 × ℝ3 , consider the following limit that defines ℱ ′ (𝑋, 𝑌): 1 ℱ ′ (𝑋, 𝑌) = lim [𝐹 (𝑋 + ℎ𝑌) − 𝐹 (𝑋)] ℎ→0 ℎ if the limit exists. Examples: 1. 𝐹: ℝ3 → ℝ3 is a constant function. 𝐹 (𝑋) = 𝐵 ∈ ℝ3 for every 𝑋. 𝐵 is a fixed vector. 19 Then 𝐹 (𝑋 + ℎ𝑌) − 𝐹 (𝑋) = 𝐵 − 𝐵 = 0 ∈ ℝ3 . So, ℱ ′ (𝑋, 𝑌) = 0 ∈ ℝ3 . 20 Vector-valued Functions of a Vector (2) 2. 𝐹: ℝ3 → ℝ3 is a linear map. 𝐹 (𝑋) = 𝑇(𝑋), where 𝑇 is linear. Then 𝐹 (𝑋 + ℎ𝑌) − 𝐹 (𝑋) = 𝑇(𝑋 + ℎ𝑌) − 𝑇(𝑋). = 𝑇(𝑋) + ℎ𝑇(𝑌) − 𝑇(𝑋) = ℎ𝑇(𝑌). Now, divide by ℎ and pass to the limit to see that ℱ ′ (𝑋, 𝑌) = 𝑇(𝑌) for all (𝑋, 𝑌) ∈ ℝ3 × ℝ3 21 Vector-valued Functions of a Vector (3) Claim: ℱ ′ (𝑋, 𝑌) is linear in the variable 𝑌. Let {𝐴1 , 𝐴2 , 𝐴3 } represent the usual basis in ℝ3 . Then 𝐹 (𝑋) = ∑3𝑖=1 𝑓𝑖 (𝑋)𝐴𝑖 , where the 𝑓𝑖 represent the component functions. Let 𝒻𝑖′ (𝑋, 𝑌) represent the derivative with respect to 𝑌 of the component functions. Earlier I sketched an argument that showed that the 𝒻𝑖′ are linear in the variable 𝑌. It can be shown that ℱ ′ (𝑋, 𝑌) inherits linearity in 𝑌 from the 𝒻𝑖′ . So, 1 ℱ 𝑋, 𝑌) = lim [𝐹 (𝑋 + ℎ𝑌) − 𝐹 (𝑋)] ℎ→0 ℎ ′( is linear in the variable 𝑌. Definition: The derivative 𝐹 ′ (𝑋) of 𝐹 at 𝑋 is the linear transformation 𝐹 ′ (𝑋): ℝ3 → ℝ3 defined by 22 (𝐹 ′ (𝑋))(𝑌) = ℱ ′ (𝑋, 𝑌) . Vector-valued Functions of a Vector (4) Examples (restatement in terms of 𝐹 ′ rather than ℱ ′ ): 1. The derivative of a constant function is the zero vector. So 𝐹 ′ (𝑋) transforms all of ℝ3 into the zero vector. 2. The derivative of a linear function is a constant. 𝐹 (𝑋) = 𝑇(𝑋), 𝑇 linear, implies that 𝐹 ′ (𝑋) = 𝑇 for every 𝑋. Now we can get back to more familiar, classical ground. 𝐹 ′ (𝑋) is a linear map. Given a basis, a linear map can be represented by a matrix relative to that basis. So, as above, let {𝐴1 , 𝐴2 , 𝐴3 } represent the standard basis. Then 𝑋 = ∑3𝑖=1 𝑋𝑖 𝐴𝑖 , 𝑌 = ∑3𝑖=1 𝑦𝑖 𝐴𝑖 , 𝐹 (𝑋) = ∑3𝑖=1 𝑓𝑖 (𝑋)𝐴𝑖 , where the scalar-valued 𝑓𝑖 are the component functions of 𝐹. 23 Then 𝜕𝑓 (𝐹 ′ (𝑋))(𝑌) = ∑3𝑖=1 ∑3𝑗=1 𝑦𝑗 𝜕𝑥 𝑖 𝐴𝑖 . (∗) 𝑗 Vector-valued Functions of a Vector (5) I have here invoked both the fact (shown earlier) that, for functions 𝑓: ℝ3 → ℝ, 𝜕𝑓 ′ 𝒻 ′ (𝑋, 𝑌) = ∑3𝑖=1 𝑦𝑖 , and the linearity of 𝐹 (𝑋 ) to arrive at Eqn.(∗). 𝜕𝑥𝑖 Observe from Eqn. (∗) that, relative to the standard basis, (𝐹 ′ (𝑋))(𝑌) is represented by 𝜕𝑓1 𝜕𝑥1 𝜕𝑓2 [𝐹 ′ (𝑋)][𝑌] = 𝜕𝑥1 𝜕𝑓3 [𝜕𝑥1 𝜕𝑓1 𝜕𝑥2 𝜕𝑓2 𝜕𝑥2 𝜕𝑓3 𝜕𝑥2 𝜕𝑓1 𝜕𝑥3 𝑦1 𝜕𝑓2 [𝑦2 ] , 𝜕𝑥3 𝑦 3 𝜕𝑓3 𝜕𝑥3 ] where here and below I use [∙] to indicate the matrix of ∙ . Best affine approximation to F 24 As in the previous (ℝ → ℝ) and (ℝ3 → ℝ) cases, we have a best approximation via Taylor’s formula: 𝐹 (𝑋) = 𝐹 (𝑋0 ) + (𝐹 ′ (𝑋0 ))(𝑋 − 𝑋0 ) + 𝑒𝑟𝑟𝑜𝑟. Divergence and Curl of a Vector Field (1) Given 𝐹: ℝ3 → ℝ3 , then 𝐹 ′ (𝑋) is a linear map from ℝ3 → ℝ3 . Linear maps may be broken into the sum of their symmetric and skew-symmetric parts. Let 𝐹 ′ (𝑋)∗ represent the adjoint of 𝐹 ′ (𝑋). Then 1 ′ ′( ) 𝐹+ 𝑋 = [𝐹 (𝑋) + 𝐹 ′ (𝑋)∗ ] = 𝑠𝑦𝑚𝑚𝑒𝑡𝑟𝑖𝑐 𝑝𝑎𝑟𝑡 2 and 1 𝐹−′ (𝑋) = 2 [𝐹 ′ (𝑋) − 𝐹 ′ (𝑋 )∗ ] = skew-symmetric part. Observe that 𝐹 ′ (𝑋) = sum of the two parts. 25 Divergence and Curl of a Vector Field (2) To simplify typography, until further notice let 𝑓𝑖𝑗 ≡ 𝜕𝑓𝑖 . 𝜕𝑥𝑗 Then the matrix representation of 𝐹 ′ (𝑋) is 𝑓11 [𝐹 ′ (𝑋)] = [𝑓21 𝑓31 𝑓12 𝑓22 𝑓32 𝑓13 𝑓23 ] , 𝑓33 and the matrix of 𝐹 ′ (𝑋)∗ is 26 𝑓11 [𝑓12 𝑓13 𝑓21 𝑓22 𝑓23 𝑓31 𝑓32 ] . 𝑓33 Divergence and Curl of a Vector Field (3) Then a bit of matrix algebra leads to the matrix representations of 2𝐹+′ (𝑋) and 2𝐹−′ (𝑋) They are 2𝑓11 [2𝐹+′ (𝑋)] = [𝑓12 + 𝑓21 𝑓13 + 𝑓31 𝑓12 + 𝑓21 2𝑓22 𝑓23 + 𝑓32 𝑓13 + 𝑓31 𝑓23 + 𝑓32 ] 2𝑓33 27 0 [2𝐹−′ (𝑋)] = [𝑓21 − 𝑓12 𝑓31 − 𝑓13 𝑓12 − 𝑓21 0 𝑓32 − 𝑓23 𝑓13 − 𝑓31 𝑓23 − 𝑓32 ] . 0 Divergence and Curl of a Vector Field (4) Definition: The Divergence of 𝐹 is a map 𝐷𝑖𝑣 𝐹: ℝ3 → ℝ, defined by 𝐷𝑖𝑣 𝐹 (𝑋) = 𝑇𝑟𝑎𝑐𝑒 𝐹 ′ (𝑋). The Trace of 𝐹 ′ (𝑋) is equal to the sum of the roots of the characteristic polynomial of 𝐹 ′ (𝑋), a property of the linear map that is invariant to matrix representation. The trace of any linear map from ℝ3 → ℝ3 is equal to the sum of 28 the diagonal elements of any matrix representation. Hence, the familiar version of the Divergence: 𝐷𝑖𝑣 𝐹 (𝑋) = 𝑓11 + 𝑓22 + 𝑓33 = 𝜕𝑓1 𝜕𝑓2 𝜕𝑓3 + + . 𝜕𝑥1 𝜕𝑥2 𝜕𝑥3 Observe also that 𝐷𝑖𝑣 𝐹 (𝑋) = 𝑇𝑟𝑎𝑐𝑒 𝐹+′ (𝑋) . Divergence and Curl of a Vector Field (5) Definition: The Curl of 𝐹 at 𝑋 is defined by 2𝐹−′ (𝑋)(𝑌) = [𝐶𝑢𝑟𝑙 𝐹 (𝑋)] × 𝑌 (∗) 29 How do we know that a vector that makes the right side of Eqn. (∗) equal to the left-hand side exists? Well, 2𝐹−′ (𝑋) is a skew-symmetric transformation and item A5 of the Appendix of the handout asserts that such a vector exists for such maps; we name it 𝐶𝑢𝑟𝑙 𝐹 (𝑋). Also, according to A5, if the matrix of a skewsymmetric map looks like 0 [−𝑎 −𝑏 𝑎 0 −𝑐 𝑏 𝑐] , 0 (∗∗) then the vector at issue (𝐶𝑢𝑟𝑙 𝐹 (𝑋) here) is −𝑐𝑖 + 𝑏𝑗 − 𝑎𝑘 . (∗∗∗) Divergence and Curl of a Vector Field (6) Compare the matrix representation of 2𝐹−′ (𝑋) and you see that 𝐶𝑢𝑟𝑙 𝐹 (𝑋) = (𝑓32 − 𝑓23 )𝑖 + (𝑓13 − 𝑓31 )𝑗 + (𝑓21 − 𝑓12 )𝑘 30 or, in the usual notation, 𝜕𝑓3 𝜕𝑓2 𝜕𝑓1 𝜕𝑓3 𝜕𝑓2 𝜕𝑓1 𝐶𝑢𝑟𝑙 𝐹 (𝑋) = ( − )𝑖 + ( − )𝑗 + ( − )𝑘 𝜕𝑥2 𝜕𝑥3 𝜕𝑥3 𝜕𝑥1 𝜕𝑥1 𝜕𝑥2 Divergence and Curl of a Steady Flow: an application (1) Consider a vector field 𝐹: ℝ3 → ℝ3 and the flow induced by the differential equation 31 𝑑𝑌 = 𝐹 (𝑌) . 𝑑𝑡 (⋆) Note that time does not appear on the right hand side of Eqn. (⋆), hence “steady flow.” Pick and fix 𝑋0 ∈ ℝ3 . Expand 𝐹 around 𝑋0 via Taylor’s formula; substitute the result into Eqn. (⋆) to yield 𝑑 (𝑌 − 𝑋0 ) = 𝐹 (𝑋0 ) + 𝐹 ′ (𝑋0 )(𝑌 − 𝑋0 ) + 𝑒𝑟𝑟𝑜𝑟 . 𝑑𝑡 (⋆⋆) Split 𝐹 ′ (𝑋0 ) into the sum of its symmetric and skew-symmetric components 𝐹 ′ (𝑋0 ) = 𝐹+′ (𝑋0 ) + 𝐹−′ (𝑋0 ) (⋆⋆⋆) and substitute into Eqn. (⋆⋆) to obtain 𝑑 (𝑌 − 𝑋0 ) = 𝐹 (𝑋0 ) + 𝐹+′ (𝑋0 )(𝑌 − 𝑋0 ) + 𝐹−′ (𝑋0 )(𝑌 − 𝑋0 ) + 𝑒𝑟𝑟𝑜𝑟 𝑑𝑡 (⋆⋆⋆⋆) Divergence and Curl of a Steady Flow: an application (2) 32 Now, consider the motion of the points within a small sphere centered at 𝑋0 at time zero, say. How has that spherical test ball changed in a small time 𝑡? The terms on the right-hand side yield an approximate answer. Consider first the motion induced by 𝑑 (𝑌 − 𝑋0 ) = 𝐹−′ (𝑋0 )(𝑌 − 𝑋0 ) 𝑑𝑡 1 = 𝐶𝑢𝑟𝑙 𝐹 (𝑋0 ) × (𝑌 − 𝑋0 ) 2 (⋇) Equation (⋇) says that the test ball rotates about an axis through 𝑋0 parallel to 1 𝐶𝑢𝑟𝑙𝐹 (𝑋0 ) with angular velocity 2 |𝐶𝑢𝑟𝑙𝐹 (𝑋0 )|. That motion is volumepreserving. 33 Divergence and Curl of a Steady Flow: an application (3) Consider next the motion described by 𝑑 (𝑌 − 𝑋0 ) = 𝐹+′ (𝑋0 )(𝑌 − 𝑋0 ) . 𝑑𝑡 (⋇⋇) 𝐹+′ (𝑋0 ) is a symmetric map centered at 𝑋0 . Let 𝜆1 , 𝜆2 , 𝜆3 represent its eigenvalues and 𝐵1 , 𝐵2 , 𝐵3 denote the corresponding orthonormal set of eigenvectors. Then the corresponding eigenvalues of the flow described by Eqn. (⋇⋇) are 𝑒 𝜆1 𝑡 , 𝑒 𝜆2 𝑡 , 𝑒 𝜆3 𝑡 . That flow distorts the spherical ball slightly into an ellipsoid. The volume of the ball changes by a factor that is the determinant of the transformation. That determinant is 𝑒𝑥𝑝(𝑡 𝑡𝑟𝑎𝑐𝑒 𝐹+′ 𝑋0 ) and its time rate of change when 𝑡 = 0 is 𝑡𝑟𝑎𝑐𝑒 𝐹+′ (𝑋0 ) = 𝑡𝑟𝑎𝑐𝑒 𝐹 ′ (𝑋0 ) = 𝐷𝑖𝑣 𝐹 (𝑋0 ). Finally, consider the motion induced by 𝑑 (𝑌 − 𝑋0 ) = 𝐹 (𝑋0 ) . 𝑑𝑡 (⋇⋇⋇) Equation (⋇⋇⋇) says that the ellipsoid experiences a small translation with velocity vector 𝐹 (𝑋𝑜 ). 34 35 The General 𝑭: ℝ𝒏 → ℝ𝒎 Case and the Chain Rule (1) The chain rule is so important in differential calculus that its multidimensional version deserves a brief discussion. Before doing so, I point out that our restriction to the 1 ≤ 𝑛 ≤ 3 case above was not necessary; I invoked the restriction because of the emphasis on the undergraduate calculus sequence. Let 𝑛 and 𝑚 represent natural numbers and consider 𝐹: ℝ𝑛 → ℝ𝑚 . If 𝐹 is differentiable at 𝑋, then its derivative can be defined via the Frechet difference quotient just as we did in the earlier cases. The process is identical to the one used for the 1 ≤ 𝑚, 𝑛 ≤ 3 case. The derivative is a linear transformation as before, and all goes through except the discussion of the Curl, which is unique to the three-dimensional case. With this generalization all cases are subsumed, again except for the Curl, by this general case. 36 The General 𝑭: ℝ𝒏 → ℝ𝒎 Case and the Chain Rule (2) Consider now the chain rule in the general case. Let 𝑚, 𝑛, and 𝑝 represent natural numbers. Situation: 𝐹: ℝ𝑛 → ℝ𝑚 is differentiable at 𝑋, and 𝐺: ℝ𝑚 → ℝ𝑝 is differentiable at 𝐹 (𝑋), then 𝐺 ∘ 𝐹 is differentiable at 𝑋 and (𝐺 ∘ 𝐹 )′ (𝑋) = 𝐺 ′ (𝐹 (𝑋)) ∘ 𝐹 ′ (𝑋), (∗) where ∘ represents function composition. Observe that in Eqn. (∗) 𝐺 ′ (𝐹 (𝑋)) is a linear map and 𝐹 ′ (𝑋) is a linear map; hence the composition of the two maps is a linear map, and all is well. Use brackets to denote matrix representation, and the matrix version of Eqn. (∗) becomes [(𝐺 ∘ 𝐹 )′ (𝑋)] = [𝐺 ′ (𝐹 (𝑋))][𝐹 ′ (𝑋)]. (∗∗) 37 The General 𝑭: ℝ𝒏 → ℝ𝒎 Case and the Chain Rule (3) Recall the one-variable chain rule. 𝑓: ℝ → ℝ, 𝑔: ℝ → ℝ (𝑔 ∘ 𝑓)′ (𝑥 ) = 𝑔′ (𝑓(𝑥 ))𝑓 ′ (𝑥 ) (∗∗∗) where the juxtaposition on the right-hand side of Eqn (∗∗∗) denotes multiplication, but could just as well have been written (𝑔 ∘ 𝑓)′(𝑥 ) = 𝑔′ (𝑓(𝑥 )) ∘ 𝑓 ′ (𝑥 ) (∗∗∗′ ) because, in the one-dimensional case, composition of linear functions is multiplication. Compare Eqn. (∗) and Eqn. (∗∗∗′ ) and you see that Eqn. (∗∗∗′ ) is just a special case of Eqn. (∗). 38 Nerd’s Bumper Sticker And God said ∇×E+ 𝜕𝐵 =0 𝜕𝑡 ∇×H− 𝜕𝐷 =𝐽 𝜕𝑡 ∇∙B=0 ∇∙D=ρ and there was light. 39 Closure The derivative is a linear map. The Divergence and Curl are aspects of that map that account for about 4/9 of its properties, an especially important fraction in applications. ________________________________________________________________________ “Some people can stay longer in an hour than others can in a week.” 40 William Dean Howells, American novelist, 1837-1920 41