Download MATH 235.9 HW QUIZ 1 (1) (a) Two nonzero real vectors u, v ∈ R 2

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
MATH 235.9 HW QUIZ 1
ANDREW J. HAVENS
(1) (a) Two nonzero real vectors u, v ∈ R2 are said to be colinear if and only if there is a nonzero number
λ ∈ R such that u = λv. Let a, b, c and d be real numbers, and let
u :=
a
c
,
v :=
b
d
.
Show that u and v are colinear if and only if ad − bc = 0. What does it mean geometrically for
two vectors to be colinear?
Solution:
In one direction, if u and v are colinear we have some nonzero λ ∈ R such that
u=
a
c
=λ
b
d
= λv .
Thus, a = λb and c = λd. If none of a, b, c or d are zero, then we can solve for λ in each equation,
obtaining
λ=
c
a
=
=⇒ ad = bc =⇒ ad − bc = 0 .
b
d
If any one of a = 0 or b = 0 then by the relation a = λb, λ 6= 0, both a and b must be zero, whence
ad − bc = 0, and similarly if either of c = 0 or d = 0 then both must be zero, which again shows
ad − bc = 0.
Conversely, if it is known ad − bc = 0, then ad = bc. We seek a nonzero constant λ such that
u = λv. If a, b, c, and d are each nonzero, then a/b = c/d, and one can check readily that this
ratio gives a λ 6= 0 such that u = λv. On the other hand, if any one of a, b, c, or d is zero, then
both sides of the equation ad = bc vanish. Since u and v are each nonzero vectors, at least one
component of each vector is nonzero. So, if for instance a = 0, then c 6= 0, and thus b = 0, and
one can take λ = c/d, which is well defined since d 6= 0 as well. A symmetric argument works if
c = 0 or d = 0.
The geometric meaning of colinearity is that the vectors span the same line through the origin.
In particular, the λ in u = λv is a scale factor scaling v to u. Any linear combination of colinear
vectors u and v is also a vector lying on the same line, and thus is colinear to both u and v. In
consideration of the bonus establishing the determinant ad−bc as a signed area of the parallelogram
spanned by u and v, it is obvious that colinearity of the vectors is equivalent to the vanishing of
this quantity.
1
2
ANDREW J. HAVENS
(b) Let e and f be real numbers. Under what conditions on a, b, c, d, e and f can we express
e
f
as a linear combination of the vectors u and v above?
(?)
Solution:
This is a rephrasing of the results in class regarding solutions of the pair of linear equations
(
ax + by = e
.
cx + dy = f
e
In particular, w :=
can be expressed as a linear combination of u and v if and only if
f
there are real numbers x and y such that
a
b
e
xu + yv = w ⇐⇒ x
+y
=
,
c
d
f
which can be done if and only if the system of equations (?) has at least one solution.
We know that there’s a unique solution if ad − bc 6= 0 from class, and thus in this case a unique
linear combination giving the desired vector. If not, by part (a) u and v are colinear, and so we
require the vector w to be colinear to both. In particular, the equations af − ce = 0 = bf − de
must both hold. In this case, we are guaranteed infinitely many solutions (I leave the construction
of all possible solutions as an exercise.)
Bonus: Show that ad − bc is the signed area of the parallelogram spanned by u and v, where the sign
is positive if rotating u counter-clockwise to be colinear to v sweeps into the parallelogram, and
is negative otherwise. (Most convincing arguments use some amount of trigonometry; you are
encouraged to discuss this problem with me if you are unsure of how to proceed.)
Solution:
First, let us suppose u and v are unit vectors, i.e. a2 + c2 = 1 = b2 + d2 . Geometrically, they are
vectors lying on the unit circle, and so we can express their components as trigonometric functions of
the angles they make with the x axis. Let u make an angle of α with the x axis and v make an angle of
β with the x axis. Then the angle between the vectors is β − α, and from the sine subtraction formula:
sin(β − α) = cos(α) sin(β) − cos(β) sin(α) = ad − bc.
Recall that the area of a parallelogram is the base times an altitude, formed by taking an orthogonal
line segment from one side to an opposite side. From a picture, one sees that the area of a parallelogram
can be expressed as the product of side lengths times the sine of the internal angle between adjacent
sides. If the sides are the unit vectors u and v, then the area is | sin(β − α)|. Thus, for unit vectors,
ad − bc is ±area, with the sign positive if the angle β − α ∈ (0, π), negative if β − α ∈ (π, 2π), and 0
if the angle β − α = 0 or π (the colinear case). Thus, for the non-colinear case, if u sweeps into the
parallelogram when rotated counterclockwise towards v, the sign is positive. Note that switching the
order of the vectors switches the sign of the determinant ad − bc, and this is consistently reflected in
the convention regarding the vectors’ orientations.
For general vectors, one scales the area of the parallelogram as well as the components, and discovers
that the scale factors for the area and the equation ad − bc are identical: e.g. if we scale u by λ, then
the area scales by λ, and so do the components:
λa
λu =
,
λc
so the determinant scales to (λa)d − b(λc) = λ(ad − bc). Thus, the determinant is the signed area,
accounting for the orientation/ordering of the two vectors.
MATH 235.9 HW QUIZ 1
3
(2) You’ve probably been made aware of the fact that two points in Euclidean space determine a line, and
given the points in Cartesian coordinates you likely can give an equation for that line. A problem in a
similar spirit from algebra and analytic geometry is to find an equation for the graph of a polynomial
given the Cartesian coordinates of some sample points.
(a) Recall that a real quadratic function has the form f (x) = ax2 + bx + c for a, b, c ∈ R, with a 6= 0,
and the graph of y = f (x) is a parabola. Suppose you are given three distinct points in the
plane R2 . Under what condition(s) can you determine a real quadratic function whose graph is a
parabola passing through these points? (Try to intuit it; no proof is required, but you should try
to explain your reasoning.)
Solution:
The condition that our parabola be realizable as a graph of a quadratic function implies that no
pair of the three points lie on the same vertical line (so the x coordinates must all be distinct).
Moreover, since it is stipulated that the quadratic function be nontrivial (the leading coefficient
a is nonzero), we cannot have the three points colinear. That a parabola cannot contain three
colinear points is intuitive, but here is a simple proof (not required in your solutions): suppose
the parabola contains three points which lie on a common line. Then this line has an equation of
the form y = dx + e for some numbers d, e ∈ R. Since the line is coincident with the parabola at
three points, we expect the difference of the quadratic and the line to have three distinct roots.
This is impossible, since the resulting difference ax2 + (b − d)x + c − e = 0 is a quadratic provided
a 6= 0, whence it has at most two distinct solutions. (This last step uses the fundamental theorem
of algebra, a theorem whose proof is nontrivial.)
(b) Assuming the necessary conditions, write down a linear system of equations to find an equation of
the form y = ax2 +bx+c for the parabola passing through the three points (x1 , y1 ), (x2 , y2 ), (x3 , y3 ) ∈
R2 . What are the variables of your linear system? What are the coefficients?
Solution:
By plugging the points in, we obtain the system

 2
2

x1
ax1 + bx1 + c = y1
2

x22
←→
ax2 + bx2 + c = y2

 2
x23
ax3 + bx3 + c = y3
x1
x2
x3

1 y1
1 y2  .
1 y3
The coefficients are determined by the given points, and a, b, and c are the variables of the linear
system.
(c) Find an equation of the form y = ax2 + bx + c for the parabola containing the points (1, −1),
(2, 1), and (−2, 5).
Solution:
The associated system is




1 1 1 −1
a + b + c = −1
4a + 2b + c = 1 ←→ 4 2 1 1  .


4 −2 1 5
4a − 2b + c = 5
Applying Gauss-Jordan gives the solution a = 1, b = −1, and c = −1, so the equation is y =
x2 − x − 1.
(d) Describe how you would determine an equation of the form y = ax3 + bx2 + cx + d for a cubic
curve given some sample points. How many points are needed? Can you guess what conditions
on the sample points are necessary for the resulting system to yield a unique solution?
Solution:
Since there are four coefficients to find, we need four equations, and thus four distinct points.
Moreover, since we want the cubic curve to be a graph, we require that no two points lie on the
same vertical line (equivalently, all of the x values of the coordinates are distinct). There are
4
ANDREW J. HAVENS
two additional algebraic conditions: the four points cannot lie on a common line, or on the graph
of a quadratic function. The proof of this is in the same spirit of the proof that three colinear
points do not lie on the graph of a nontrivial quadratic. These conditions ensure that the resulting
linear system is consistent (i.e. that it may be solved, without producing contradictory equations).
Given the necessary conditions on our sample points the linear system necessary to determine the
coefficients has the augmented matrix

 3
x1 x21 x1 1 y1
x32 x22 x2 1 y2 

 3
x3 x23 x3 1 y3  .
x34 x24 x4 1 y4
Note that the coefficient matrix is eerily reminiscent of the Vandermonde matrix in problem
3. If we chose to reorder our variables, then we would permute the columns, and realize that
Vandermonde matrices are precisely the matrices which arise from the above kinds of polynomial
interpolation problems.
Bonus:
Suppose the points (x1 , y1 ), (x2 , y2 ), and (x3 , y3 ) satisfy the conditions you found in part (a).
A theorem of geometry states that there is then also a unique circle which contains these three
points. Write down (with justification) a linear system in some choice of variables from which
one can determine an equation for the circle (in particular, you should be able to determine its
center and radius.)
For extra points, write down explicit formulae for the center and radius in terms of (x1 , y1 ), (x2 , y2 ),
and (x3 , y3 ). You may even consider writing a program: it should accept three points as inputs,
warn the user if the points fail the criteria from part (a), and otherwise let the user choose to see
the equation of a circle or a parabola through these points, perhaps with the corresponding graph.
Solution:
We can in fact recover a circle so long as the three points are not all colinear. Indeed, when
three points are not a colinear set, they are the vertices of a triangle, and there is a unique
circle circumscribing this triangle, called the circumcircle. Thus, the task at hand is to know the
circumcircle of a given triangle.
The equation for a circle in R2 with center (b, c) and radius r is
(x − b)2 + (y − c)2 = r2 .
Expanding this, one has
x2 − 2bx + b2 + y 2 − 2cy + c2 = r2 .
Given b and c, we can recover r from the constant term in this equation, which after rearrangement
we find to be the quantity r2 − b2 − c2 . Let a := r2 − b2 − c2 . Then, given points (x1 , y1 ), (x2 , y2 ),
and (x3 , y3 ) which are not colinear, we obtain three equations



2
2
2
2

1 2x1 2y1 x21 + y12
a + 2bx1 + 2cy1 = x1 + y1
a + 2bx22 + 2cy22 = x22 + y22 ←→ 1 2x2 2y2 x22 + y22  .


1 2x3 2y3 x23 + y32
a + 2bx23 + 2cy32 = x23 + y32
There’s another, more geometric (or perhaps just more esoteric), method to arrive at a set of
linear equations from which one can recover the circumcircle from the triangle. It is a theorem
of Euclid (the proof of which is a good exercise) that the circumcenter is the intersection of
the perpendicular bisectors of the sides of the triangle. Of course, one only needs two of the
perpendicular bisectors to find the point of intersection of the three bisectors. The radius is then
easily recovered using the distance formula from the circumcenter to any one of the given points.
This has the advantage of producing a pair of linear equations, but at the cost of computing
the equations of two perpendicular bisectors. If one is armed with a compass and straightedge,
intersecting perpendicular bisectors is an efficient way to go, but armed with a computer and
Gauss Jordan, the above augmented matrix does things about as efficiently as possible for this
problem.
MATH 235.9 HW QUIZ 1
5
In the interest of reviewing some basic facts about lines in R2 , I discuss the Euclidean method of
finding the center. Observe that at most one side of the triangle made by our points is horizontal,
so we may assume, possibly after a relabeling of the points, that neither of the sides meeting at
the vertex (x1 , y1 ) are horizontal. Thus, these sides are segments along lines with slopes
m2 =
y2 − y1
,
x2 − x1
y3 − y1
.
x3 − x1
If we add the assumption that the points have distinct x coordinates, then we may assume none
of the sides are vertical so that both of these slopes are defined. This assumption isn’t necessary
if we later rewrite our equations in standard form. We make this assumption temporarily as it
eases our considerations of slopes. Recall that lines with nonzero, defined slopes m and m0 are
orthogonal if mm0 = −1. Thus the lines we desire have slopes
m3 =
m02 = −
x2 − x1
,
y2 − y1
x3 − x1
.
y3 − y1
Further, recall that one obtains the midpointof two points by averaging their coordinates. Thus,
1 yi +y1
the midpoints of these sides are xi +x
for i = 1, 2. We can then write down the equations
2 ,
2
of the perpendicular bisectors as
x2 − x1
x2 + x1
y2 + y1
y−
=−
x−
,
2
y2 − y1
2
x3 − x1
x3 + x1
y3 + y1
=−
x−
.
y−
2
y3 − y1
2
m03 = −
These equations are only defined if the slopes are, but the standard forms are defined even if we
admit a horizontal edge. Thus, one might write the system as
(
2(x2 − x1 )x + 2(y2 − y1 )y = x22 − x21 + y22 − y12
.
2(x3 − x1 )x + 2(y3 − y1 )y = x23 − x21 + y32 − y12
These are precisely the result of subtracting the first row of the augmented matrix from each of
the second and third rows! We may apply the formula from our initial investigation of solutions
to two linear equations, and obtain the coordinates of the center as
(y3 − y1 )(x22 − x21 + y22 − y12 ) − (y2 − y1 )(x23 − x21 + y32 − y12 )
2(x2 − x1 )(y3 − y1 ) − 2(x3 − x1 )(y2 − y1 )
y3 − y2 + (x2 + x1 )m02 − (x3 + x1 )m03
=
,
2m02 − 2m03
b=
(x2 − x1 )(x23 − x21 + y32 − y12 ) − (x3 − x1 )(x22 − x21 + y22 − y12 )
2(x2 − x1 )(y3 − y1 ) − 2(x3 − x1 )(y2 − y1 )
x3 − x2 + (y3 + y1 )m3 − (y2 + y1 )m2
=
.
2m3 − 2m2
c=
Thus, the radius can be obtained as
"
2
(y3 − y1 )(x22 − x21 + y22 − y12 ) − (y2 − y1 )(x23 − x21 + y32 − y12 )
r=
x1 −
2(x2 − x1 )(y3 − y1 ) − 2(x3 − x1 )(y2 − y1 )
(x2 − x1 )(x23 − x21 + y32 − y12 ) − (x3 − x1 )(x22 − x21 + y22 − y12 )
+ y1 −
2(x2 − x1 )(y3 − y1 ) − 2(x3 − x1 )(y2 − y1 )
2 #1/2
.
6
ANDREW J. HAVENS
The reduced row echelon form of the augmented matrix, for your viewing pleasure:


y32 x2 y1 +y32 (−y2 )x1 +y3 x22 x1 −y3 x2 x21 −y3 x2 y12 +y3 y22 x1 −x22 x3 y1 +x2 x23 y1 −y22 x3 y1 −y2 x23 x1 +y2 x3 x21 +y2 x3 y12
1 0 0
−y3 x2 +y3 x1 +x2 y1 +y2 x3 −y2 x1 −x3 y1


y32 y2 −y32 y1 −y3 x22 −y3 y22 +y3 x21 +y3 y12 +x22 y1 +y22 y1 +y2 x23 −y2 x21 −y2 y12 −x23 y1
,
0 1 0
−


2(y3 x2 −y3 x1 −x2 y1 −y2 x3 +y2 x1 +x3 y1 )
2
2
2
2
2
2
2
2
(2x2 −2x1 )(y3 +x3 −x1 −y1 )−(2x3 −2x1 )(x2 +y2 −x1 −y1 )
0 0 1
4y3 x2 −4y3 x1 −4x2 y1 −4y2 x3 +4y2 x1 +4x3 y1
(3) The matrix

1 a a2 a3
 1 b b 2 b3 

V := 
 1 c c2 c3 
1 d d2 d3
is an example of a Vandermonde matrix. Let vi,j denote the (i, j)th-entry of V.
(a) Assuming a, b, c and d are distinct real numbers, how many pivots does the matrix V have? If
a = c and b = d, how many pivots does V possess?

Solution:
If a, b, c and d are distinct, there are four pivots (see part b for a sequence of row operations proving
that each diagonal entry is a pivot position). If a = c and b = d, there are only two pivots, as the
matrix then contains two pairs of twin rows, which means two rows can each be made into the
zero row.
(b) A square matrix is said to be upper triangular if all nonzero entries occur on or above the main
diagonal (the main diagonal is the entries whose row and column numbers match, i.e. vi,i for
i = 1, 2, 3, 4).
Find a sequence of row operations to convert V into an upper triangular matrix. (Hint: use a
modified Gauss-Jordan algorithm; you should not change entries on the diagonal to 1 as you do
when putting a matrix into REF or RREF–doing so makes V a strictly upper triangular matrix,
but you will need to track the values on the diagonal if you intend to do the bonus.)
Solution:
See the solution for the bonus; I give a row reduction which simultaneously solves this and achieves
the property listed in the bonus.
Bonus: Find a series of row operations on V making V upper triangular such that the product of the
entries on the main diagonal of the final matrix is, up to a sign (depending on the row operations
you chose), the product of the differences of the distinct numbers a, b, c and d: i.e. if Ṽ is the final
upper triangular matrix, with entries ṽi,j , i, j = 1, 2, 3, 4, then
4
Y
ṽi,i = ±(a − b)(a − c)(a − d)(b − c)(b − d)(c − d) .
i=1
Solution:
Before proceeding, note that I have simplified my calculations using two factorizations: the difference of perfect squares factorization, and the difference of perfect cube factorization. For example,
b2 − a2 = (b − a)(b + a), and c3 − a3 = (c − a)(c2 + ac + a2 ). The sequence of row operations is


R2 − R1 7→ R2 ,
(i) R3 − R1 7→ R3 ,


R4 − R1 7→ R4 ,
(
R3 − c−a
b−a R2 7→ R3 ,
(ii)
R4 − d−a
b−a R2 7→ R4 ,
(d − a)(d − b)
(iii) R4 −
R3 7→ R4 .
(c − a)(c − b)
MATH 235.9 HW QUIZ 1
7
The pivotal moments (pun intended) in the sequence of augmented matrices are shown below:




1
a
a2
a3
1 a a2 a3
 1 b b2 b3  (i)  0 b − a b2 − a2 b3 − a3 




 1 c c2 c3  7−→  0 c − a c2 − a2 c3 − a3 
0 d − a d2 − a2 d3 − a3
1 d d2 d3


1
a
a2
a3

(ii)  0 b − a
b2 − a2
b3 − a 3

7−→ 
 0
0
(c − a)(c − b) (c − a)(c − b)(a + b + c) 
0
0
(d − a)(d − b) (d − a)(d − b)(a + b + d)


1
a
a2
a3

(iii)  0 b − a
b2 − a 2
b3 − a 3

7−→ 
 0
0
(c − a)(c − b) (c − a)(c − b)(a + b + c) 
0
0
0
(d − a)(d − b)(d − c)
The final matrix is upper-triangular, and the product of diagonal entries is
4
Y
ṽi,i = (b − a)(c − a)(c − b)(d − a)(d − b)(d − c)
i=1
= (−1)6 (a − b)(a − c)(a − d)(b − c)(b − d)(c − d)
= (a − b)(a − c)(a − d)(b − c)(b − d)(c − d) ,
as desired. Note that the product vanishes if and only if a, b, c and d are not all distinct, in which
case the matrix has less than four pivots.