Download Complex Numbers - Mathematical Institute Course Management BETA

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Foundations of mathematics wikipedia , lookup

Large numbers wikipedia , lookup

Wiles's proof of Fermat's Last Theorem wikipedia , lookup

Georg Cantor's first set theory article wikipedia , lookup

Non-standard calculus wikipedia , lookup

List of important publications in mathematics wikipedia , lookup

Real number wikipedia , lookup

Non-standard analysis wikipedia , lookup

Fundamental theorem of calculus wikipedia , lookup

Theorem wikipedia , lookup

Addition wikipedia , lookup

System of polynomial equations wikipedia , lookup

Mathematics of radio engineering wikipedia , lookup

Elementary mathematics wikipedia , lookup

Proofs of Fermat's little theorem wikipedia , lookup

Vincent's theorem wikipedia , lookup

Fundamental theorem of algebra wikipedia , lookup

Transcript
Version of 22 August 2016
Oxford University Mathematical Institute
Complex Numbers
Notes by Peter M. Neumann (Queen’s College)
1. Introduction
Students coming to Oxford to study mathematics arrive with a varied range of background
knowledge and training. In respect of complex numbers some will know much and have acquired
a useful confidence, others will be almost beginners. But it is likely that, however much or
little you have already learned, there will be less unanimity of language and perception than is
desirable in a large group of students who are moving on to study mathematics to a deep level.
The purpose of the two lectures, and therefore the purpose of these lecture notes, is to serve
as a reminder to those who have met the topic before, to introduce the ideas to those new to
the topic, and—perhaps most importantly—to establish a common language and notation.
2. Complex numbers
For us a complex number is an entity of the form a + bi. Here a, b are real numbers and
√
i = −1. The real number system R—which√
should be familiar to us all—consists of rational
numbers p/q and the irrational numbers like 2 or e that fit between them to make a system
that many of us picture using the ‘number line’
...
√
√
. . . − 27 . . . − 10 2 . . . − 1 .. 0 .. 1 . . . 7 3
...
27 . . .
. . . 65 537 . . .
...
It is the ingredient i of a complex number that took mathematicians of the 16th to 19th
centuries much time and mental readjustment to come to terms with. We learn quite early in
our lives that ‘minus times minus is plus’. As a consequence, the square of any real number
is non-negative, so the equation x2 = −1 has no real root. Complex numbers result from the
imaginative idea of simply inventing an entity i that is a root of this equation, that is, for
which i2 = −1. Indeed, the notation i is said to have been invented in recognition that this
must be an imaginary number having nothing to do with quantity and little to do with reality.
Note. Sometimes we’ll use the form a + bi, sometimes a + ib. There is no mathematical
difference. In some contexts there is a mild psychological difference.
As is well known, in the Real world, there are quadratic equations such as x2 + 1 = 0 and
− x + 1 = 0 that have no roots. Once we have complex numbers, however, all quadratic
equations with real coefficients have roots:
√
−b ± b2 − 4ac
2
If a 6= 0 and ax + bx + c = 0 then x =
. If b2 − 4ac, the so-called
2a
2
discriminant,
√
√is non-negative then the2 roots are real numbers; if b − 4ac < 0 then
b2 − 4ac = 4ac − b2 × i. Thus if b > 4ac then the equation has real roots and
if b2 − 4ac < 0 then roots exist as complex, non-real numbers.
x2
1
Exercise 2.1 Which of the following quadratic equations have real roots, which do not?
3x2 + 2x − 1 = 0;
2x2 − 6x + 9 = 0;
−4x2 + 7x − 9 = 0;
4x2 − 9x + 5 = 0.
3. Arithmetic of complex numbers
The collection of all complex numbers is written C (notation that is pretty much standard in
mathematics everywhere in the world). Addition and multiplication are basic binary operations
(in the precise sense that is to be explained in other lecture courses) on C. They are what you
would expect:
(a + bi) + (c + di) = (a + c) + (b + d)i
(a + bi) × (c + di) = (ac − bd) + (ad + bc)i .
That rule for multiplication comes simply from the calculation
(a + bi) × (c + di) = ac + adi + bci + bdi2
and exploiting the fact (or assumption?) that i2 = −1. We’ll come to division later.
Exercise 3.1 Put each of the following complex numbers into standard form a + bi:
√
(1 + 2i)(3 − i); (2 + i)(1 − 2i); (1 + i)4 ; (1 − 3i)3 .
4. The Argand diagram
The complex number a+bi may be identified with the point (a, b) of the euclidean plane R2 .
Just as we speak of ‘the real line’ when we represent real numbers geometrically on the number
line, so we speak of ‘the complex plane’ (also known as ‘the Argand diagram’) when we picture
C as a 2-dimensional geometrical object.
If z ∈ C and z = a + bi then a is known as the real part Rez (or Rez or Rz —notation
varies from book to book and from lecturer to lecturer but its meaning should always be clear
from the context) and b is known as the imaginary part Imz (or Imz or I z ). It is important
that R ⊆ C in the sense that
R = {z ∈ C | Imz = 0}.
In fact this is not just a containment as set, it is containment as arithmetical structure. For,
(a+0i)+(c+0i) = (a+c)+0i and (a+0i)×(c+0i) = (ac)+0i, so the arithmetic of real numbers
identified as a special kind of complex numbers is precisely the same as the arithmetic of R.
We often express this by writing R 6 C rather than R ⊆ C. With this identification R becomes
the x-axis in the complex plane.
The y -axis consists of numbers of the
form bi.
These are generally known
as ‘pure imaginary’ complex numbers.
Clearly, any complex number is uniquely
the sum of a real number and a pure imaginary number.
Addition of complex numbers may be
represented geometrically in the Argand
diagram by the so-called parallelogram
law, as shown in the diagram. There is
also a geometrical description of multiplication, but it is more complicated and will
be discussed later.
2
Exercise 4.1. Show that there is no binary relation < on C for which the following
conditions are satisfied:
• for any z ∈ C exactly one of z < 0, z = 0, 0 < z holds;
• if z, w ∈ C, 0 < z and 0 < w then 0 < z + w ;
• if z ∈ C and z 6= 0 then 0 < z 2 .
5. Conjugate and modulus of a complex number
For z = a + bi ∈ C the complex conjugate z (for which you might sometimes see the
notation z ∗ , though very rarely in the Oxford mathematics courses) is defined by z := a − bi.
Interpreted geometrically complex conjugation is reflection of the Argand diagram in the real
axis. Here is an easy, but disproportionately useful and important fact about it:
Theorem C1:
If z, w ∈ C then z + w = z + w and z w = z w .
Because this is a very important fact, it is important that you yourself have internalised the
reasons why it is true. That is why the proof is left as an exercise—if it gives you trouble then
please consult your Tutors or write me a note.
Exercise 5.1.
Write out a proof of this theorem.
For z = a + bi ∈ C
or modulus (these terms are used interchangeably) |z|
√ the absolute value
a2 + b2 > 0 this real number has a non-negative square
is defined by |z| := a2 + b2 . Since √
root and this is what is meant by the
symbol. Thus |z| > 0, and |z| = 0 if and only if z = 0.
Geometrically, |z| is the distance of z from 0 in the complex plane.
Three little details about the modulus that follow directly from the definition are worth
noting. The first is that |z| = |z|. The second, that
√
x if x > 0,
2
if x is real then |x| = x =
−x if x < 0,
which accords with the definition of absolute value for real numbers (in their own right). And
the third is that if z = a + bi then, since a2 + b2 > a2 , |z| > |a|, that is, |Rez| 6 |z|. Similarly,
of course, |Imz| 6 |z|.
√
Theorem C2: (1) If z ∈ C then |z| = z z . (2) If z, w ∈ C then |z w| = |z||w|.
2
2 2
2
2
For the proof
√ of (1) note that if z = a + bi then z z = (a + bi)(a − bi) = a − i b = a + b ,
so that |z| = z z by definition of the modulus. Now to prove (2) calculate as follows:
|z w|2 = (z w)(z w) = (z w)(z w)
[by Theorem C1]
2
= (z z)(w w) = |z| |w|2 ,
and the desired result comes by taking square roots.
Suggestion: Get into the habit of working with complex numbers as single entities. There
is rarely any need to write them in terms of their real and imaginary parts. Our proof of (2)
illustrates this.
3
Theorem C3 [The Triangle Inequality]:
If z, w ∈ C then |z + w| 6 |z| + |w|.
Proof. Geometrically, this is the fact that the
length of any side of a triangle is always at most the
sum of the lengths of the other two sides. In this
case the relevant triangle has its vertices at 0, z and
z + w in the complex plane. The side from 0 to z
has length |z|, the side from z to z + w has length
|w|, and the side from 0 to z + w has length |z + w|.
It is perhaps worth seeing an alternative, more algebraic proof.
|z + w|2 = (z + w)(z + w) = (z + w)(z + w) = zz + w z + z w + w w .
Now w z = z w and so w z + z w = 2Re(z w). As was noted immediately after the definition of
absolute value, |Re(z w)| 6 |z w|. We also have |z w| = |z||w| = |z||w|. Therefore
|z + w|2 = |z|2 + 2Re(z w) + |w|2 6 |z|2 + 2|z||w| + |w|2 .
Thus |z + w|2 6 (|z| + |w|)2 , and taking positive square roots we find that |z + w| 6 |z| + |w|.
Exercise 5.2. Let z, w ∈ C and suppose that z 6= 0. Show that |z + w| = |z| + |w|
if and only if there is a real number λ > 0 such that w = λz ,
6. Division of complex numbers
We have just seen that if z ∈ C then z z = |z|2 and that if z 6= 0 then |z|2 > 0. It follows
that
z
1
1
a
b
if z 6= 0 then
= 2 , that is,
= 2
−
i.
z
|z|
a + bi
a + b2 a2 + b2
w×z
wz
w
=
=
. Thus C is very similar to Q or
z
z×z
|z|2
R in that it has addition, multiplication and division by non-zero numbers. These operations
satisfy the usual rules of arithmetic—rules that are familiar, but only to those who have taken
the time to stop and think what they are. In language to be introduced in other courses, like
Q and R the complex number system C is a field.
Therefore if z, w ∈ C and z 6= 0 then
7 − 2i
i
,
and a few other such complex number calculations
5 + 12i 1 − i
until you are confident that you fully understand division in C.
Exercise 6.1.
Exercise 6.2.
form a + bi:
Work out
Let z1 := 1 + i and z2 := 2 − 3i. Put each of the following into standard
z1 + z2 ;
z1 − z2 ;
z1 z2 ;
z1 /z2 ;
z1 z2 ;
z1 z2 .
7. The argument of a complex number
In plane geometry and its applications it is often useful to switch between cartesian and polar
coordinates. In the Argand diagram the polar coordinates of a complex number z are (r, α)
where r is the distance from 0 and α is the angle from the positive x-axis (the positive real
line) anti-clockwise to the ray joining 0 to z . It should be clear from the definition of absolute
value that r = |z|. The angle α is known as the argument arg z . In principle it is many-valued.
4
For any integer k the angle α + 2kπ would serve just as well. Many mathematicians would
express this by saying that arg z is only determined “modulo 2π ”.
Two important details about the argument are worth noting: that arg 0 is not defined, and
that arg z = − arg z .
Sometimes one specifies that −π < arg z 6 π to make
the argument single-valued. This single-valued version of
arg z is known as its principal value. Another convention
that some authors use is that the principal value should
satisfy 0 6 arg z < 2π , but this version does not combine
well with the equation arg z = − arg z .
Switching back and forth between polar and cartesian
coordinates requires trigonometry. If z = a + bi ∈ C \ {0}
and r = |z|, α = arg z then a = r cos α, b = r sin α. So
in terms of modulus and argument, z = r(cos α + i sin α).
Going the other way, if√z = a + bi 6= 0 and r := |z|,
α := arg z then r = a2 + b2 (of course—this is how
|z| was defined) and, since b/a = r sin α/r cos α = tan α,
we find α as tan−1 (b/a)—where, however, the inverse tangent function has (as always) to be
interpreted with great care.
Exercise 7.1.
Find the modulus and argument of each of the following complex numbers:
√
1 + 3i; (2 + i)(3 − i); (1 + i)5 .
Exercise 7.2.
On separate Argand diagrams sketch each of the following subsets of C:
A := {z |z| < 1};
B := {z Rez = 3};
C := {z − 14 π < arg z < 14 π};
D := {z arg(z − i) = 12 π}; E := {z |z − 3 − 4i| = 5}; F := {z |z − 1| = |z − i|} .
We have seen in Theorem C2 that if z, w ∈ C then |z w| = |z||w|. How is arg(z w) related
to arg z and arg w ? Well, if α := arg z and β := arg w , so that z = |z|(cos α + i sin α),
w = |w|(cos β + i sin β), then
z w = |z||w|(cos α + i sin α)(cos β + i sin β).
Using the trigonometrical addition formulae for sine and cosine we see that
(cos α + i sin α)(cos β + i sin β) = (cos α cos β − sin α sin β) + i(sin α cos β + cos α sin β)
= cos(α + β) + i sin(α + β).
Thus we have proved the following companion to Theorem C2(2):
Theorem C4:
If z, w ∈ C then arg(z w) = arg z + arg w .
Note. It is important here that arg z is not restricted to the principal value. Even if
−π < arg z 6 π and −π < arg w 6 π , it could well happen that arg z + arg w > π or
arg z + arg w < −π .
The theorem indicates that the argument function behaves somewhat like a logarithmic
function. In fact, it is an important constituent of the complex logarithmic function—but that
is material for a more advanced study than can be offered in this short course.
Theorem C4 yields a geometric interpretation of multiplication as promised above, at the
end of §4. As a transformation of the Argand diagram, the operation ‘multiply by w ’ leaves the
5
point 0 unmoved, scales distances from 0 by a factor |w|, and rotates the plane anticlockwise
about 0 through an angle arg w . It is worth remembering that this anticlockwise convention
for rotations is standard not just in the context of the complex plane, but also in geometry, in
mechanics—in most of mathematics. There is more to the convention, though: if α > 0 then
an anticlockwise rotation through angle −α is a clockwise rotation through angle α.
Exercise 7.3. For each of the following complex numbers w , what transformation of the
Argand diagram does multiplication by w represent?
i;
(1 + i);
(1 − i);
(3 + 4i).
8. De Moivre’s Theorem
If θ ∈ R then cos2 θ + sin2 θ = 1. Therefore complex numbers of the form cos θ + i sin θ
are of absolute value 1. Conversely, we have seen that if |z| = 1 and α := arg z then
z = cos α + i sin α. Define
S 1 := {z ∈ C |z| = 1},
[standard notation]
the unit circle in the complex plane. Described a little differently, S 1 = {cos θ +i sin θ | θ ∈ R }.
We know, since absolute values are multiplicative, that if z, w ∈ S 1 then also z w ∈ S 1 ,
that is, S 1 is closed under the multiplication of complex numbers; moreover, if z ∈ S 1 then
z −1 = z ∈ S 1 . In terms that some of you will already have met and all of you will meet before
long, this means that S 1 is a group. It is a very important group known as the circle group.
Theorem C5 [De Moivre’s Theorem]. If z ∈ S 1 , θ := arg z and n ∈ Z then arg(z n ) = nθ .
Equivalently, (cos θ + i sin θ)n = cos nθ + i sin nθ .
For n > 0 this comes quickly by mathematical induction: it is certainly true if n = 0 (since
= 1 and arg 1 = 0) or if n = 1; if it is true when n = m then arg(z m+1 ) = arg(z m )+arg z =
m. arg z + arg z (by inductive assumption) and so arg(z m+1 ) = (m + 1) arg z , that is, it is then
true when n = m + 1.
z0
If n < 0, say n = −N where N > 0, then z n = (z −1 )N = z N , and so arg(z n ) = N arg z =
N (− arg z) = −N arg z = n arg z . Thus the assertion is true for all integers n.
Note. Both for De Moivre’s Theorem and for its proof it is important that we do not
have to construe an argument θ as being an angle in the range (−π, π]. Even if θ lies in this
range, nθ need not.
De Moivre’s Theorem has many uses. One of them is to express the cosines and sines of
multiples of an angle θ as polynomials in cos θ and sin θ . For example:
cos 3θ + i sin 3θ = (cos θ + i sin θ)3
= cos3 θ + 3i cos2 θ sin θ + 3i2 cos θ sin2 θ + i3 sin3 θ
= (cos3 θ − 3 cos θ sin2 θ) + i(3 cos2 θ sin θ − sin3 θ) ,
and, by equating real and imaginary parts, we find that
cos 3θ = cos3 θ − 3 cos θ sin2 θ = cos3 θ − 3 cos θ (1 − cos2 θ)
and
= 4 cos3 θ − 3 cos θ
6
sin 3θ = 3 cos2 θ sin θ − sin3 θ = 3(1 − sin2 θ) sin θ − sin3 θ
= −4 sin3 θ + 3 sin θ .
Exercise 8.1.
Show that if θ ∈ R then
cos 5θ = 16 cos5 θ − 20 cos3 θ + 5 cos θ;
Exercise 8.2.
sin 5θ = (16 cos4 θ − 12 cos2 θ + 1) sin θ .
Show that if m is a positive integer then (1 + i)4m = (−1)m 22m .
9. Roots of unity
If z ∈ C, n ∈ N and z n = 1 then z is known as a root of unity—an nth root of unity, to be
precise. For such numbers, since |z|n = |z n | = 1 and |z| > 0, we have |z| = 1. Thus complex
roots of unity lie in S 1 .
Theorem C6. Let z ∈ C. Then z is an nth root of unity if and only if |z| = 1 and there
exists an integer k such that arg z = 2k π/n.
Proof. Suppose first that |z| = 1 and arg z = 2k π/n for some integer k . Then |z n | = 1
and, by De Moivre’s Theorem, arg(z n ) = n × 2k π/n = 2k π . Thus z n has the same modulus
and the same argument as 1, and therefore z n = 1.
Conversely, if z n = 1 then, as we have already seen, |z| = 1, and moreover, if α := arg z
then by De Moivre’s Theorem again, nα = arg 1 = 2k π for some integer k , and therefore
arg z = 2k π/n.
Corollary: For any positive integer n the number of nth roots of 1 is n.
For, by Theorem C5, the nth roots of 1 are the complex numbers cos(2k π/n)+i sin(2k π/n)
for 0 6 k 6 n − 1, and there are precisely n of these.
Examples: The square roots of 1 in C are just its real square roots 1 and −1;
the fourth roots of 1 in C are 1, i, −1, −i.
Note. If z is an nth root of unity and z m 6= 1 for each natural number m in the range
0 < m < n then z is said to be a primitive nth root of unity.
Theorem C7. Let z ∈ C and let n be an integer > 2. If z is a primitive nth root of unity
then z n−1 + z n−2 + · · · + z 2 + z + 1 = 0.
For, let z be a primitive nth root of unity. Since
0 = z n − 1 = (z − 1)(z n−1 + · · · + z + 1)
and z 6= 1 (since n > 1), it must be the case that z n−1 + · · · + z + 1 = 0.
Example. Let ω be a primitive cube root of 1. Then ω 2 + ω + 1 = 0.√ Solving this
√
quadratic equation in the usual way we find that ω = 21 (−1 ± −3) = 12 (−1 ± i 3).
I find it comforting that this is consistent with the facts that
ω = cos(2π/3) + i sin(2π/3)
or
ω = cos(4π/3) + i sin(4π/3),
7
and
1
cos(2π/3) = cos(4π/3) = −√
2
sin(2π/3) = − sin(4π/3) = 21 3.
Exercise 9.1. Write down the primitive 6th roots of unity and the primitive 8th roots of
unity in standard form a + bi.
Exercise 9.2.
Is 3 + 4i a root of unity? Is
3
5
+ 45 i a root of unity?
Exercise 9.3. Let φ := cos(2π/5) + i sin(2π/5), a primitive 5th root of 1.
α := φ + φ4 , β := φ2 + φ3 .
Define
(i) Show that α and β are real numbers.
(ii) Show that α + β = −1 and αβ = −1 (so that α, β are the roots of x2 + x − 1 = 0).
√
(iii) Deduce that cos(2π/5) = 14 ( 5 − 1).
Exercise 9.4. Let p be a prime number let ω be a primitive pth root of unity and let m
be a positive integer. Show that
p−1
X
ω
mk
=
k=0
0 if p does not divide m,
p if p divides m.
10. A formula discovered by Euler
We observed in §7 above that Theorem C4 seems to say that the argument behaves somewhat like a logarithm. A famous formula published by Leonhard Euler in 1748,
eiθ = cos θ + i sin θ ,
explains, or at least is consistent with, the logarithmic nature of the argument. In this notation
Theorem C4 says that
eiα eiβ = ei(α+β) ,
just as we would expect of the exponential function.
The exponential function may be introduced in many different ways. An important approach
is to say that ex is that function f (x) which is its own derivative and for which f (0) = 1.
That such a function exists is not obvious, however. An alternative is to use a power series,
defining ex to be the infinite sum (once a sense can be attached to the idea of adding infinitely
many numbers together)
x
x2
xn
1+ +
+ ··· +
+ ··· .
1!
2!
n!
Substituting x = iθ and separating the series into real and imaginary parts we find two power
series, recognisable as those for cos θ and for sin θ respectively, so that eiθ = cos θ + i sin θ .
Some of you may have seen this derivation before you arrived at Oxford. It begs rather a lot of
questions, though. In what sense can we add infinitely many numbers together? Is it sensible
to take a formula that is valid for real numbers and expect it to hold for complex numbers?
Certainly that is not always a good idea—for example, x2 6= −1 for all real numbers x, and
yet by definition i2 = −1 holds in C. One of the great successes of 19th century mathematics
was the development of answers to awkward questions and criticisms such as these. By the
end of the Prelim Analysis courses you will have acquired an understanding of what might, if
8
mathematicians had been unlucky, have gone wrong with the reasoning above and how we can
be sure that it leads in fact to perfectly valid results.
As a matter of fact, the reasoning sketched above is a little different from the line that Euler
took to discover his formula. In some ways he was even more inventive. Starting from the
equation (sin z)2 + (cos z)2 = 1, he took what was at the time the amazing step of introducing
√
√
√
−1 and factorising it as (cos z + −1 sin z)(cos z − −1 sin z) = 1. This historical matter
is of great interest to some of us but is not relevant to the Prelim course, so although I’ll say
a few more words about it I’ll put them into an appendix.
Let’s take Euler’s formula for granted. Then for a complex number z we have
z = r eiθ
where r := |z| and θ := arg z .
Although this formula expresses nothing more than the polar form of z , it does so in a notation
that is particularly suggestive and particularly easy to use in calculations. Here is one:
Theorem C8. Let z ∈ C and let n be a positive integer. Then z has nth roots in C. In
fact, if z 6= 0 then there are precisely n distinct complex numbers w such that wn = z .
The reason is this. If z = 0 the assertion is trivial, so now suppose that z 6= 0. Write
z = r eiα where r := |z| and α := arg z . Then r > 0 so we know (do we?—yes, methods for
a proof occur quite early in Analysis I) that there is a unique positive real number s (written
√
n
r ) such that sn = r . Now let β := α/n and let w := seiβ . Then wn = sn (eiβ )n = sn ei nβ =
r eiα = z .
This proves the existence of one nth root of z . If φ is any one of the nth roots of 1, of which
we know (from §9) that there are precisely n, then (φw)n = φn wn = wn = z : therefore z has
at least n distinct nth roots. On the other hand, if un = z then (u/w)n = un /wn = z/z = 1,
so u/w is an nth root of unity and therefore u = φw , where φ is one of the nth roots of 1:
therefore z has precisely n distinct nth roots.
Exercise 10.1.
For n := 2, 3, 4, 5 locate the nth roots of 16i on an Argand diagram.
11. Polynomial equations
We have known since early in our acquaintance with algebra that if ax2 + bx + c = 0 and
a 6= 0 then
√
−b ± D
x=
where D := b2 − 4ac.
2a
When we were young, and when mathematics was several centuries more primitive (or innocent?) than it is now, the number D (known as the discriminant of the equation) was expected
to be positive. Once a square root i of −1 had entered the scene every quadratic equation with
real coefficients turned out to have two roots in C. Well, not quite. Only one root if D = 0,
but in that case the equation may be rewritten as a(x + b/a)2 = 0 and its single root −b/a
may be deemed to be a root of multiplicity 2.
Those were real quadratic equations. We now have more numbers available to serve as
coefficients. The algebraic manipulations involved in completing the square to find the classic
formula for the roots of a quadratic equation work just as well in C as they do in R. Moreover,
by Theorem C7 every non-zero complex number D has precisely 2 distinct square roots. Thus
we have
Theorem C9. Every quadratic equation with complex coefficients has either 2 distinct roots
in C or a single root with multiplicity 2
9
Exercise 11.1.
Find the square roots of −7 + 24i.
Now solve the equation z 2 − (2 + 2i)z + (7 − 22i) = 0.
By Theorem C7, if c 6= 0 then the polynomial equation xn = c has precisely n roots in C.
And if c = 0 then we think of the equation as having 0 as a root of multiplicity n. In this
sense the equation always has n roots, counting multiplicities. Consider the problem of solving
the polynomial equation
a0 xn + a1 xn−1 + · · · + an−2 x2 + an−1 x + an = 0 .
It is assumed that a0 6= 0 since otherwise the equation would be of lower degree. It is also now
assumed that ai ∈ C for 0 6 i 6 n. Dividing through by a0 does not alter the roots and so
we treat equations f (x) = 0 where f (x) is a so-called monic polynomial,
f (x) = xn + c1 xn−1 + · · · + cn−2 x2 + cn−1 x + cn
with complex coefficients. The Factor Theorem (an elementary application of the Division
Algorithm for polynomials with complex coefficients) states that if u ∈ C and f (u) = 0 then
there is a polynomial g(x) such that f (x) = (x − u)g(x). Applying the rules for multiplying
polynomials we see that g(x) is monic and of degree n − 1. It follows easily that if f (x) = 0
has distinct roots u1 , u2 , . . . , um in C then there exists a monic polynomial gm (x) of degree
n − m that has complex coefficients and for which
f (x) = (x − u1 )(x − u2 ) · · · (x − um )gm (x) .
Consequently the polynomial equation f (x) = 0 of degree n has at most n roots in C.
So far so good. Nothing deeper than (advanced) High School mathematics here. This fact,
that a polynomial equation of degree n can have at most n roots, was seen, if through a glass
darkly, by Thomas Harriot in about 1605, by Girard (1629), by Descartes (1637), by Newton
(probably about 1667 but not published until 1707), and by many other mathematicians in
clearer and clearer form as the years went by.
The belief that a polynomial equation of degree n does have n complex roots if multiplicities
are taken into account came fifty years later. It gradually grew from about the middle of the
17th century. Its modern form could not properly be formulated until complex numbers were
introduced in the mid-18th century and became reasonably widely accepted early in the 19th
century. Nevertheless, it was understood quite a long time before that as the assertion that
any polynomial with real coefficients may be expressed as a product of linear and
quadratic factors.
In this form there were many attempted proofs, none of them sufficiently convincing. The following result, however, with a remarkable number of reasonably convincing proofs—C. F. Gauss,
for example, published his first in 1799 and his fourth in 1849, fifty years later)—is now a little
over 200 years old:
Theorem C10. When multiplicities are taken into account every complex polynomial of
degree n has exactly n roots. That is, if f (x) is a monic polynomial xn +c1 xn−1 +· · ·+cn−1 x+cn
with coefficients in C then there exist u1 , u2 , . . . , un ∈ C (not necessarily distinct) such that
f (x) = (x − u1 )(x − u2 ) · · · (x − un ).
Although this is called The Fundamental Theorem of Algebra its natural placement within mathematics lies in Analysis or Topology. Its proofs require ideas that lie a
10
long way beyond school mathematics, and somewhat beyond first-year university courses. A
very attractive proof will emerge from theorems proved in the Oxford second-year Complex
Analysis course. You may come across other proofs in courses on Topology. For your first year
at Oxford, however, we want you to take it on trust. That is, we want you to appreciate what
it says and believe it, but use it only when you cannot make progress without it.
Theorem C11. Let f (x) be a polynomial with real coefficients. If u ∈ C and f (u) = 0
then also f (u) = 0.
The reason is this. Write f (x) = a0 xn + a1 xn−1 + · · · + an−2 x2 + an−1 x + an where am ∈ R
for 0 6 m 6 n. Using Theorem C1 and induction on the number of summands we find that
f (u) = a0 u n + a1 u n−1 + · · · + an−2 u 2 + an−1 u + an ,
By assumption, however, am = am for 0 6 m 6 n and so f (u) = f (u). Thus f (u) = 0 if and
only if f (u) = 0.
What Theorem C10 means is that the non-real roots of a polynomial with real coefficients
occur in complex-conjugate pairs. Now if u ∈ C and u ∈
/ R, then (x − u)(x − u) is the
real quadratic polynomial x2 − 2Re(u)x + |u|2 . It follows from the Fundamental Theorem of
Algebra that any polynomial with real coefficients has a factorisation as a product of linear and
quadratic factors with real coefficients. As was mentioned above, this fact was appreciated (if
not proved) in ever clearer form from about the middle of the 17th century. By some time in
the 18th century (I do not know quite when) it was also realised that in fact Theorem C10 is
actually equivalent to Theorem C9 even although the latter appears at first sight to be much
stronger.
Exercise 11.2. Let (A) be the assertion that any polynomial with real coefficients may
be expressed as a product of linear and quadratic factors. Let (B) be the assertion that any
polynomial with complex coefficients may be expressed as a product of linear factors. Write
down straightforward proofs that if (A) is true then (B) must also be true, and that if (B) is
true then (A) must also be true.
Exercise 11.3. Show that σ 8 = −1 if and only if σ is a primitive 16th root of 1. Find the
roots of the equation x8 = −1, and use them to write x8 + 1 as the product of four quadratic
factors with real coefficients.
Acknowledgements: I am grateful to Professor Francis Kirwan and her predecessors as
lecturers on Complex Numbers for many ideas, for the majority of the questions in the exercise
sheet, and for the idea that some historical comments might be included. I thank my wife
Sylvia for her critical reading and careful proofreading of early drafts of these notes and of the
accompanying exercise material.
11
Appendix: Notes on Euler’s formula (1748)
Note 1. Leonhard Euler (b. 1707 in Basel, Switzerland; d. 1783 in St Petersburg, Russia)
was a wonderfully inventive and prolific mathematician, and a great teacher. There is a commonly held belief that the equation ei π = −1 is something that Euler was proud of—as he well
might have been. The general formula ei θ = cos θ + i sin θ was a discovery that Euler published
in 1748 but he never focussed on the special case θ = π .
z i
Euler’s argument starts from the description of ez as 1 +
where i is an
i
n
z
infinite magnitude—in present times this is written ez = lim (1 +
. From this he moved
n→∞
n
z
neatly to power series for e , cos z and sin z , but his argument is nevertheless not quite what
I have sketched in Section 10 above.
Note 2.
These images come from the bottom of p. 103 and the top of p. 104 of Euler’s Introductio in
analysin infinitorum (Lausanne, 1748). Although the language is√Latin and the typography is
√
different
from modern usage, at the end of §138 the formulae e+v −1 = cos v + −1. sin v and
√
√
e−v −1 = cos v − −1. sin v can easily be recognised.
ΠMN: Queen’s: 22.viii.2016
12