Download Handout on Complex Numbers and Roots of Polynomials

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bernoulli number wikipedia , lookup

System of polynomial equations wikipedia , lookup

Real number wikipedia , lookup

Transcript
Complex numbers and roots of polynomials
This handout covers some background material about complex numbers
and roots of polynomials that will be needed for solving linear DE’s.
Recall that a polynomial of degree n is an expression of the form
P (x) = an xn + an−1 xn−1 + · · · + a1 x + a0 ,
(1)
where an , an−1 , . . . a1 , a0 are constants, and an 6= 0. A root of (1) is a
number r such that
an rn + an−1 rn−1 + · · · + a1 r + a0 = 0.
(2)
The multiplicity of a root r is the largest integer k such that (1) can be
expressed in the form P (x) = (x − r)k Q(x), where Q(x) is another polynomial. Note that in this case Q(x) must have degree n − k.
For example, consider the polynomial
x5 − 2x4 + x3 = (x − 1)2 x3 = (x − 1)2 (x − 0)3 .
This polynomial has roots r1 = 1 of multiplicity k1 = 2 and r2 = 0 of
multiplicity k2 = 3. Note that k1 + k2 = 5, that is, the multiplicities of
the roots add up to the degree of the polynomial. This is true in the more
general case when a polynomial can be written as a product of linear factors
P (x) = an xn + an−1 xn−1 + · · · + a1 x + a0 = (x − r1 )k1 (x − r2 )k2 . . . (x − r` )k` ,
then k1 + k2 + · · · + k` = n, that is, the multiplicities of the roots add
up to the degree of the polynomial. A more informal way of expressing this
is to say that “P (x) has exactly n roots, counting multiplicities.”
Unfortunately, not every polynomial can be written as a product of linear
factors. Recall that a quadratic polynomial
ax2 + bx + c
(3)
has roots
√
√
b
b2 − 4ac
b
b2 − 4ac
r1 = − +
r2 = − −
.
(4)
2a
2a
2a
2a
The expression ∆ = b2 − 4ac is called the discriminant of (3). If ∆ > 0,
then (3) has two distinct real roots. If ∆ = 0, then (3) has one real root
1
b
r1 = r2 = − 2a
of multiplicity 2. In each of these two cases, the multiplicities
of the roots add up to the degree of the polynomial. If ∆ < 0, then (3) is
indecomposable and has no real roots.
Could we somehow define numbers r1 , r2 that could be considered distinct roots of (3) in this third case? Consider the simplest example of an
indecomposable polynomial P1 (x) = x2 + 1. It has no real√roots. But let
us imagine a number i such that i2 = −1, or, equivalently, −1 = i. Then
i2 + 1 = 0, and also (−i)2 + 1 = (−1)2 i2 + 1 = 0. Thus i and −i would be
distinct roots of P1 (x), of multiplicity 1 each. Again, the multiplicities of
the roots of P1 (x) would add up to its degree. Moreover, if such i exists,
then we could write the roots of any polynomial (3) with ∆ < 0 as
p
√
(−1)(4ac − b2 )
b
b
4ac − b2
=− +i
r1 = − +
2a p
2a
2a
√ 2a
2
(−1)(4ac − b )
b
b
4ac − b2
r2 = − −
=− −i
.
2a
2a
2a
2a
Now let a, b be arbitrary real numbers. An expression of the form
z = a + bi,
(5)
(6)
will be called a complex number. The number a is called the real part of
z and denoted by Re(z); the number b is called the imaginary part of z and
denoted by Im(z). Note that both Re(z) and Im(z) are real numbers. The
conjugate of a complex number z = a + bi is the expression z̄ = a − bi. Note
that the roots r√
1 , r2 in (5) are conjugate complex numbers, with Re(r1 ) =
2
b
− 2a , Im(r1 ) = 4ac−b
, and r2 = r̄1 .
2a
Real numbers are considered a special case of complex numbers, more
precisely, they are exactly the complex numbers z with imaginary part
Im(z) = 0, that is, they are the complex numbers of the form a + 0i.
Note that a complex number is real if and only if z̄ = z.
In order to consider complex numbers as bona fide numbers, we need to
define arithmetic operations on them. Let z1 = a + bi and z2 = c + di. Then
z1 + z2 = (a + c) + (b + d)i
z1 z2 = (ac − bd) + (bc + ad)i.
(7)
In order to see why multiplication is defined in this way, note that
(a+bi)(c+di) = ac+bci+adi+bdi2 = ac+bci+adi+bd(−1) = (ac−bd)+(bc+ad)i.
2
With these arithmetic operations, the roots r1 , r2 of any indecomposable
quadratic polynomial ax2 +bx+c can be expressed as two conjugate complex
numbers r1 = z1 and r2 = z̄1 (see formula (5)). Since every polynomial with
real coefficients can be factored as a product of linear and indecomposable
quadratic polynomials, we obtain the following:
Theorem 1 Suppose P (x) = an xn + an−1 xn−1 + · · · + a1 x + a0 is a polynomial of degree n with real coefficients. Then P (x) has exactly n complex
or real roots, counting multiplicities. Moreover, for each root z of P (x) of
multiplicity k that is not a real number, its conjugate z̄ is also a root of P (x)
with the same multiplicity k.
A few comments are in order. Since real numbers are also complex
numbers, we did not really need to write “complex or real” in the statement
of the theorem; we did so for better readability. And since each real number
is a complex number z with z̄ = z, the last sentence of the theorem is,
strictly speaking, also true when z is a real root of P (x). However, the
important point of the last sentence is that roots that are not real occur in
conjugate ate pairs of the same multiplicity.
If we allow the coefficients of the polynomial to be arbitrary complex
numbers, it still remains true that P (x) has exactly n complex roots, but
these are no longer required to appear in conjugate pairs. This more general
form of Theorem 1 is known as the Fundamental Theorem of Algebra.
How can we picture complex numbers geometrically? Real numbers are
usually represented as points on the real line, and they fill up this line.
There is no room for additional numbers such as i. The solution is to depict
complex numbers as vectors in a two-dimensional plane. The horizontal axis
will be labelled Re and called the real axis. The vertical axis wil be labelled
Im and called the imaginary axis. A complex number z = a + bi will be
represented as a vector with Cartesian coordinates (a, b). In particular, all
real numbers are of the form (a, 0) and will be depicted on the real axis in
the usual way. The number i = 0 + 1i has coordinates (0, 1). In general, the
vertical axis contains all purely imaginary numbers, that is, numbers of the
form 0 + bi.
In this geometric interpretation, addition of complex numbers is simply
vector addition. In order to see how multiplication of complex numbers
works in the complex plane, we use√representation of the vector (a, b) by
polar coordinates (r, Θ), where r = a2 + b2 and a = r cos Θ, b = r sin Θ.
This allows us to represent a complex number as
z = a + bi = r(cos Θ + i sin Θ).
3
(8)
In this representation, r is called the absolute value or modulus of z and
often denoted by |z| and Θ is called the angle or argument of z and often
denoted by arg(z). Now notice that according to (7) we obtain the product
of z1 = r1 (cos Θ1 + i sin Θ1 ) and z2 = r2 (cos Θ2 + i sin Θ2 ) as
z1 z2 = r1 (cos Θ1 + i sin Θ1 )r2 (cos Θ2 + i sin Θ2 )
= r1 r2 (cos Θ1 cos Θ2 − sin Θ1 sin Θ2 + i(cos Θ1 sin Θ2 + cos Θ2 sin Θ1 ))
= r1 r2 (cos(Θ1 + Θ2 ) + i sin(Θ1 + Θ2 )).
(9)
Thus multiplication in the complex plane, works by multiplying absolute values and adding angles. Note that for real numbers these operations
coincide with our familiar concepts: For positive real numbers, the angles
are zero and the absolute values are the numbers themselves. Negative
real numbers have angles of π; if we multiply two of them, we obtain a
number with angle 2π, which looks the same as an angle of zero. That is,
we obtain a positive real number. Moreover, note that i has an absolute
value of one and an angle of π2 ; in other words i = cos π2 + i sin π2 . Thus
i2 = cos π + i sin π = −1 + i0 = −1, exactly as we “imagined” at the start
of our construction of complex numbers.
Finally, we will need a way to calculate ez for complex numbers. For
that, define exponention of purely imaginary numbers as follows:
eiΘ = cos Θ + i sin Θ.
(10)
Equation 10 is called Euler’s formula. Now we can define exponentiation
of arbitrary complex numbers as
ea+bi = ea (cos b + i sin b).
(11)
Note that by (9) exponentiation of purely imaginary numbers satisfies
the property
eiΘ1 eiΘ2 = ei(Θ1 +Θ2 ) .
(12)
This, together with (11), implies for arbitrary real number x:
e(a+bi)x = eax (cos bx + i sin bx).
4
(13)