Download 3 Basics of Polynomial Theory

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Addition wikipedia , lookup

Mathematics of radio engineering wikipedia , lookup

List of important publications in mathematics wikipedia , lookup

Recurrence relation wikipedia , lookup

Horner's method wikipedia , lookup

Algebra wikipedia , lookup

Vincent's theorem wikipedia , lookup

Polynomial wikipedia , lookup

Fundamental theorem of algebra wikipedia , lookup

Factorization of polynomials over finite fields wikipedia , lookup

System of polynomial equations wikipedia , lookup

Transcript
3
Basics of Polynomial Theory
3.1 Polynomial Equations
In geodesy and geoinformatics, most observations are related to unknowns parameters through equations of algebraic (polynomial) type.
In cases where the observations are not of polynomial type, as exemplified by the GPS meteorology problem of Chap. 13, they are converted
via Theorem 3.1 on p. 20 into polynomials. The unknown parameters
are then be obtained by solving the resulting polynomial equations.
Such solutions are only possible through application of operations addition and multiplication on polynomials which form elements of polynomial rings. This chapter discusses polynomials and the properties
that characterize them. Starting from the definitions of monomials,
basic polynomial aspects that are relevant for daily operations are presented. A monomial is defined as
Definition 3.1 (Monomial). A monomial is a multivariate product
of the form xα1 1 xα2 2 . . .xαnn , (α1 , . . ., αn ) ∈ Zn+ in the variables x1 , . . ., xn .
In Definition 3.1 above, the set Zn+ comprises positive elements of the
set of integers (2.2) that we saw in Chap. 2, p. 8.
Example 3.1 (Monomial). Consider the system of equations for solving
distances in the three-dimensional resection problem given as (see e.g.,
(11.44) on p. 180)
 2
x1 + 2a12 x1 x2 + x22 + ao = 0
 x22 + 2b23 x2 x3 + x23 + bo = 0

 x23 + 2c31 x3 x1 + x21 + co = 0
where x1 ∈ R+ , x2 ∈ R+ , x3 ∈ R+ .
18
3 Basics of Polynomial Theory
The variables {x1 , x2 , x3 } are unknowns ©while the other terms areª
known constants. The products of variables x21 , x1 x2 , x22 , x2 x3 , x23 , x3 x1
are monomials in {x1 , x2 , x3 }.
Summation of monomials form polynomials defined as
Definition 3.2 (Polynomial). A polynomial f ∈ k [x1 , . . . , xn ] in
variables x1 , . . . , xn with coefficients in the field k is a finite linear
combination of monomials with pairwise different terms expressed as
f=
X
α
aα xα , aα ∈ k, xα = (xα1 , . . ., xαn ), α = (α1 , . . ., αn ), (3.1)
where aα are coefficients in the field k, e.g., R or C and xα the monomials.
Example 3.2 (Polynomials). Equations
 2
x1 + 2a12 x1 x2 + x22 + ao = 0
 x22 + 2b23 x2 x3 + x23 + bo = 0
x23 + 2c31 x3 x1 + x21 + co = 0,
in Example 3.1 are multivariate polynomials. The first expression is a
multivariate polynomial
{x1 , x2 } and a linear combiª
© in two variables
nation of monomials x21 , x1 x2 , x22 . The second expression is a multivariate polynomial© in two variables
{x2 , x3 } and a linear combination
ª
of the monomials x22 , x2 x3 , x23 , while the third expression is a multivariate polynomial© in two variables
{x3 , x1 } and a linear combination
ª
2
2
of the monomials x3 , x3 x1 , x1 .
In Example 3.2, the coefficients of the polynomials are elements of
the set Z. In general, the coefficients can take on any sets Q, R, C of
number rings or other rings such as modular arithmetic rings. These
coefficients can be added, subtracted, multiplied or divided, and as such
play a key role in determining the solutions of polynomial equations.
The definition of the set to which the coefficients belong determines
whether a polynomial equation is solvable or not. Consider the following
example:
Example 3.3. Given an equation 9w2 − 1 = 0 with the coefficients in
the integral domain, obtain the integer solutions. Since the coefficient
9 ∈ Z, the equation does not have a solution. If instead the coefficient
1
9 ∈ Q, then the solution w = ± exist.
3
3.2 Polynomial Rings
19
From Definition 2.1 of algebraic, polynomials become algebraic once
(3.1) is equated to 0. The fundamental problem of algebra can thus be
stated as the solution of equations of form (3.1) equated to 0.
3.2 Polynomial Rings
In Sect. 2.3 of Chap. 2, the theory of rings was introduced with respect
to numbers. Apart from the number rings, polynomials are objects
that also satisfy ring axioms leading to “polynomial rings” upon which
operations “addition” and “multiplication” are implemented.
3.2.1 Polynomial Objects as Rings
Polynomial rings are defined as
Definition 3.3 (Polynomial ring). Consider a ring R say of real
numbers R. Given a variable x ∈
/ R, a univariate polynomial f (x) is
formed (see Definition 3.2 on p. 18) by assigning coefficients ai ∈ R
to the variable and obtaining summation over finite number of distinct
integers. Thus
X
f (x) =
cα xα , cα ∈ R, α ≥ 0
α
is said to be a univariatePpolynomial over R P
. If two polynomials are
dj xj , then two binary
ci xi and f2 (x) =
given such that f1 (x) =
j
i
operations “addition” and “multiplication” can be defined on these polynomials such that:
P
(a) Addition: f1 (x) + f2 (x) = ek xk , ek = ck + dk , ek ∈ R
k P
P
ci dj , gk ∈ R .
(b) Multiplication: f1 (x).f2 (x) = gk xk , gk =
k
i+j=k
A collection of polynomials with these “additive” and “multiplicative”
rules form a commutative ring with zero element and identity 1. A univariate polynomial f (x) obtained by assigning elements ci belonging to
the ring R to the variable x is called a polynomial ring and is expressed
as f (x) = R[x]. In general the entire collection of all polynomials in
x1 , . . . , xn , with coefficients in the field k that satisfy the definition of
a ring above are called a polynomial rings.
Designated P, polynomial rings are represented by n unknown variables
xi over k expressed as P := k [x1 , . . ., xn ] . Its elements are polynomials
20
3 Basics of Polynomial Theory
known as univariate when n = 1 and multivariate otherwise. The distinction between a polynomial ring and a polynomial is that the latter
is the sum of a finite set of monomials (see e.g., Definition 3.1 on p. 17)
and is an element of the former.
Example 3.4. Equations
 2
x1 + 2a12 x1 x2 + x22 + ao = 0
 x22 + 2b23 x2 x3 + x23 + bo = 0
x23 + 2c31 x3 x1 + x21 + co = 0
of Example 3.1 are said to be polynomials in three variables [x1 , x2 , x3 ]
forming elements of the polynomial ring P over the field of real numbers
R expressed as P := R [x1 , x2 , x3 ].
Polynomials that we use in solving unknown parameters in various problems, as we shall see later, form elements of polynomial rings.
Polynomial rings provide means and tools upon which to manipulate
the polynomial equations. They can either be added, subtracted, multiplied or divided. These operations on polynomial rings form the basis
of solving systems of equations algebraically as will be made clear in the
chapters ahead. Next, we state the theorem that enables the solution
of nonlinear systems of equations in geodesy and geoinformatics.
Theorem 3.1. Given n algebraic (polynomial) observational equations,
where n is the dimension of the observation space Y of order l in m unknown variables , and m is the dimension of the parameter space X, the
application of least squares solution (LESS) to the algebraic observation
equations gives (2l − 1) as the order of the set of nonlinear algebraic
normal equations. There exists m normal equations of the polynomial
order (2l − 1) to be solved.
Proof. Given nonlinear algebraic equations fi ∈ k{ξ1 , . . . , ξm } expressed as

f1 ∈ k{ξ1 , . . . , ξm }
 f2 ∈ k{ξ1 , . . . , ξm }


.

(3.2)

.


.
fn ∈ k{ξ1 , . . . , ξm },
with the order considered as l, we write the objective function to be
minimized as
3.2 Polynomial Rings
kf k2 = f12 + . . . . + fn2 | ∀fi ∈ k{ξ1 , . . . , ξm },
21
(3.3)
and obtain the partial derivatives (first derivatives of 3.3) with respect
to the unknown variables {ξ1 , . . . , ξm }. The order of (3.3) which is
l2 then reduces to (2l − 1) upon differentiating the objective function
with respect to the variables ξ1 , . . . , ξm . Thus resulting in m normal
equations of the polynomial order (2l − 1).
u
t
Example 3.5 (Pseudo-ranging problem). For pseudo-ranging or distance
equations, the order of the polynomials in the algebraic observational
equations is l = 2. If we take the “pseudo-ranges squared” or “distances
squared”, a necessary procedure in-order to make the observation equations “algebraic” or “polynomial”, and implement least squares solution
(LESS), the objective function which is of order l = 4 reduces by one
to order l = 3 upon differentiating once. The normal equations are of
order l = 3 as expected.
The significance of Theorem 3.1 is that all observational equations of
interest are successfully converted to “algebraic” or “polynomial” equations. This implies that problems requiring exact algebraic solutions
must first have their equations converted into algebraic. This will be
made clear in Chap. 13 where trigonometric nonlinear system on equations are first converted into algebraic.
3.2.2 Operations “Addition” and “Multiplication”
Definition 3.3 implies that a polynomial ring qualifies as a ring based
on the applications of operations “addition” and “multiplication” on
its coefficients. In this case, the axioms that follow the Abelian group
with respect to “addition” and the semi group with respect to “multiplication” readily follow. Of importance in manipulating polynomial
rings using operations “addition” and “multiplication” is the concept
of division of polynomials defined as
Definition 3.4 (Polynomial division). Consider the polynomial ring
k[x] whose elements are polynomials f (x) and g(x). There exists unique
polynomials p(x) and r(x) also elements of polynomial ring k[x] such
that
f (x) = g(x)p(x) + r(x),
with either r(x) = 0 or degree of r(x) is less than the degree of g(x).
22
3 Basics of Polynomial Theory
For univariate polynomials, as in Definition 3.4, the Euclidean algorithm employs operations “addition” and “multiplication” to factor
polynomials in-order to reduce them to satisfy the definition of division
algorithm.
3.3 Factoring Polynomials
In-order to understand the factorization of polynomials, it is essential
to revisit some of the properties of prime numbers of integers. This is
due to the fact that polynomials behave much like integers. Whereas
for integers, any integer n > 1 is either prime (i.e., can only be factored by 1 and n itself) or a product of prime numbers, a polynomial
f (x) ∈ k[x] is either irreducible in k[x] or factors as a product of irreducible polynomials in the field k[x]. The polynomial f (x) has to
be of positive degree. Factorization of polynomials play an important
role as it enables solution of polynomial roots as will be seen in the
next section. Indeed, the Groebner basis algorithm presented in Chap.
4 makes use of the factorization of polynomials. In general, computer
algebra systems discussed in Chap. 16 offers possibilities of factoring
polynomials.
3.4 Polynomial Roots
More often than not, the most encountered interaction with polynomials is perhaps the solution of its roots. Finding the roots of polynomials
is essential for most computations that we undertake in practice. As an
example, consider a simple planar ranging case where distances have
been measured from two known stations to an unknown station (see
e.g, Fig. 4.1 on p. 30). In such a case, the measured distances are normally related to the coordinates of the unknown station by multivariate
polynomial equations. If for instance a station P1 , whose coordinates
are {x1 , y1 } is occupied, the distance s1 can be measured to an unknown station P0 . The coordinates {x0 , y0 } of this unknown station
are desired and have to be determined from distance measurements.
The relationship between the measured distance and the coordinates is
given by
p
s1 = (x1 − x0 )2 + (y1 − y0 )2 .
(3.4)
Applying Theorem 3.1, a necessary step to convert (3.4) into polynomial, (3.4) is squared to give a multivariate quadratic polynomial
3.4 Polynomial Roots
s21 = (x1 − x0 )2 + (y1 − y0 )2 .
23
(3.5)
Equation (3.5) has two unknowns thus necessitating a second distance
measurement to be taken. Measuring this second distance s2 from station P2 , whose coordinates {x2 , y2 } are known, to the unknown station
P0 leads to a second multivariate quadratic polynomial equation
s22 = (x2 − x0 )2 + (y2 − y0 )2 .
(3.6)
The intersection of the two equations (3.5) and (3.6) results in two
quadratic equations ax20 + bx0 + c = 0 and dy02 + ey0 + f = 0 whose
roots give the desired coordinates x0 , y0 of the unknown station P0 . In
Sect. 4.1, we will expound further on the derivation of these multivariate
quadratic polynomial equations.
In Sect. 3.6, we will discuss the types of polynomials with real coefficients. Suffice to mention at this point that polynomials, as defined in
Definition 3.2 with the coefficients in the field k, has a solution ξ such
that on replacing the variable xα , one obtains
an ξ n + an−1 ξ n−1 + ... + a1 ξ + a0 = 0.
(3.7)
From high school algebra, we learnt that if ξ is a solution of a
polynomial f (x), also called the root of f (x), then (x − ξ) divides
the polynomial f (x). This fact enables the solution of the remaining
roots of the polynomial as we already know. The division of f (x) by
(x − ξ) obeys the division rule discussed in Sect. 3.2.2. In a case where
f (x) = 0 has many solutions (i.e., multiple roots ξ1 , ξ2 , ..., ξm ), then
(x − ξ1 ), (x − ξ2 ), ..., (x − ξm ) all divide f (x) in the field k.
In general, a polynomial of degree n will have n roots that are
either real or complex. If one is operating in the real domain, i.e., the
polynomial coefficients are real, the complex roots normally results in a
pair of conjugate roots. Polynomial coefficients play a significant role in
the determination of the roots. A slight change in the coefficients would
significantly alter the solutions. For ill-conditioned polynomials, such
a change in the coefficients can lead to disastrous results. Methods of
determining polynomial roots have been elaborately presented by [269].
We should point out that for polynomials of degree n in the field of
real numbers R however, the solutions exist only for polynomials up to
degree 4. Above this, Niels Henrick Abel (1802-1829) proved through
his impossibility theorem that the roots are insolvable, while Evariste
Galois (1811-1832) gave a more concrete proof that for every integer n
greater than 4, there can not be a formula for the roots of a general
nt h degree polynomial in terms of coefficients.
http://www.springer.com/978-3-540-23425-8