Download Chapter 10

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of trigonometry wikipedia , lookup

Symmetric group wikipedia , lookup

Group (mathematics) wikipedia , lookup

History of geometry wikipedia , lookup

Line (geometry) wikipedia , lookup

Euclidean geometry wikipedia , lookup

Transcript
10 Postulational Mathematics
Axiomatic Mathematical Systems
An axiomatic system begins with definitions of terms plus a set of axioms. One
constructs valid deductive arguments using the definitions and axioms as premises. If
the axioms are true, then every conclusion derived from them must be true as well.
One statement of the Pythagorean theorem is:
In any triangle, the sum of the squares of the lengths of two adjacent sides is equal to the square
of the length of the third side, if and only if the angle between the adjacent sides will be a right angle.
Egyptians and Babylonians found many triangles the lengths of whose sides agreed
with these two statements. They seemed to have believed that these statements were
true because in every case that they tried, a triangle that satisfied one side of the
biconditional also satisfied the other. They did not know that the statements were
universally and necessarily true. The Pythagoreans proved they were.
The earliest axiomatic mathematics is the 13 books of Euclid's Elements. The
Elements begins with 23 definitions, five "postulates," and five "common notions."
The postulates and common notions were considered to be "self-evident truths" – so
obviously true as to need no proof. If the postulates and common notions are true, then
every conclusion derived from them by valid deductive argument must be true.
P1.
P2.
P3.
P4.
P5.
Euclid's five postulates of plane geometry were:
A straight line may be drawn between any two points.
Any straight line may be extended indefinitely.
A circle may be drawn with any point as centre and any radius.
All right angles are equal.
If two straight lines lying in a plane are crossed by another straight line, and if the
sum of the internal angles on one side is less than two right angles, then the
straight lines will meet if extended sufficiently on the side on which the sum of
the angles is less than two right angles.
A Problem with Euclid's Postulates
L1
L3
Postulates P1 through P4 are easy to understand.
A
They may be "self-evident." However, postulate 5 is
much more complicated than the others. In the L2
B
diagram, the lines labelled L1 and L2 are the original
"two straight lines lying in a plane." Line L3 is
"another straight line" that crosses them. Angles A and B are the "internal angles on
one side" of line L3. The postulate says that if the sum of angles A+B is less than
AXIOMATIC MATHEMATICAL SYSTEMS
141
twice the size of a right angle, then if we extend lines L1 and L2 indefinitely, they'll
meet somewhere off to the right. The definition of "parallel lines" says "Parallel
straight lines are straight lines which, being in the same plane and produced
indefinitely in both directions, do not meet one another in either direction."
Even when the meaning of the postulate is clear, its truth is not self-evident. In the
diagram, L1 and L2 are clearly inclined toward each other. But what if the sum of
angles A+B were only very slightly less than two right angles, so that the lines would
have to be drawn millions of miles long before they met? Would they meet?
For two thousand years, geometers tried to derive the parallel postulate from the
other postulates. They failed, but they did prove that the parallel postulate was
logically equivalent to other statements. Some statements that are equivalent to P5 are:
E1. If a straight line intersects one of two parallel lines, it will intersect the other.
E2. Straight lines parallel to the same straight line are parallel to each other.
E3. Two straight lines that intersect one another cannot be parallel to the same line.
E4. Given a line L and a point P in a plane, where P is not on L, there is one and only
one line through P which is parallel to L.
If we could prove any of these, then (by equivalence) we would have proved postulate P5. They failed to find a proof using direct methods. They tried Indirect Proof.
If one derives a contradiction from the assumption that postulate P5 is false, then one
will have proved that the postulate is true.
Occasionally somebody would claim to have proved a contradiction, but either the
"proof" was invalid or the "contradiction" was not really a contradiction.
If there is no contradiction, we could invent new plane geometries by replacing
Euclid's fifth postulate with its denial. Lobachevskian geometry replaces postulate
E4 with "Given a line L and a point P in a plane, where P is not on L, there is more
than one line through P which is parallel to L." Riemannian geometry uses the statement "Given a line L and a point P in a plane, where P is not on L, there is no line
through P which is parallel to L."
Each of these postulates was proved to be logically independent of the other
postulates. A statement is logically independent of other statements if it can be false
when the other statements are all true. That entails that none of them can be proved by
deductive argument using the other postulates as premises. It was also shown that if
Euclidean geometry is consistent1 then Riemannian geometry and Lobachevskian
geometry are consistent too. By all purely mathematical and logical standards,
Riemannian and Lobachevskian geometries are "just as good" as Euclid's.
1
That is, if all of its postulates can be true on some interpretation, so it must be impossible to derive any
contradiction from them by any valid logical argument. (The meaning of "on some interpretation," is
explained below.)
142
10 POSTULATIONAL MATHEMATICS
A geometry (one of the many different geometries) came to be seen as one
particular set of undefined terms and basic postulates. The postulates specify the
relations between the terms.
Why "undefined terms"? Euclid's definitions of "point," "line," "surface," and so
on are not very clear anyway. What does it mean to define "point" as "that which has
no part," or "line" as "breadthless length"? These definitions require us to go on to
define "part" and "breadth" and "length." The words used in those definitions require
definition also. We're always going to have undefined terms. Why not stop at "point"
and "line," and leave them undefined?
Euclid's definitions were scrapped. The postulates were sanitized to use undefined
terms. For example, Euclid's first postulate says, "A straight line may be drawn
between any two points," which is equivalent to "Given any two points, there is at
least one straight line that contains them." To keep this abstract, we shouldn't talk
about "straight line," because people may interpret that to mean the path of a ray of
light or the streak of ink left by a pen following a ruler. Abstract geometry is not about
the paths of rays of light or streaks of ink. Using an undefined word like prosfo would
be better. Similarly with "point," where we might say clamit instead. The relation
"contains" is tied to the intuitive interpretation of "point" and "line" so we use another
undefined word (e.g., gragga). Now we can restate the postulate as "Given any two
clamits there is at least one prosfo such that both clamits have a gragga relation to that
prosfo." Better yet, we can replace the verbal formulation with a symbolic one, like:
∀x∀y((Cx ∧ Cy) ⊃ ∃z(Dz ∧ (Gxz ∧ Gyz))). The parallels postulate E4 can be
replaced with "Given any prosfo P1 and a clamit C1 that does not have the gragga
relation to P1, there is one and only one prosfo P2 that has the gragga relation to C1
such that, no matter how much we nichkak2 P1 and P2, there will never be a clamit
that has the relation gragga to both P1 and P2."
What does it mean to say that something is a clamit? What does it mean to say
that some x has the property C or that there is a G relation between two things? At this
level of abstraction, it doesn't matter. We can say that the theorems of a geometry are
true of any things, properties, operations and relations that satisfy the postulates. If we
can interpret the postulates so that they state truths about some set of things, then the
theorems will also be true of that set of things on that interpretation. The mathematician can concentrate on deriving theorems without caring about what sort of
things she is dealing with. She can also focus on the properties of the system, in
abstraction from questions about the things the system describes.
David Hilbert created such an abstract axiomatic geometry. In the process, he
showed that some of Euclid's proofs were invalid. Euclid had assumed things that
were not "contained" in the premises (axioms and postulates) he was supposed to be
using.
2
I cannot say "extend" because that is likely to be interpreted.
POSTULATIONAL SYSTEMS
143
One can interpret Hilbert's abstract geometry in terms of Euclidean lines and
points. On that interpretation it is a cleaned-up and corrected version of Euclid's
geometry. One can also interpret it differently (e.g., if we interpreted clamit to mean
line and prosfo to mean point) to give different geometries. Similar abstraction and
interpretation could be done with the non-Euclidean geometries. We can create all
sorts of abstract "geometries" unrelated to anything that had been done before.
Postulational Systems
Pure (or abstract) mathematics is the construction and investigation of
postulational systems.
A postulational system consists of a set or sets of undefined terms, predicates,
relations and operations, and a set of unproved basic statements that establish
relationships between these terms. These basic statements are just assumed as
postulates of the system.
The basic statements don't have to be true or even meaningful (they may contain
terms that have no meaning other than what the postulates assert about them).
Consider symbolic logic as a pure postulational system. Give up the idea that the
A's and B's "stand for" statements. Just accept them as the things we manipulate with
our rules. Connectives and operators would not be defined in terms of truth-values T
and F. Any kind of symbolic constants (e.g., and ) can be used in the tables that
define them. We don't have to be limited to just two possible values – we can
construct alternative "logics" using as many values as we like. We just specify the
formation rules and transformation rules of a system, and that will be a logic.
Formation rules are rules for constructing well-formed formulas (WFFs). We
could distinguish A, B, C, … as simple WFFs, and then describe how to make
compound WFFs out of simple WFFs plus connectives. The value of a compound
WFF would be a function of the values of its atomic components, but we wouldn't
specify that these values are truth-values.
The transformation rules specify allowable transformations on WFFs,
describing what WFF you can get by transforming one or more other WFFs. To state
the transformation rules, we'd create a symbolic shorthand using a special kind of
variable to represent the form of a WFF (like our p, q, r, …) and rules about when a
WFF is a "substitution instance" of a form. Our derivation rules are transformation
rules.3 We would also state some postulates or specify a form whose substitution
instances are postulates (such as ~(p ∧ ~p)). The postulates are just postulated in the
system. We don't commit ourselves to saying that they are "true."
3
Although we'd have some problem stating rules CP and IP in such an abstract logic.
144
10 POSTULATIONAL MATHEMATICS
Here are a couple of pure postulational systems from Douglas Hofstadter's Gödel,
Escher, Bach: An Eternal Golden Braid.4
Hofstadter's MIU System
Hofstadter created the MIU system to illustrate the idea of postulational systems.
The only formation rule of the MIU system is: a WFF is any string containing only
the letters M, I, and U. There are four transformation rules, and all of them are "oneway" rules (like implicational rules). The rules are:
Rule I: Given a WFF whose last letter is I, you can add a U at the end. For example,
from MMI you can get MMIU.
Rule II: Given a WFF of the form Mx,5 you may repeat x to get Mxx. From MIU you
can get MIUIU, from MUM you get MUMUM and from MU you can get MUU, etc.
Rule III: If III occurs in a WFF, you can replace the III with U. So, from UMIIIMU you
get UMUMU and from MIIII you get MIU or MUI. From MIII you can make MU.
Note: these are "one-way" rules, so you cannot use Rule III to get MIII from MU.
Rule IV: If UU occurs in a WFF, you can drop the UU. So from UUU you can get U.
From MUUIII you can get MIII.
There is one postulate (initial premise or axiom) in this system. It is MI.
Hofstadter sets his readers a problem that he calls "the MU-puzzle." Starting from that
one postulate as a given WFF, can you derive MU?
Here's an example of a derivation in the MIU system. It resembles our propositional logic derivations, except that it doesn't mean anything.6 The goal is to derive the
theorem7 MUIIU.
(1)
(2)
(3)
(4)
(5)
(6)
(7)
4
5
6
7
MI
MII
MIIII
MIIIIU
MUIU
MUIUUIU
MUIIU
Axiom
1, Rule II
2, Rule II
3, Rule I
4, Rule III
5, Rule II
6, Rule IV
Douglas R. Hofstadter, Gödel, Escher, Bach: An Eternal Golden Braid (A Metaphorical Fugue on Minds and
Machines in the Spirit of Lewis Carroll) (New York: Basic Books, 1979). This is an amazing, wonderful
book. Gödel was a mathematician; Escher was a graphic artist/printmaker/painter; Bach was Johann
Sebastian Bach. The book is about mathematics, graphic art, music, philosophy, minds and brains, meanings,
computers and artificial intelligence, and lots more.
x here stands for any of M, I, or U or any string of them. It works something like our p, q, and r forms. That
is, Mx is not a WFF – it is the form of a WFF. Mx is any WFF whose first letter is M.
A former student asked (about this sentence) "Then what's the point?" The point emerges in the section on
"Interpretation of Formal Systems," below.
Any WFF that can be derived just from the basic axiom(s) or postulate(s) of a system is a theorem of
that system. Remember this definition!!
POSTULATIONAL SYSTEMS
145
Exercise on the MIU System
Play with the MIU system. Derive some theorems. Try to derive MU. If you cannot
derive MU, try to prove that it is impossible. The proof of impossibility cannot be done
within the MIU system; you have to "step outside the system" and do your reasoning in
traditional logic or math.
Hofstadter's pq- System
WFFs in the pq- system contain just three kinds of symbols: p, q, and - (the
hyphen). There are infinitely many axioms. Hofstadter defines an axiom-schema,
which is the form of an axiom. Every WFF of that form is an axiom of the system.
Definition: xp-qx- is an axiom, where x consists only of hyphens. Note that x is
not part of a WFF, but is used to stand for a string of hyphens. The same string of
hyphens (i.e., the same number of hyphens in the string) replaces both xs.
From this definition, we can see that -p-q-- is an axiom (substituting a single
hyphen for x). So are --p-q--- and ---p-q---- and ----p-q-----. Any axiom of the system
is also a theorem of the system.
There is only one transformation rule (Hofstadter calls it a "rule of production"
or "production rule") in this system.
Rule: Suppose x, y, and z stand for particular strings of hyphens. Suppose that xpyqz
is a theorem. Then xpy-qz- is a theorem. If x is '--' and y is '---' and z is '-', then the rule
tells us: If --p---q- is a theorem (is it?), then --p----q-- is a theorem.
What about formation rules? A derivation must start with an axiom or a theorem.
If it begins with a theorem, that theorem will have to have been derived from an axiom
or a theorem. Ultimately we arrive at premises that are axioms only. No axiom
contains more than one or less than one p or q. Our transformation rule does not allow
us to add a p or a q. So every WFF must contain exactly one p and exactly one q. By
similar argument, since the transformation rule never permits derivation of a WFF that
has fewer hyphens than the WFF it's derived from, and the simplest axiom is -p-q--, so
every WFF in a derivation will begin with a hyphen, and the p and q will be separated
by at least one hyphen, and the WFF will end with a hyphen. The axiom schema and
the transformation rule determine the only allowable formulas in derivations. We don't
need explicit formation rules.
Exercise on the pq- System
Play around with the system. Get an interesting axiom using the schema and use
the transformation rule to derive theorems.
Interpretation of Formal Systems
Now we get to the point of all this. MI is an axiom (postulate) of the MIU system.
Is it true? Is it false? The theorems of the system are derived according to the
146
10 POSTULATIONAL MATHEMATICS
transformation rules from the axiom MI. In a sense, then, they are conclusions of valid
arguments using MI as a premise. Are they true or false? M and I and U were never
defined. We don't know what these symbols denote or connote. They are undefined
terms or primitives of the system.
Are the axioms of the pq- system true or false? We don't know what p or q or –
stand for, so we don't know what (if anything) is stated by "---p-q----." The theorems
can be validly (i.e., according to the transformation rules of the system) derived from
the axioms, but are the axioms true or false?
WFFs in these systems don't make statements until we interpret the system.
One system is considered to be an interpretation of another if and only if one can
map one system onto the other in such a way that each part8 of one system
corresponds to one part of the other, where "correspond" means that the two parts
play similar roles in their respective systems. When such a mapping exists, we say
that one system is an interpretation of the other. Mathematicians say that the two
systems are isomorphic, or that there is an isomorphism between them.
Hofstadter proposes that we consider the isomorphism or interpretation given by
the mapping: p ↔ plus; q ↔ equals; - ↔ one; -- ↔ two; --- ↔ three; etc. On this interpretation, ---p-q---- maps to "three plus one equals four." It and all of the axioms and
theorems are true on this interpretation.
Another interpretation is: p ↔ horse; q ↔ happy; - ↔ apple. Then -p-q-- translates
as "apple horse apple happy apple apple." On this interpretation, axioms and theorems
are no more true than non-axioms and non-theorems. Hofstadter says, "A horse might
enjoy 'happy happy happy apple horse' (mapped onto qqq-p) just as much as any
interpreted theorem."
A problem with interpretation is that one might "read too much into" the original
system, based on knowledge of the system to which it is mapped. For example, on the
"plus, equals, one, two, …" interpretation, someone might think that --p--p--q-----should be a theorem because "two plus two plus two equals six" is true. That would be
a mistake. --p--p--q------ is not a theorem. It's not even a WFF in the system.
Is "plus, equals, one, two, …" the interpretation of the pq- system? Hofstadter
gives the mapping: p ↔ equals; q ↔ taken from; - ↔ one; -- ↔ two; etc. On this
interpretation, --p---q----- means "two equals three taken from five." This is another
interpretation. Again, all the axioms and every theorem are true on this interpretation.
Which is the real meaning of the string? Is there even any sense in asking about the
real meaning?
8
"Part" includes the objects that are components of the two systems as well as the relations between and
operations on those objects.
PROPERTIES OF A POSTULATIONAL SYSTEM
147
Properties of a Postulational System
Algebra is usually taught as if it were just symbolic arithmetic. That is, one learns
to use letters as variables to stand for numbers. Modern algebra is more abstract. The
things the variables stand for don't have to be numbers. We can explore arithmetic
operations and relations without thinking about numeric interpretations. The freedom
that results from seeing mathematics as the construction of postulational systems
allows us to construct algebras.
An algebraic structure or algebraic system (or an algebra, where one can describe
many different algebras) is a system consisting of a set of elements or objects, together
with operations on and relations between those objects. Ordinary arithmetic on
integers is one algebra. Changing the set of objects to rational numbers gives a
different (but similar) algebra. Defining different arithmetic operators or relations
gives other algebras. Mathematicians explore the properties of various abstract
algebras, including the properties that many different algebras share.
Although an algebraic system can be freely invented with no particular interpretation, it is easiest to construct an algebraic system based on something. We can then
illustrate how one explores the properties of the system that results.
Because permutations are important in probability theory later in this course, we'll
explore an algebra of permutations.
Permutations
The order in which the members of an ordered pair are listed is important. We can
extend the notion of an ordered pair and talk about ordered triples (a list of three
elements, where the order of the elements matters) or ordered quadruples (four elements). In general, we can discuss ordered n-tuples (ordered lists of n elements).
Since order matters, two ordered lists are distinct if the elements are listed in
different orders. Each distinct way of ordering the n elements of an n-tuple is called a
permutation of those elements. "Permutation" is also the word for the operation of
permuting the elements of an ordered n-tuple into a different ordering.
How many permutations of two elements a and b can we distinguish? The elements can be put into ordered pairs as a, b or as b, a . If there are three elements a,
b and c, they can be ordered in six distinct ways, so there are six permutations of three
elements: a, b, c , a, c, b , b, a, c , b, c, a , c, a, b , and c, b, a .
Exercise on Permutations
1.
2.
Write out all the permutations of four elements a, b, c and d. How many permutations are there?
Write your answer from question 1 into the following table. Is there a pattern?
Describe the pattern in a conjecture about the relation between the number of
148
10 POSTULATIONAL MATHEMATICS
elements and the number of permutations. Your conjecture should be one that
would hold for all collections (sets) of elements of any finite size. Can you prove
that your conjecture is universally true?
Number of Elements
Number of Permutations
1
2
3
4
5
6
7
8
9
1
2
6
120
720
5040
40320
362880
Permutations as Operations
We develop a notation. Let's say that (2 1) specifies a permutation of an ordered
pair that puts the first element into the second place and the second element into the
first place. Let's call that permutation B. So B = (2 1). When we apply the permutation
B to the ordered pair a, b we get b, a . We symbolize the operation of applying the
B-permutation to the ordered pair a, b as B( a, b ). We say B( a, b ) = b, a ,
pronounced "B of a, b equals b, a ." Another permutation on ordered pairs would
be (1 2). Call it A, so A = (1 2). This is the null permutation or identity permutation. It
leaves the order unchanged. A( a, b ) = a, b .
This is an example of "thingification." We started with ordered pairs and treated
them as things. Then we treated operations on ordered pairs as things, and gave the
operations names, calling them "A" and "B."
Working on triples is more interesting. Call the identity permutation A, again. So
A = (1 2 3). Let B = (1 3 2), C = (2 1 3), D = (2 3 1), E = (3 1 2) and F = (3 2 1). If
we apply the F permutation to a, b, c we get c, b, a . If we apply the F permutation
to c, b, a we get a, b, c . So applying the F permutation twice is like applying the
identity permutation. Also, D( a, b, c ) = b, c, a . D( b, c, a ) = c, a, b . So, by
applying D twice to a, b, c we get c, a, b , which is the same thing we'd get by
applying E to a, b, c . We can describe this as D(D( a, b, c ) = E( a, b, c ). This
says that applying D to the result of applying D to a, b, c is the same as the result of
applying E to a, b, c . We pronounce it "D of D of a, b, c equals E of a, b, c ."
What happens if you apply F to the result of applying B to a, b, c ? Applying B
to a, b, c gives you a, c, b . Applying F to a, c, b gives b, c, a . This is just
what you'd get from D( a, b, c ). So F(B( a, b, c )) = D( a, b, c ).
What if we didn't start with a, b, c ? What is the result of applying F to the result
of applying B to b, a, c ? B( b, a, c ) = b, c, a . F( b, c, a ) = a, c, b . But a,
c, b is just what you'd get from D( b, a, c ). Check it and see. What's more, it doesn't
PROPERTIES OF A POSTULATIONAL SYSTEM
matter which of the six ordered triples you start with:
Applying F to the result of applying B to any triple is the
same as applying D to it. If we limit our universe of
discourse to just the six ordered triples (all possible
permutations of a, b, c ), we can say: ∀x(F(B(x)) =
D(x)). For any x (of the six triples), F of B of x is equal to
D of x.
149
◦
A
B
C
D
E
F
A
A
B
C
D
E
F
B
B
A
D
C
F
E
C
C
E
A
F
B
D
D
D
F
B
E
A
C
We saw that D(D( a, b, c ) = E( a, b, c ). You can check to see that the
generalization ∀x(D(D(x)) = E(x)) is true for any (x) of the six triples. Again, it does
not matter what triple you start with: applying D to the result of applying D to any
triple will give the same result as just applying E to that triple.
Applying a permutation to the result of applying a permutation to a triple is an
operation (as addition and multiplication are operations). We call the operation
"composition" of permutations.
Since the effect of composition of permutations is independent of the particular
triple we start with, so the triples can "drop out" of the discussion and we can concentrate on the permutation operations (as things). We can say "B followed by F is the
same as D" and "D followed by D is the same as E" and so on. Let's represent "is the
same as" with the = symbol. Instead of "B followed by F" let's use B◦F. The ◦ symbol
stands for the operation of composing two permutations (i.e., composition of two
permutations). The ◦ symbol is an operator-symbol, like + or × in ordinary arithmetic.
Now we can say B◦F = D and D◦D = E.
We summarize the statements about the results of all possible compositions of
permutations on triples in a table, rather like a multiplication table. In the table above,
we find B◦F by looking in the column on the left for B and then following that row
across to the column with F at its top. We see that B◦F = D.
This is an algebraic system consisting of six elements A, B, C, D, E and F and the
operation ◦. Its creation was motivated by the notion of permutations of ordered
triples. Now that we've got it, we can study it on its own, unrelated to its origins.
We said that our operator symbol ◦ was "like + or × in ordinary arithmetic." How
much (or little) does it resemble those common arithmetic operators? For example, is
our ◦ operator associative? In ordinary arithmetic, + is associative (for any numbers x
and y and z, (x + y) + z = x + (y + z)). Is that true for our ◦ operator? That is, among
the six "things" A, B, C, D, E and F described in our table, is it true that
∀x∀y∀z((x◦y) ◦ z = x ◦ (y ◦ z)). Now we're treating the operation ◦ as a thing!
Exercise on Permutation-pairs
1.
Explore the conjecture that X◦(Y◦Z) always gives the same result as (X◦Y)◦Z
(where X, Y and Z are variables standing for permutations). Try B◦(B◦B),
C◦(D◦D), F◦(E◦C), D◦(C◦D) and F◦(F◦E). Five successful tests do not prove that
E
E
C
F
A
D
B
F
F
D
E
B
C
A
150
2.
3.
10 POSTULATIONAL MATHEMATICS
the conjecture is true – we'd either have to try all the 216 possible combinations or
else derive a formal deductive proof – but it may encourage you to believe that
the conjecture is true.
Check if the operation of permuting an ordered pair and then permuting the result
(i.e., composition of permutations) is commutative.9 Go through the table and find
all those pairs where X◦Y = Y◦X. Note any patterns (if any) that you discover.
We listed and named the two permutations of two elements. Make a
"multiplication table" showing the product of pairs of permutations of ordered
pairs.
Studying General Properties of a System
After you have done #3 in the exercise, you have a table for the compositions of
permutations on ordered pairs. The table for the compositions of permutations on
triples is given above. There are 24 different permutations of four elements. The table
of compositions of permutations on ordered quadruples is called "Table of 'Products'
of Permutations on Quadruples," below.
◦
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
A
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
9
B
B
C
A
J
I
K
D
F
L
G
H
E
N
P
X
M
U
S
V
O
W
R
Q
T
C
C
A
B
G
L
H
J
K
E
D
F
I
P
M
T
N
W
V
R
X
Q
S
U
O
D
D
L
H
E
A
B
K
J
G
C
I
F
O
W
Q
S
M
U
X
R
T
N
V
P
E
E
F
J
A
D
L
I
C
K
H
G
B
Q
V
M
X
O
T
P
U
R
W
N
S
F
F
J
E
H
K
G
A
L
B
I
C
D
V
X
S
L
R
P
W
M
N
T
O
U
G
G
I
K
L
C
A
F
D
J
B
E
H
T
U
W
R
P
Q
O
V
X
M
S
N
H
H
D
L
K
F
J
C
I
A
E
B
G
S
O
R
W
V
N
U
P
M
X
T
Q
I
I
K
G
B
J
E
L
A
H
F
D
C
U
R
N
T
X
O
M
W
S
Q
P
V
J
J
E
F
I
B
C
H
G
D
A
L
K
X
Q
U
V
N
W
T
S
O
P
R
M
K
K
G
I
F
H
D
B
E
C
L
A
J
R
T
V
U
S
M
Q
N
P
O
X
W
L
L
H
D
C
G
I
E
B
F
K
J
A
W
S
P
P
T
X
N
Q
V
U
M
R
M
M
P
N
Q
O
S
U
V
T
W
R
X
A
C
E
B
D
K
F
I
G
H
J
L
N
N
M
P
U
X
V
W
R
O
Q
S
T
B
A
I
C
J
H
K
L
D
F
G
E
O
O
S
W
M
Q
X
T
N
R
V
U
P
D
H
A
L
E
I
B
G
K
J
C
F
P
P
N
M
W
T
R
Q
S
X
U
V
O
C
B
L
A
G
F
H
E
J
K
D
I
Q
Q
X
V
O
M
P
R
W
U
N
T
S
E
J
D
F
A
G
L
K
I
C
H
B
R
R
U
T
S
V
Q
P
O
N
X
M
W
K
I
H
G
F
A
D
C
B
E
L
J
S
S
W
O
V
R
U
M
X
P
T
N
Q
H
L
F
D
K
B
J
A
C
I
E
G
T
T
R
U
P
W
O
X
M
V
S
Q
N
G
K
C
I
L
E
A
J
F
D
B
H
An operation is commutative if and only if X◦Y = Y◦X for every possible choice of X and Y.
U
U
T
R
X
N
M
S
Q
W
P
O
V
I
G
J
K
B
D
E
H
L
A
F
C
V
V
Q
X
R
S
W
N
T
M
O
P
U
F
E
K
J
H
C
G
B
A
L
I
D
W
W
O
S
T
P
N
V
U
Q
M
X
R
L
D
G
H
C
J
I
F
E
B
K
A
X
X
V
Q
N
U
T
O
P
S
R
W
M
J
F
B
E
I
L
C
D
H
G
A
K
PROPERTIES OF A POSTULATIONAL SYSTEM
151
It doesn't matter that the tables were originally made to show the result of
successive permutations. We can treat them as defining the result ("product") of a
binary operation (◦) on undefined mathematical objects. In the tables, the operator is
indicated as ◦ (in the top left corner), to help you stop thinking of the table as showing
composition of permutations. Think of it as just an operation called "◦" on things
called "A," "B," and so on. The tables define the operations, just as truth tables defined
the logical operators and addition and multiplication tables define those operations.
Mathematics studies the patterns that emerge among the tables. See if you can
find any properties that all three tables have in common.
Table of "Products" of Permutations on Quadruples
Some of the things all of these tables have in common are:
1.
2.
3.
4.
5.
6.
7.
Each table deals with a certain set of objects. Every object is listed in the guide (top) row
and guide (leftmost) column of the table.
Every ordered pair of objects (e.g., A and F in that order are an ordered pair) in the set
produces exactly one "product."
Every "product" is a member of the original set of objects.
Every member of the set of objects appears exactly once in each row and each column.
There is a column that looks exactly like the guide column.
There is a row that looks exactly like the guide row.
The "products" have the associative property. That is, for all X, Y, and Z in the set,
X◦(Y◦Z) = (X◦Y)◦Z.
Exercise on Properties of Tables
1.
2.
10
11
Check that the table of compositions of permutations of pairs is associative.
Figure out and compare the following pairs of complex products: (A◦A)◦A and
A◦(A◦A); (A◦A)◦B and A◦(A◦B); (A◦B)◦A and A◦(B◦A); (B◦A)◦A and B◦(A◦A);
(A◦B)◦B and A◦(B◦B); (B◦A)◦B and B◦(A◦B); (B◦B)◦A and B◦(B◦A); (B◦B)◦B
and B◦(B◦B). If, in all possible cases, the pairs of products are equal, then the
composition operation has the associative property.
Composition of permutations of triples and of quadruples also have the
associative property, but it would take a long time to check all the cases.10 Using
the table of composition of permutations on quadruples, show that (F◦J)◦M =
F◦(J◦M), (C◦E)◦L = C◦(E◦L), (R◦M)◦T = R◦(M◦T), and (X◦H)◦Q = X◦(H◦Q).11
Each case involves three (not necessarily different) permutations. There are 8 = 23 possible combinations of
three permutations of pairs; there are 63 = 216 possible combinations of three permutations of triples; there
are 243 possible combinations of three permutations of quadruples. Do you understand why?
Checking less than all of the possible combinations does not prove that the operation is associative. You
could check them all, but see previous footnote. Mathematicians would prove that it is associative by a
deductive proof that would apply to permutations of any size of n-tuple.
152
3.
4.
10 POSTULATIONAL MATHEMATICS
Go through the table of composition of permutations of quadruples and find all
those pairs where X◦Y = Y◦X. Note any patterns (if any) that you discover.
Is composition of permutations of triples commutative? Is composition of
permutations of quadruples commutative? Give reasons for your answers.
Groups
One of the simplest algebraic systems is called a group.
A group is an algebraic system consisting of a nonempty set (call it G) of elements
and one binary operation12 that satisfies four postulates (see below). One
convention for describing an algebraic system is to put the set(s) and operator(s)
inside angle-brackets, as G, ◦ .
The members (elements) of the set G (which we denote, below, by the small
letters a, b, c, …) are undefined objects, and the operation ◦ is an undefined
operation. The properties of the objects and the operation are given explicitly in the
postulates.
The postulates that must be satisfied if G, ◦ is a group are:
G1: To every pair of elements a and b of G, given in the stated order, there corresponds a definite unique element of G, denoted by a◦b. This is called the law
of closure for the operation ◦. This property of groups is called the "closure
property." We say that a group is closed for an operation ◦ if and only if ◦
exhibits the closure property.
G2: For all a, b, and c in G, (a◦b)◦c = a◦(b◦c). This is the associative law for the
operation ◦. It describes the associative property of groups.
G3: There exists an element i in G such that, for any a in G, a◦i = i◦a = a. The
element i is called an identity element of the group. We can prove that a group
will never contain more than one identity element.
G4: For each element a of G there is an element a of G such that a◦a = a ◦a = i.
The element a is called the inverse of a. We can prove that an element a of a
group possesses only one inverse element.
If the following postulate is also satisfied,13 the group is called a commutative or
Abelian group.
G5: For all a, b in G, a◦b = b◦a. This is the commutative law for operation ◦.
12
13
I've called it ◦, so that we don't confuse it with any of the usual symbols for arithmetic operators.
That is, if the operation ◦ is commutative.
GROUPS
153
A group for which postulate G5 does not hold is called a non-Abelian group. If
the set G of a group contains only a finite number of distinct elements, the group is
called a finite group; otherwise it is called an infinite group.
Our three tables of compositions of permutations are examples of finite groups.
Another way of saying this is to say that they are interpretations of the postulates that
define a finite group. The identity element in each table is the element called A; that is,
for every X in G, X◦A = A◦X = X, so our tables satisfy postulate G3. Postulate G4
requires that the identity element must appear at least once in every row, in which case
the element in the guide row at the top of that column will be the inverse element for
the element in the guide column at the left end of that row, and vice versa. Thus, in the
big table, C is the inverse element for B (i.e., B = C) because B◦C = C◦B = A.
We can easily check that the table of composition of permutations on pairs
represents an Abelian group, but the table of composition of permutations on triples
does not (because, for example, D◦B = F, but B◦D = E, and C◦E = D, but E◦C = F,
and E◦F = C, whereas F◦E = B. A glance at the table of composition of permutations
on quadruples shows that B◦D = L, but D◦B = J, so it is not an Abelian group.
Exercise on Group Properties
1. Prove that a group can never have more than one identity element.
2. Prove that an element of a group cannot have more than one inverse.
3. Consider a set of two objects {E, O} (think of E as standing for "even" and O as
standing for "odd"). Make a table for an operation ◦ on {E, O}, where the
"product" is odd or even depending on whether the sum of the two operands14 is
odd or even. For example, the sum of two even numbers is an even number, so
E◦E = E, and the sum of an even plus an odd number is odd, so E◦O = O. Does
this table specify a group (i.e., does {E, O}, ◦ satisfy the definition of "group")?
Is the table like (isomorphic to) any group we have already considered?
4. ("Clock arithmetic") Imagine a "clock" having the numbers 0, 1, 2, 3, 4, 5, 6
equally spaced on its face. To find the sum of any two numbers in that set of
numbers, we start at the first number and move clockwise around the dial a
number of spaces equal to the second number. Thus, the "sum" of 2+3 is found
by starting at 2 and moving clockwise three places (to 3, then 4, then 5). The
"sum" is 5. 4+5 is found by starting at the 4 position and moving through 5, 6, 0,
1, 2, giving a "sum" of 2. Write out the table of clock-arithmetic "sums" formed
from the set {0, 1, 2, 3, 4, 5, 6}. Does this table define a group? Why or why not?
14
When you use an operation (like addition or multiplication or ◦) on one or more objects, the objects are called
the operands of the operation.
154
5.
4.
10 POSTULATIONAL MATHEMATICS
Each of the following algebraic systems satisfies the definition of an infinite
group. In each example, name the identity element and specify the inverse x for
each element x.
a) ¯, + (i.e., the set of integers and the operation of ordinary arithmetic
addition).
b) ¦-, · (i.e., the set of rational numbers excluding 0 and the operation of
ordinary arithmetic multiplication). Why was it necessary to exclude 0?
Explain why the following algebraic systems are not infinite groups:
a)
¯, · (i.e., the set of integers and the operation of ordinary arithmetic
multiplication).
b) £, + (i.e., the set of natural numbers with ordinary addition).
In this exercise we saw that parts of our ordinary arithmetic systems can be
studied as interpretations of the abstract algebraic systems called groups. We can also
dream up new "arithmetics" that are groups.
We can go on to study the abstraction itself. We can derive theorems from the
postulates, and know that any system that satisfies the definition of a group will be
such that all the theorems will be true for that system. Moses Richardson's
Fundamentals of Mathematics gives a number of theorems and their proofs. Many
books and articles in scientific journals are devoted to the study of groups.
Beyond this, mathematicians study more complicated abstract algebraic systems
like rings, subrings, ideals, integral domains, ordered integral domains, and fields.
These involve more than one operator and additional postulates and properties. One or
the other system will describe almost any arithmetic system, including those that we
invent with our new kinds of numbers. In studying such abstract systems, we can
study our ordinary arithmetic at a level behind the mere numbers, and discover
isomorphisms between arithmetic systems and other deductive structures.
11 Estimating Arithmetic
Numeracy
Most children enjoy their first encounters with arithmetic. Mastering the basic operations
is “empowering” – it gives one power over numbers. After years of practicing these operations
over and over – adding long columns of numbers or multiplying long strings of digits – it
becomes tedious. Students come to associate mathematics with boring drill. Exploration of
mathematical concepts bogs down because of the need to perform calculations. Calculators
and computers permit one to study mathematical ideas and free one from long, error-prone
computations. That’s good.
On the other hand, the use of calculators often leads people to be alienated from numbers.
They can get numeric results quickly, but they often fail to understand what the numbers
mean. When a student takes out a calculator to figure out what 12% of 200 is, or to figure
what percentage 20 is of 1000, she demonstrates innumeracy. That’s bad.
To divide one integer by another integer and report the result to eight or ten decimal
places is innumerate and ignorant. Such precision is utterly bogus and meaningless, but the
calculator or computer gives results to that precision. People learn to trust the calculator, even
when it seems to tell them that 42 is 0.251497005988% or 2.51497005988% of 167. Numerate
people would notice that these answers are obviously wrong.
Numerate people know how to do manual calculations, and how to simplify the task of
manual calculation. They should also know how to use calculators, without losing the “handson” familiarity and “feel” for numbers that comes from intimate acquaintance. To be numerate
requires that one have a “vocabulary” of numbers against which one can compare other numbers that one encounters. One should know (approximately) the population of one’s city, one’s
province or state, one’s country, and the world. One should have a bunch of distances (how
wide is North America? how far away is Europe? the moon? the sun? the nearest star?). Sizes
of commonly-met things (cars, bricks, books, packages of 500 sheets of paper, your hand,
your height, the height of one storey of a building) are useful for comparisons.
One should exercise this vocabulary frequently. When driving, estimate (mentally
calculate) the distance traveled, the average speed, the distance remaining to the destination.
While shopping, practice keeping a running total in one’s head. Exercising one’s “arithmetic
muscles” leads to a familiar ease with numbers, just as frequent reading and talk improve