Download Lecture 2

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Spatial Statistics and Spatial Knowledge
Discovery
First law of geography [Tobler]: Everything is related to everything, but
nearby things are more related than distant things.
Drowning in Data yet Starving for Knowledge [Naisbitt -Rogers]
Lecture 2 : Basic Statistics with R
Pat Browne
Combinatorial Analysis
•
•
•
•
•
Counting
Permutations: the order does matter
Combinations: the order doesn't matter
The Pigeonhole Principle
Examples
Combinatorial Analysis
• Combinatorial analysis deals with
permutations of a set or bag and
combinations of a set, which lead to
binomial coefficients and the Binomial
Theorem.
Rules of Counting
• Rule of sums: The size of the union on n finite
pair wise disjoint sets is the sum of their sizes.
• Rule of product (a.k.a. the fundamental principle
of counting): The size of the cross product of n
sets is the product of their sizes.
• Rule of difference: The size of a set with a
subset removed is the size of the set minus the
size of the subset.
Product Rule Example
• If each license plate contains 3 letters and
2 digits. How many unique licenses could
there be?
• Using the rule of products.
• 26  26  26  10  10 = 1,757,600
All Permutation of a set
• A permutation of a set of elements is a linear
ordering (or sequence) of the elements e.g.
• {1,4,5}
• Example Permutation 1 : 1, 4, 5
• Example Permutation 2 : 1, 5, 4
• An anagram is a permutation of words.
• There are n  (n – 1)  (n - 2) .. 1
permutations of a set of n elements.
• This is factorial n, written n!
All Permutation of a set
• There are:
• n  (n – 1)  (n - 2) .. 1
• permutations of a set of n elements.
• Calculated using factorial n, written n!
• In R this is written factorial(3)
• In R we can calculate the number of 2permutations from 4 items as:
• factorial(4)/factorial(4-2)
Some Permutation of a set
• More generally permutation of size r from
a set of size n.
• (16.4) P(n,r) = n!/(n-r)!
• The number of 2 permutations of BYTE is
• P(4,2) = 4!/(4-2)! = 4  3 = 12
• BY,BT,BE,YB,YT,YE,TB,TY,TE,EB,EY,ET
• P(n,0) = 1
• P(n,n-1) = P(n,n) = n!
Order
• P(n,1) = n
matters
3-permuations of set
• Calculate the number of 3-permutations of the
word HAND.
• There are 4 objects and taking 3 at a time.
• 4Pick3 = 4! / (4-3)! = 4! / 1! = 24 / 1 = 24.
• Would you consider HDA and HAD as equivalent
permutations?
• They are not equivalent, because each
permutation is a sequence, hence < HDA >
and < HAD> are considered distinct and both
are included in the permutation.
r-Permutation with repetition of a
set
• An r-permutations is a permutation that allows
repetition. Here are all the 2-permutation of the
letters in SON:
SS,SO,SN,OS,OO,ON,NS,NO,NN.
• Given a set of size n, in constructing an rpermutation with repetition, for each element we
have n choices.
• (16.6) The number of r permutations with
repetition of a set of size n is nr. Note, repetition
is allowed in the permutation but does not occur
in the original set.
r-Permutation with repetition of a
set
• (16.6) The number of r permutations with
repetition of a set of size n is:
• nr
• repetition is allowed in the permutation but
not in the original set, because it is a set.
Permutation of a bag
• A bag may have duplicate elements.
• Transposition of equal (or duplicate) elements in
a permutation does not yield a different
permutation e.g. AA=AA.
• Hence, there will be fewer permutations of a bag
than a set of the same size. The permutations
on the set {S,O,N} and the bag M,O,M
are:
• {S,O,N} = SON,SNO,OSN,ONS,NSO,NOS
• M,O,M = MOM,MMO,OMM
Permutation of a bag:
General Rule
• (16.7) The number of permutations of
a bag of size n with k distinct
elements occurring n1, n2, n3,.. nk
times is:
n!
n1!  n2!  n3! ...  nk!
Calculating permutation of a set
and a bag
• In R calc. number of 3-perms of set
{S,O,N}
• factorial(3)
• R calc. number of 3-perms of bag
M,O,M
factorial(3) / (factorial(1) * factorial(2))
• O occurs once, M twice, gives 3
Permutation of a bag
• Consider the permutation of the 11 letters of
MISSISSIPPI. M occurs 1 time, I occurs 4 times, S
occurs 4 times, and P occurs 2 times.
Calculate this in R as follows:
factorial(11)/(factorial(4)*factorial(4)*factorial(2))
Test the following in R:
factorial(0) == factorial(1)
Permutation of a bag
• O a single permutation
• M1,O, M2 , label the two copies of M.
• If we choose to distinguish the Ms we get.
M1M2O,M2M1O,M1OM2,M2OM1,OM1M2,OM2M1
• If we do not distinguish the Ms we get
• M,O,M = MOM,MMO,OMM
Permutation of a bag
• Calculate the number of distinguishable
permutations of the word LITTLE.
• Basic formula is
• N = total number of letters
• ni is frequency of each different letter
• Note double counting of letters T and L,
• 6!/2!*2!*1!*1!= 720/4 = 180
Permutations in R
•
On previous slides we calculated the number of permutauions. Below we
calculate the actual permuations themseves.
• > install.packages('combinat')
•
•
•
•
•
•
•
•
library(combinat)
m <- t(array(unlist(permn(3)), dim = c(3, 6)))
we get 3!=6 permutations
m
[,1] [,2] [,3]
[1,] 1
2
3
[2,] 1
3
2
[3,] 3
1
2
[4,] 3
2
1
[5,] 2
3
1
[6,] 2
1
3
What does this function do?
•
sample(1:3, 3)
•
Evaluate it 6 times and see.
•
Evaluate
•
•
>
>
#
>
permn(3),
Evaluate
unlist(permn(3))
Permutations in R
• Alternative way to calculate the actual permuations.
> install.packages(“gregmisc”)
> library(gregmisc)
> length(permutations(n=3,r=2))/2
[1] 6
> permutations(n=3,r=2)
[,1] [,2]
[1,]
1
2
[2,]
1
3
[3,]
2
1
[4,]
2
3
[5,]
3
1
[6,]
3
2
n Size of the source vector
r Size of the target vectors
Combinations of a set
• An r-combination of a set is a subset of
size r. Note the while a permutation is a
sequence a combination is a set.
Example: Combinations of a Set
• An r-combination of a set is a subset of
size r. A permutation is a sequence while
a combination is a set.
• The 2-permutations (seq.) of SOHN is:
<SO>,<SH>,<SN>,<OH>,<ON>,<OS>,<HN>,<HS>,<HO>,<NS>,<NO>,<NH>
• The 2-combinations (set) of SOHN is:
{S,O},{S,H},{S,N},{O,H},{O,N},{H,N}
Note in combinations order is ignored in sets {a,b}={b,a}
Combinations of a Set
• The binomial coefficient, “n choose r is written”:
n
n!
  
 r  r!(n  r )!
Combinations in R
•
install.packages(“combinat”) if not
already done
• There are 15 ways to choose 2 items from 6
items.
> choose(6,2)
[1] 15
• There are zero ways to pick 6 items from 2
items.
> choose(2,6)
[1] 0
Combinations in R
# install.packages('combinat') if not already done
# Generate all combinations of
# the letters a,b,c,d taking 2 at a time.
combn(letters[1:4], 2)
#Gives
[,1] [,2] [,3] [,4] [,5] [,6]
[1,] "a" "a" "a" "b" "b" "c"
[2,] "b" "c" "d" "c" "d" "d"
Pascal’s Triangle
Beginning with row 0 and place (or column) 0, the number 20 appears in row 6,
place 3. Using R we can check this.
> choose(6,3)
[1] 20
Check these on the triangle.
> choose(7,4)
[1] 35
> choose(7,2)
[1] 21
chose(6,3) . – gives 20
At the tip of Pascal's Triangle is the number 1, which makes up the 0th row. The
first row (1 & 1) contains two 1's, both formed by adding the two numbers above
them to the left and the right, in this case 1 and 0 (all numbers outside the
Triangle are 0's). Do the same to create the 2nd row: 0+1=1; 1+1=2; 1+0=1. And
the third: 0+1=1; 1+2=3; 2+1=3; 1+0=1. In this way, the rows of the triangle go
on infinitely. A number in the triangle can also be found by nCr (n Choose r)
where n is the number of the row and r is the element (column) in that row. For
example, in row 3, 1 is the 0th element, 3 is element number 1, the next three is
the 2nd element, and the last 1 is the 3rd element.
Special Combinations of a Set
n
   1
0
n
   1
n
n
   n
1 
How would you verify these laws in R?
n 

  n
 n 1
Calculating combinations using
factorial and division
8! 8  7  6!

 8  7  56
6!
6!
Calculating
n
 
k 
"n choose k".
8  8 7
  
 28
 2  1 2
Cancellation
already done for
these
9  98 7  6
  
 126
 4  1 2  3  4
12  12 1110  9  8
  
 792
1 2  3  4  5
5 
Combinations of a Set
• (16.10) The number of r-combinations of n
elements is  n 
r 
• A student has to answer 6 out of 9 questions on
an exam. How many ways can this be done?
 9  9!
9 87

 84
  
 6  6!3! 3  2  1
Combinations with repetitions of a
Set
• An r-combination with repetitions of a set S
of size n is a bag of size r all of whose
elements are in S. An r-combination of a
set is a subset of that set; an rcombination with repetition of a set is a
bag, since its elements need not be
distinct.
Combinations with repetitions of a
Set
• For example, the 2-combinations with
repetition of SON are the bags:
• S,O,S,N,O,N,S,S,O,
O,N,N
• On the other hand, the 2-permutations
with repetition are the sequences:
• <S,S>,<S,O>,<S,N>,<O,S>,<O,O>,<O,N>,<N,S>,<N,O>,<N,N>
Note SO and OS are
distinct permutations
Combinations with repetitions of a
Set
• (16.12) The number of r-combinations with
repetition of a set of size n is:
Repetitions
size
 n  r  1


r


Combination
size
Combinations with repetitions of a
Set
• Out of 7 people each gets either a burger,
a cheese burger, or fish (3 choices). How
many different orders are possible? The
answer is the number of 7-combinations
with repetition of a set of 3 elements.
 3  7  1
9!
 36

 
 7  7!2!
The Equivalence of three
statements
• (16.13) The following numbers are equal:
• The number of integer solutions of the
equation x1+x2+x3+...+xn=r.
• The number of r-combinations with
repetition of a set of size n.
• The number of identical ways r identical
objects can be distributed among n
different containers.
Permutations
• How many permutations of the letters are
there in the following words:
• LIE: n=3, 3! = 6
• BRUIT: n=5, 5! = 120
• CALUMMNY: n=7, 7!=5040
Permutations of a bag
• A coin is tossed 5 times, landing Head or
Tails to form an outcome. One possible
outcome is HHTTT.
• How many possible outcomes are there?
• How many outcomes have one Head?
• How many outcomes contain at most one
Head?
Permutations of a bag
How many possible outcomes are there?
Rule of product giving 25=32 possible
outcomes.
Permutations of a bag
How many outcomes have one Head?
Permutation of a bag with 1 Head and four Tails.
5!
5
1!4!
Permutations of a bag
• How many outcomes contain at most one
Head?
5
!
• One Head
5
1!4!
• No Heads
5!
1
0!5!
• At most one Head 1 + 5 = 6 (rule of sums)
Combinations of Set
• A chairman has to select a committee of 5 from
a facility of 25. How many possibilities are
there?
 25 
25!
 53130
  
 5  5!20!
• How many possibilities are there if the chair
should be on the committee?
 24 
24!
 10626
  
 4  4!20!
3-permuations of set
• Calculate the number of 3-combinations of the
string ABCDE.
• Combinations: In each of the 3 letters groups
the position does not matter (e.g. ABC, ABD,
ABE, ACD, ACE, ADE are distinct but CBA is not
because it is considered equivalent to ABC)
• C(n,r) = n!/(r! * (n-r)!)
• Formula: C(5,3) = 5!/(3!*(5-3)!) =
5*4*3*2*1/(3*2*1*2*1)) = 5*2 = 10
Calculate the number of
combinations of a set of 6 elements
Formula: C(n,r) = n!/(r! * (n-r)!)
Example: C(6,2) = 6!/(2!*(6-2)!) = 15
In R
> choose(6,1) + choose(6,2) +
choose(6,3) + choose(6,4) +
choose(6,5) + choose(6,6)
Gives 63
The Pigeonhole Principle
• (16.43) If more than n pigeons are placed in n
holes, at least one hole will contain more than
one pigeon.
• With more than n pigeons in n holes the average
number of pigeons per hole is greater than one.
• The statement “at least one hole will contain
more than one pigeon” is equivalent to “the
maximum number of pigeons in any whole is
greater than one”.
The Pigeonhole Principle
Related documents