Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Counting in probability Permutations The number of orderings of different events Combinations The number ways that outcomes can be grouped 1 Permutations The number of orderings of different events. Three cards: AKQ Four digits: 7412 AKQ, AQK, KQA, KAQ, QAK QKA = 6 24 Five symbols: # $ % & @ 120 2 Three different cards: AKQ AKQ, AQK, KQA, KAQ, QAK QKA = 6 Complete permutations P N! N N N! ( N )( N 1)( N 2) (2)(1) P 3 * 2 *1 6 3 3 3 Partial Permutations Four Different Digits (of 10) In the first spot: 10 different digits possible In the second spot: 9 possible left In the third spot: 8 possible left In the last spot: 7 possible left 10*9*8*7 = 5040 Pr N N! ( N r )! P410 10! 10 * 9 * 8 * 7 * 6 * 5 * 4 * 3 * 2 *1 6! 6 * 5 * 4 * 3 * 2 *1 4 Combinations The number ways that outcomes can be grouped Calculate the number of combinations of N objects taken r (r <= N) at a time. Example: 13 cards (AKQ…2) taken 5 at a time = 1287 N! C r!*( N r )! N r 13! 13 *12 *11*10 * 9 5!*8! 5 * 4 * 3 * 2 *1 5 Review the problem We want to be able to say something about the population from a single sample that we have drawn. This is the problem of statistical inference What can we infer from our sample? 6 Our logic for proceeding If the population has certain characteristics, then our sample will probably include certain outcomes and probably not include other outcomes If our sample has outcomes that are unlikely to come from that population, it probably did not come from that population 7 Our logic (continued) If our sample has outcomes that are unlikely to come from that population, it probably did not come from that population Our conclusion must be either that the population is as hypothesized or it is not as hypothesized. We reject or fail to reject the hypothesized population characteristic 8 Once More We reason from population characteristics to probability distribution of all possible samples -- the sampling distribution. We calculated this for the population of 7 black and 3 red marbles We noted that the sampling distribution began to look like a normal distribution 9 Sampling distributions There are a few sampling distributions that occur very frequently binomial normal student’s t statistic chi-square statistic F statistic 10 Binomial This is the distribution we have already calculated for the red (p=.3) and black (p=.7) marbles. It is also the distribution for coin tosses where p=.5 for heads and p=.5 for tails It is the distribution for all binary outcomes. 11 Normal This is the distribution for many empirical distribution. This is the distribution which many other distributions approach when working with large numbers: binomial, t, chisquare, etc. This is the distribution for the sum of a set of random variables. 12 Normal distribution characteristics The formula 1 e 2 1 x x 2 2 13 Normal distribution characteristics The shape .4 68% norm .3 .2 .1 0 -5 -4 -3 -2 -1 0 x 1 2 3 4 5 14 Normal distribution characteristics The shape .4 95% norm .3 .2 .1 0 -3 -2 -1 0 x 1 2 3 15 Normal distribution characteristicsMean The shape .4 One unit is one std. dev. norm .3 .2 .1 0 -3 -2 -1 0 x 1 2 3 16 Why the normal distribution? Central Limit Theorem Sampling distribution of the mean approaches normal distribution Mean of sampling distribution approaches mean of population sd of sampling distribution approaches population sd / sqrt(n) as n becomes large 17 Thus pop mean = 5, pop sd = 1 If n=4, sampling dist mean = 5, sd = .5 If n=9, sampling dist mean = 5, sd = .333 If n=25, samp dist mean = 5, sd = .20 If n=900, samp dist mean = 5, sd = .033 18 .4 Pop norm .3 .2 .1 0 2 3 4 5 x 6 7 8 .4 norm .3 N=4 .2 .1 0 2 3 4 5 xx 6 7 8 .4 norm .3 N=9 .2 .1 0 2 3 4 5 xxx 6 7 8 .4 N=900 norm .3 .2 .1 0 2 3 4 5 xxxx 6 7 8 19