Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Statistics Discrete Probability Distributions 1/71 Contents       Random Variables Discrete Probability Distributions Expected Value and Variance Binomial Distribution Poisson Distribution Hypergeometric Distribution .40 .30 .20 .10 0 1 2 3 4 STATISTICS in PRACTICE   Citibank makes available a wide range of financial services. Citibanking’s automatic teller machines (ATMs) located in Citicard Banking Centers (CBCs), let customers do all their banking in one place with the touch of a finger. STATISTICS in PRACTICE   Periodic CBC capacity studies are used to analyze customer waiting times and to determine whether additional ATMs are needed. Data collected by Citibank showed that the random customer arrivals followed a probability distribution known as the Poisson distribution. Random Variables    A random variable is a numerical description of the outcome of an experiment. A discrete random variable may assume either a finite number of values or an infinite sequence of values. A continuous random variable may assume any numerical value in an interval or collection of intervals. Differences between outcomes and random variables Example: Tossing a dice Possible outcomes: 1, 2, 3, 4, 5, and 6 One can define random variables as 1 if outcome is greater than 3, and 0 if outcome is smaller or equal to 3.  Or 1 if outcome is odd numbers 0 if outcome is even numbers  Discrete Random Variables  Example 1. The certified public accountant (CPA) examination has four parts. Define a random variable as x = the number of parts of the CPA examination passed and It is a discrete random variable because it may assume the finite number of values 0, 1, 2, 3, or 4. Discrete Random Variables Example 2. An experiment of cars arriving at a tollbooth. The random variable is x = the number of cars arriving during a one-day period. The possible values for x come from the sequence of integers 0, 1, 2, and so on. x is a discrete random variable assuming one of the values in this infinite sequence. Discrete Random Variables  Examples of Discrete Random Variables Example: JSL Appliances  Discrete random variable with a finite number of values Let x = number of TVs sold at the store in one day, where x can take on 5 values (0, 1, 2, 3, 4) Example: JSL Appliances  Discrete random variable with an infinite sequence of values Let x = number of customers arriving in one day, where x can take on the values 0, 1, 2, . . . We can count the customers arriving, but there is no finite upper limit on the number that might arrive. Random Variables Question Family size Random Variable x x = Number of dependents reported on tax return Distance from x = Distance in miles from home to store home to the store site Own dog or cat Type Discrete Continuous Discrete x = 1 if own no pet; = 2 if own dog(s) only; = 3 if own cat(s) only; = 4 if own dog(s) and cat(s) Continuous Random Variables Example 1. Experimental outcomes based on measurement scales such as time, weight, distance, and temperature can be described by continuous random variables. Continuous Random Variables Example 2. An experiment of monitoring incoming telephone calls to the claims office of a major insurance company. Suppose the random variable of interest is x = the time between consecutive incoming calls in minutes. This random variable may assume any value in the interval x ≥ 0. Continuous Random Variables  Example of Continuous Random Variables Discrete Probability Distributions  The probability distribution for a random variable describes how probabilities are distributed over the values of the random variable.  We can describe a discrete probability distribution with a table, graph, or equation. Discrete Probability Distributions  The probability distribution is defined by a probability function, denoted by f(x), which provides the probability for each value of the random variable. The required conditions for a discrete probability function are: f(x) > 0 f(x) = 1 Discrete Probability Distributions  Example: Probability Distribution for the Number of Automobiles Sold During a Day at Dicarlo Motors. Discrete Probability Distributions Using past data on TV sales, …  a tabular representation of the probability distribution for TV sales was developed. Number x f(x) 80/200  Units Sold 0 1 2 3 4 of Days 80 50 40 10 20 200 0 1 2 3 4 .40 .25 .20 .05 .10 1.00 Discrete Probability Distributions Graphical Representation of Probability Distribution .50 Probability  .40 .30 .20 .10 0 1 2 3 4 Values of Random Variable x (TV sales) Discrete Uniform Probability Distribution The discrete uniform probability distribution is the simplest example of a discrete probability distribution given by a formula. Discrete Uniform Probability Distribution The discrete uniform probability function is f(x) = 1/n the values of the random variable are equally likely where: n = the number of values the random variable may assume Discrete Uniform Probability Distribution  Example: An experiment of rolling a die we define the random variable x to be the number of dots on the upward face. There are n = 6 possible values for the random variable; x = 1, 2, 3, 4, 5, 6. The probability function for this discrete uniform random variable is f (x) = 1/6 x = 1, 2, 3, 4, 5, 6. Discrete Uniform Probability Distribution x 1 2 3 4 5 6 f (x) 1/6 1/6 1/6 1/6 1/6 1/6 Discrete Uniform Probability Distribution  Example: Consider the random variable x with the following discrete probability distribution. x 1 2 3 4 f (x) 1/10 2/10 3/10 4/10 This probability distribution can be defined by the formula f (x) = x/ 10 for x = 1, 2, 3, or 4. Expected Value and Variance  The expected value, or mean, of a random variable is a measure of its central location. E(x) =  = x f(x)  The variance summarizes the variability in the values of a random variable. Var(x) =  2 = (x - )2f(x) Expected Value and Variance   The standard deviation,  , is defined as the positive square root of the variance. Here, the expected value and variance are computed from random variables instead of outcomes Expected Value and Variance  Example: Calculation of the Expected Value for the Number of Automobiles Sold During A Day at Dicarlo Motors. Expected Value and Variance  Example: Calculation of the Variance for the Number of Automobiles Sold During A Day at Dicarlo Motors. The standard deviation is   1.25  1.118 Expected Value and Variance  Expected Value x 0 1 2 3 4 expected number of TVs sold in a day f(x) xf(x) .40 .00 .25 .25 .20 .40 .05 .15 .10 .40 E(x) = 1.20 Expected Value and Variance  Variance and Standard Deviation x x- 0 1 2 3 4 -1.2 -0.2 0.8 1.8 2.8 (x - )2 1.44 0.04 0.64 3.24 7.84 f(x) .40 .25 .20 .05 .10 (x - )2f(x) .576 .010 .128 .162 .784 Variance of daily sales =  2 = 1.660 TVs squared Standard deviation of daily sales = 1.2884 TVs Binomial Distribution  Four Properties of a Binomial Experiment 1. The experiment consists of a sequence of n identical trials. 2. Two outcomes, success and failure, are possible on each trial. 3. The probability of a success, denoted by p, does not change from trial to trial. stationarity assumption 4. The trials are independent. Binomial Distribution  Our interest is in the number of successes occurring in the n trials.  We let x denote the number of successes occurring in the n trials. Binomial Distribution  Binomial Probability Function where: f(x) = the probability of x successes in n trials, n = the number of trials, p = the probability of success on any one trial. Binomial Distribution Number of experimental n! x !(n  x )! = outcomes providing exactly x successes in n trials (nx ) p (1  p) x = Probability of a particular sequence of trial outcomes with x successes in n trials Binomial Distribution  Binomial Probability Function Binomial Distribution   Example: The experiment of tossing a coin five times and on each toss observing whether the coin lands with a head or a tail on its upward face. we want to count the number of heads appearing over the five tosses. Does this experiment show the properties of a binomial experiment? Binomial Distribution  Note that: 1. The experiment consists of five identical trials; each trial involves the tossing of one coin. 2. Two outcomes are possible for each trial: a head or a tail. We can designate head a success and tail a failure. Binomial Distribution Note that: 3. The probability of a head and the probability of a tail are the same for each trial, with p = .5 and 1- p = .5. 4. The trials or tosses are independent because the outcome on any one trial is not affected by what happens on other trials or tosses.  Binomial Distribution  Example: Evans Electronics Evans is concerned about a low retention rate for employees. In recent years, management has seen a turnover of 10% of the hourly employees annually. Thus, for any hourly employee chosen at random, management estimates a probability of 0.1 that the person will not be with the company next year. Binomial Distribution  Using the Binomial Probability Function Choosing 3 hourly employees at random, what is the probability that 1 of them will leave the company this year? Let: p = .10, n = 3, x = 1 n! f ( x)  p x (1  p ) (n  x ) x !( n  x )! 3! f (1)  (0.1)1 (0.9)2  3(.1)(.81)  .243 1!(3  1)! Binomial Distribution  T r e e D i a g r a m 1st Worker 2nd Worker Leaves (.1) Leaves (.1) 3rd Worker L (.1) x 3 Prob. .0010 S (.9) 2 .0090 L (.1) 2 .0090 S (.9) 1 .0810 L (.1) 2 .0090 S (.9) 1 .0810 1 .0810 0 .7290 Stays (.9) Leaves (.1) Stays (.9) L (.1) Stays (.9) S (.9) Binomial Distribution  Using Tables of Binomial Probabilities p n x .05 .10 .15 .20 .25 .30 .35 .40 .45 .50 3 0 1 2 3 .8574 .1354 .0071 .0001 .7290 .2430 .0270 .0010 .6141 .3251 .0574 .0034 .5120 .3840 .0960 .0080 .4219 .4219 .1406 .0156 .3430 .4410 .1890 .0270 .2746 .4436 .2389 .0429 .2160 .4320 .2880 .0640 .1664 .4084 .3341 .0911 .1250 .3750 .3750 .1250 Binomial Distribution  Expected Value E(x) =  = np  Variance Var(x) =  2 = np(1  p)  Standard Deviation   np(1  p ) Binomial Distribution  Expected Value E(x) =  = 3(.1) = .3 employees out of 3  Variance Var(x) =  2 = 3(.1)(.9) = .27  Standard Deviation   3(.1)(.9)  .52 employees Poisson Distribution  A Poisson distributed random variable is often useful in estimating the number of occurrences over a specified interval of time or space  It is a discrete random variable that may assume an infinite sequence of values (x = 0, 1, 2, . . . ). Poisson Distribution  Examples of a Poisson distributed random variable:  the number of knotholes in 14 linear feet of pine board  the number of vehicles arriving at a toll booth in one hour Poisson Distribution  Two Properties of a Poisson Experiment 1. The probability of an occurrence is the same for any two intervals of equal length. 2. The occurrence or nonoccurrence in any interval is independent of the occurrence or nonoccurrence in any other interval. Poisson Distribution  Poisson Probability Function f ( x)   x e x! where: f(x) = probability of x occurrences in an interval,  = mean number of occurrences in an interval, e = 2.71828. Poisson Distribution  Example: We are interested in the number of arrivals at the drive-up teller window of a bank during a 15-minute period on weekday mornings. Assume that the probability of a car arriving is the same for any two time periods of equal length and that the arrival or nonarrival of a car in any time period is independent of the arrival or nonarrival in any other time period. Poisson Distribution The Poisson probability function is applicable. Suppose that the average number of cars arriving in a 15-minute period of time is 10; in this case, the following probability function applies.  x 10 10 e f ( x)  x! The random variable here is x = number of cars arriving in any 15-minute period. Poisson Distribution  Example: Mercy Hospital Patients arrive at the emergency room of Mercy Hospital at the average rate of 6 per hour on weekend evenings. What is the probability of 4 arrivals in 30 minutes on a weekend evening? MERCY Poisson Distribution  Using the Poisson Probability Function  = 6/hour = 3/half-hour, x = 4 3 4 (2.71828)3 f (4)   4! .1680 MERCY Poisson Distribution  Using Poisson Probability Tables  x 0 1 2 3 4 5 6 7 8 2.1 .1225 .2572 .2700 .1890 .0992 .0417 .0146 .0044 .0011 2.2 .1108 .2438 .2681 .1966 .1082 .0476 .0174 .0055 .0015 2.3 .1003 .2306 .2652 .2033 .1169 .0538 .0206 .0068 .0019 2.4 .0907 .2177 .2613 .2090 .1254 .0602 .0241 .0083 .0025 2.5 .0821 .2052 .2565 .2138 .1336 ..0668 .0278 .0099 .0031 2.6 .0743 .1931 .2510 .2176 .1414 .0735 .0319 .0118 .0038 2.7 .0672 .1815 .2450 .2205 .1488 .0804 .0362 .0139 .0047 2.8 .0608 .1703 .2384 .2225 .1557 .0872 .0407 .0163 .0057 2.9 .0550 .1596 .2314 .2237 .1622 .0940 .0455 .0188 .0068 3.0 .0498 .1494 .2240 .2240 .1680 .1008 .0504 .0216 .0081 MERCY Poisson Distribution Poisson Distribution of Arrivals Poisson Probabilities 0.25 Probability  0.20 actually, the sequence continues: 11, 12, … 0.15 0.10 0.05 0.00 0 1 2 3 4 5 6 7 8 9 Number of Arrivals in 30 Minutes 10 Poisson Distribution A property of the Poisson distribution is that the mean and variance are equal.  =2 MERCY Poisson Distribution  Variance for Number of Arrivals During 30-Minute Periods =2=3 Hypergeometric Distribution The hypergeometric distribution is closely related to the binomial distribution. However, for the hypergeometric distribution: the trials are not independent, and the probability of success changes from trial to trial. Hypergeometric Distribution  Hypergeometric Probability Function  r  N  r     x  n  x  for 0 < x < r  f ( x)  N   n where: f(x) = probability of x successes in n trials, n = number of trials, N = number of elements in the population, r = number of elements in the population labeled success. Hypergeometric Distribution  Hypergeometric Probability Function for 0 < x < r Hypergeometric Distribution  Hypergeometric Probability Function r  number of ways x successes can be   = selected from a total of r successes  x in the population  N  r  number of ways n – x failures can be   = selected from a total of N – r failures  n  x  in the population number of ways N   = a sample of size n can be selected n  from a population of size N Hypergeometric Distribution   Example: A Quality Control Application. Electric fuses produced by Ontario Electric are packaged in boxes of 12 units each. Suppose an inspector randomly selects 3 of the 12 fuses in a box for testing. If the box contains exactly 5 defective fuses, what is the probability that the inspector will find exactly 1 of the 3 fuses defective? Hypergeometric Distribution  In this application, n = 3 and N = 12. With r = 5 defective fuses in the box. Hypergeometric Distribution  The probability of finding x = 1 defective fuse is  What is the probability of finding at least 1 defective fuse? Hypergeometric Distribution  The probability of x = 0 is  we conclude that the probability of finding at least 1 defective fuse must be 1 - .1591 = .8409. Hypergeometric Distribution  Example: Neveready Bob Neveready has removed two dead batteries from a flashlight and inadvertently mingled them with the two good batteries he intended as replacements. The four batteries look identical. Bob now randomly selects two of the four batteries. What is the probability he selects the two good batteries? Hypergeometric Distribution  Using the Hypergeometric Function where: x = 2 = number of good batteries selected n = 2 = number of batteries selected N = 4 = number of batteries in total r = 2 = number of good batteries in total Hypergeometric Distribution  Mean  Variance Hypergeometric Distribution  Mean  Variance Hypergeometric Distribution   Consider a hypergeometric distribution with n trials and let p = (r/n) denote the probability of a success on the first trial. If the population size is large, the term (N – n)/(N – 1) approaches 1. Hypergeometric Distribution  The expected value and variance can be written E(x) = np and Var(x) = np(1 – p).  Note that these are the expressions for the expected value and variance of a binomial distribution. continued Hypergeometric Distribution  When the population size is large, a hypergeometric distribution can be approximated by a binomial distribution with n trials and a probability of success p = (r/N).