Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Statistics Chapter 4: Discrete Random Variables Where We’ve Been Using probability to make inferences about populations Measuring the reliability of the inferences McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 2 Where We’re Going Develop the notion of a random variable Numerical data and discrete random variables Discrete random variables and their probabilities McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 3 4.1: Two Types of Random Variables A random variable is a variable hat assumes numerical values associated with the random outcome of an experiment, where one (and only one) numerical value is assigned to each sample point. McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 4 4.1: Two Types of Random Variables A discrete random variable can assume a countable number of values. Number of steps to the top of the Eiffel Tower* A continuous random variable can assume any value along a given interval of a number line. The time a tourist stays at the top once s/he gets there *Believe it or not, the answer ranges from 1,652 to 1,789. See Great Buildings McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 5 4.1: Two Types of Random Variables Discrete random variables Number of sales Number of calls Shares of stock People in line Mistakes per page Continuous random variables Length Depth Volume Time Weight McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 6 4.2: Probability Distributions for Discrete Random Variables The probability distribution of a discrete random variable is a graph, table or formula that specifies the probability associated with each possible outcome the random variable can assume. p(x) ≥ 0 for all values of x p(x) = 1 McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 7 4.2: Probability Distributions for Discrete Random Variables Say a random variable x follows this pattern: p(x) = (.3)(.7)x-1 for x > 0. This table gives the probabilities (rounded to two digits) for x between 1 and 10. x P(x) 1 .30 2 .21 3 .15 4 .11 5 .07 6 .05 7 .04 8 .02 9 .02 10 .01 McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 8 4.3: Expected Values of Discrete Random Variables The mean, or expected value, of a discrete random variable is E( x) xp( x). McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 9 4.3: Expected Values of Discrete Random Variables The variance of a discrete random variable x is E[( x ) ] ( x ) p( x). 2 2 2 The standard deviation of a discrete random variable x is E[( x ) ] 2 2 (x ) McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 2 p( x). 10 4.3: Expected Values of Discrete Random Variables Chebyshev’s Rule Empirical Rule ≥0 .68 P ( 2 x 2 ) ≥ .75 .95 P ( 3 x 3 ) ≥ .89 1.00 P( x ) McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 11 4.3: Expected Values of Discrete Random Variables In a roulette wheel in a U.S. casino, a $1 bet on “even” wins $1 if the ball falls on an even number (same for “odd,” or “red,” or “black”). The odds of winning this bet are 47.37% P ( win $1) .4737 P (lose $1) .5263 $1 .4737 $1 .5263 .0526 .9986 On average, bettors lose about a nickel for each dollar they put down on a bet like this. (These are the best bets for patrons.) McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 12 4.4: The Binomial Distribution A Binomial Random Variable n identical trials Two outcomes: Success or Failure P(S) = p; P(F) = q = 1 – p Trials are independent x is the number of Successes in n trials McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 13 4.4: The Binomial Distribution A Binomial Random Variable n identical trials Two outcomes: Success or Failure P(S) = p; P(F) = q = 1 – p Trials are independent x is the number of S’s in n trials Flip a coin 3 times Outcomes are Heads or Tails P(H) = .5; P(F) = 1-.5 = .5 A head on flip i doesn’t change P(H) of flip i + 1 McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 14 4.4: The Binomial Distribution Results of 3 flips Probability Combined Summary HHH (p)(p)(p) p3 (1)p3q0 HHT (p)(p)(q) p2q HTH (p)(q)(p) p2q THH (q)(p)(p) p2q HTT (p)(q)(q) pq2 THT (q)(p)(q) pq2 TTH (q)(q)(p) pq2 TTT (q)(q)(q) q3 McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables (3)p2q1 (3)p1q2 (1)p0q3 15 4.4: The Binomial Distribution The Binomial Probability Distribution p = P(S) on a single trial q=1–p n = number of trials x = number of successes n x n x P( x) p q x McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 16 4.4: The Binomial Distribution The Binomial Probability Distribution n x n x P( x) p q x McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 17 4.4: The Binomial Distribution Say 40% of the class is female. What is the probability that 6 of the first 10 students walking in will be female? n x n x P ( x) p q x 10 6 106 (.4 )(. 6 ) 6 210(.004096)(. 1296) .1115 McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 18 4.4: The Binomial Distribution A Binomial Random Variable has Mean Variance Standard Deviation np npq 2 npq McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 19 4.4: The Binomial Distribution For 1,000 coin flips, np 1000 .5 500 npq 1000 .5 .5 250 2 npq 250 16 The actual probability of getting exactly 500 heads out of 1000 flips is just over 2.5%, but the probability of getting between 484 and 516 heads (that is, within one standard deviation of the mean) is about 68%. McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 20 4.5: The Poisson Distribution Evaluates the probability of a (usually small) number of occurrences out of many opportunities in a … Period of time Area Volume Weight Distance Other units of measurement McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 21 4.5: The Poisson Distribution P( x) x e x! = mean number of occurrences in the given unit of time, area, volume, etc. e = 2.71828…. µ= 2 = McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 22 4.5: The Poisson Distribution Say in a given stream there are an average of 3 striped trout per 100 yards. What is the probability of seeing 5 striped trout in the next 100 yards, assuming a Poisson distribution? P( x 5) x e 5 3 3e .1008 x! 5! McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 23 4.5: The Poisson Distribution How about in the next 50 yards, assuming a Poisson distribution? Since the distance is only half as long, is only half as large. P( x 5) x e 5 1.5 e x! 5! McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 1.5 .0141 24 4.6: The Hypergeometric Distribution In the binomial situation, each trial was independent. Drawing cards from a deck and replacing the drawn card each time If the card is not replaced, each trial depends on the previous trial(s). The hypergeometric distribution can be used in this case. McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 25 4.6: The Hypergeometric Distribution Randomly draw n elements from a set of N elements, without replacement. Assume there are r successes and N-r failures in the N elements. The hypergeometric random variable is the number of successes, x, drawn from the r available in the n selections. McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 26 4.6: The Hypergeometric Distribution r N r x n x P( x) N n where N = the total number of elements r = number of successes in the N elements n = number of elements drawn X = the number of successes in the n elements McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 27 4.6: The Hypergeometric Distribution r N r x n x P( x) N n nr N r ( N r ) n( N n) 2 N 2 ( N 1) McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 28 4.6: The Hypergeometric Distribution Suppose a customer at a pet store wants to buy two hamsters for his daughter, but he wants two males or two females (i.e., he wants only two hamsters in a few months) If there are ten hamsters, five male and five female, what is the probability of drawing two of the same sex? (With hamsters, it’s virtually a random selection.) 5 10 5 2 2 2 (10)(1) P( M 2) P( F 2) .22 45 10 2 P( M 2 or F 2) P( M 2) P( F 2) 2 .22 .44 McClave, Statistics, 11th ed. Chapter 4: Discrete Random Variables 29