Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Continuous Random Variables Week 6, Friday Continuous Random Variable The Discrete Random Variables we’ve seen so far are easy to represent in table form; however, some random variables are impossible to represent in this form. Example: Consider the number of seconds it takes to run one mile. For very fast runners, this number could be around 300. For very slow runners, this number could be as high as 1,200. Furthermore, it is possible to get more precise values by rounding to some decimal, for example 400.25 seconds. To put all of these values into a table would be impractical. For the purpose of this class, we will define such random variables as continuous random variables. But if we don’t define X by a table then how do we define it? ANSWER: with a function Examples of a Continuous R.V. Suppose I give a 10 point quiz. There are 100 questions, and partial credit is possible for each one. Then the score for students would be a continuous random variable because decimal scores such as 8.38495/10 may be possible, and these values are tough to put into a table. The way we describe such a random variable is through a function. 1 Consider the following function as a probability model: f ( x) 10 where x takes on values between 0 and 10. 1 10 0 P[0 ≤ X ≤ 10] = 100% What is the Area Under This Curve? 10 * (1/10) = 1.00 = 100% How does this link to Probability? For continuous random variables, P[a ≤ X ≤ b] can be found by calculating the area under the curve, between x=a and x=b 10 Examples of a Continuous R.V. Suppose I give a 10 point quiz. There are 100 questions, and partial credit is possible for each one. Then the score for students would be a continuous random variable because decimal scores such as 8.38495/10 may be possible, and these values are tough to put into a table. The way we describe such a random variable is through a function. 1 Consider the following function as a probability model: f ( x) 10 where x takes on values between 0 and 10. P[passing] = ? 1 10 0 P[6 ≤ X ≤ 10] = 4/10 = 40% What is the Area Under This Curve? 10 * (1/10) = 1.00 = 100% How does this link to Probability? For continuous random variables, P[a ≤ X ≤ b] can be found by calculating the area under the curve, between x=a and x=b 6 10 Examples of a Continuous R.V. Suppose I give a 10 point quiz. There are 100 questions, and partial credit is possible for each one. Then the score for students would be a continuous random variable because decimal scores such as 8.38495/10 may be possible, and these values are tough to put into a table. Alternative model: Define the function: f ( x) x 50 What is the Area Under This Curve? 10 * (.2) * (1/2) = 1.00 = 100% Now, what is the probability of Passing the quiz? P[passing] = 1 – area of the small triangle = 1 – ( 4 * .2 / 2 ) = .6 = 60% .2 0 6 10 The Most Important Continuous R.V. Consider the function that is defined for every number from (-∞,∞) f ( x) 1 2 What is the Area Under This Curve? 1 = 100% What is the Symmetry/Skewness? Symmetric e ( x )2 2 2 How many Normal Distributions exist? Infinite Amount For each combination of mu and sigma, there is a unique normal distribution The Most Important Continuous R.V. 1 e Don’t memorize the complicated formula: f ( x) 2 ( x )2 2 2 If mu and sigma (mean and standard deviation) are given to you then you should know exactly what the distribution looks like: The distribution is CENTERED at MU The distribution is more SPREAD OUT for larger values of SIGMA The Most Important Continuous R.V. 1 e Don’t memorize the complicated formula: f ( x) 2 ( x )2 2 2 If mu and sigma (mean and standard deviation) are given to you then you should know exactly what the distribution looks like: The distribution is CENTERED at MU The distribution is more SPREAD OUT for larger values of SIGMA The Most Important Continuous R.V. 1 e Don’t memorize the complicated formula: f ( x) 2 ( x )2 2 2 If mu and sigma (mean and standard deviation) are given to you then you should know exactly what the distribution looks like: The distribution is CENTERED at MU The distribution is more SPREAD OUT for larger values of SIGMA The Most Important Continuous R.V. 1 e Don’t memorize the complicated formula: f ( x) 2 ( x )2 2 2 If mu and sigma (mean and standard deviation) are given to you then you should know exactly what the distribution looks like: The distribution is CENTERED at MU The distribution is more SPREAD OUT for larger values of SIGMA The Most Important Continuous R.V. 1 e Don’t memorize the complicated formula: f ( x) 2 ( x )2 2 2 If mu and sigma (mean and standard deviation) are given to you then you should know exactly what the distribution looks like: The distribution is CENTERED at MU The distribution is more SPREAD OUT for larger values of SIGMA The Most Important Continuous R.V. 1 e Don’t memorize the complicated formula: f ( x) 2 ( x )2 2 2 If mu and sigma (mean and standard deviation) are given to you then you should know exactly what the distribution looks like: The distribution is CENTERED at MU The distribution is more SPREAD OUT for larger values of SIGMA N ( , ) The Most Important Continuous R.V. Example: Brown University gives a standardized exam to all students interested in joining the MBA program. The exam is graded in such a way that the scores are normally distributed with a mean of 100 and standard deviation of 10. “Empirical Rule”: For normal distributions, 68% of the values are within 1 standard deviation of the mean, 95% of the values are within 2 standard deviation of the mean, 99.8% of the values are within 3 standard deviation of the mean 10 100 90 110 The Most Important Continuous R.V. Example: Brown University gives a standardized exam to all students interested in joining the MBA program. The exam is graded in such a way that the scores are normally distributed with a mean of 100 and standard deviation of 10. “Empirical Rule”: For normal distributions, 68% of the values are within 1 standard deviation of the mean, 95% of the values are within 2 standard deviation of the mean, 99.8% of the values are within 3 standard deviation of the mean 10 100 80 90 110 120 The Most Important Continuous R.V. Example: Brown University gives a standardized exam to all students interested in joining the MBA program. The exam is graded in such a way that the scores are normally distributed with a mean of 100 and standard deviation of 10. “Empirical Rule”: For normal distributions, 68% of the values are within 1 standard deviation of the mean, 95% of the values are within 2 standard deviation of the mean, 99.8% of the values are within 3 standard deviation of the mean 10 100 70 80 90 110 120 130 Calculations with the Normal Distribution Recall the last example: Average test score (mu) is 100 with 10 as sigma. What’s the probability of scoring over 105? P[X ≥ 105] cannot be found by simple geometry, so we have to do something else – a “Z calculation” – and look up the probability on a table 10 100 105 Calculations with the Normal Distribution Recall the last example: Average test score (mu) is 100 with 10 as sigma. What’s the probability of scoring over 105? Step 1: Transform the variable X, N(mu, sigma) to Z, N(0,1) Z X 105 100 .5 10 1 0 .5 Calculations with the Normal Distribution Recall the last example: Average test score (mu) is 100 with 10 as sigma. What’s the probability of scoring over 105? Step 1: Transform the variable X, N(mu, sigma) to Z, N(0,1) Step 2: Look up the values in a “z table” (page A58, A59) 1 0 .5 For any value k of Z, you can look up the probabity P[Z≤k] Z X 105 100 .5 10 We are interested in the 6th row (0.5) and the first column, since the second decimal point is 0 (0.50) Therefore P[Z ≤ 0.50] = .6915 Calculations with the Normal Distribution Recall the last example: Average test score (mu) is 100 with 10 as sigma. What’s the probability of scoring over 105? Step 1: Transform the variable X, N(mu, sigma) to Z, N(0,1) Step 2: Look up the values in the a “z table” (page A58, A59) P[Z ≤ 0.5] = .6915 1 0 .5 P[Z > 0.5] = .3085