Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
ECO 173 Brief review of Discrete Probability Distributions & Chapter 8: Continuous Probability Distributions Lecture 2a Instructor: Naveen Abedin 1 Recap from last class • Random Variable: A function or rule that assigns a number to each outcome of an experiment. • Example: the dual coin toss experiment. The variable we are interested to measure is the number of heads appearing in each outcome of the experiment. The possible values of the variable are 0, 1 and 2 Coin 1 Coin 2 Total Number of Heads (X) H H T T H T H T 2 1 1 0 2 Probability Distribution • A table, formula or graph that contains/describes the values a random variable can take, and the probability associated with these values is called probability distribution. Notation: • X : symbol for random variable • x : symbol for observation values that the random variable can take • P: symbol for probability. P(X = x) or P(x) denotes the probability that the random variable X can attain the value x. 3 Probability Distribution (cont.) • The probability distribution of the dual coin toss experiment would be as such: • Let X be the random variable that denoted the number of heads obtained from the dual coin toss. The possible observation values X can take are x = {0, 1, 2} Probability Distribution of X (number of heads) x P (X = x) 0 0.25 1 0.5 2 0.25 Total 1 4 Probability Distribution (cont.) • 2 fundamental requirements of a Probability Distribution of a Discrete Random Variable:1. 0 ≤ P(x) ≤ 1, for all x 2. ∑ P(x) = 1 (sum of all probabilities must equal to 1) 5 Probability Distribution (cont.) • Example 1: 116 million houses were surveyed. A list of the number of household members, and the corresponding number of houses are listed in the table below: Number of Household Members (X) Number of Households (millions) 1 31.1 2 38.6 3 18.8 4 16.2 5 7.2 6 2.7 7 or more 1.4 Total 116 6 Probability Distribution (cont.) • Let X be the random variable that denotes the number of household members. The values that X can take are x = {1, 2, 3, 4, 5, 6, 7 or more}. Probability distribution of X is as follows: x P(x) 1 31.1/116 = 0.268 2 38.6/116 = 0.333 3 0.162 4 0.140 5 0.062 6 0.023 7 or more 0.012 Total 1 7 Expected Value & Mean • The typical method for computing mean is the formula:- • However, when we are dealing with a very large population, i.e. N is very large, this method for computing average may not be feasible 8 Continuous Random Variable • A continuous random variable is one that can assume an uncountable number of values. (Since there are infinite possible number of values the random variable can take, we cannot make a list of values like in discrete random variable case.) • Since x can take an infinite possible number of values, the probability of each individual value is virtually 0 (e.g. P(X = 5) = 0). Hence, for continuous probability distributions, we calculate the probability of a range of values (e.g. P(5 ≤ X ≤ 10) = ?) instead of a specific value like P(X=5). 9 Continuous Random Variable (cont.) • In order to find probability of ranges of values of a particular continuous random variable, we need two things:1. Set fixed intervals/range 2. Calculate relative frequency within the range 10 Probability Density Function • Example 1: The following histogram records telephone bills for long-distance calls. There are a total of 200 records. 11 Probability Density Function (cont.) • A probability distribution for $15 intervals can be presented in a table: Billing Interval ($) Frequency (Number of Houses) Relative Frequency (Probabilities) 0 ≤ X ≤ 15 71 71/200 = 0.355 15 < X ≤ 30 37 37/200 = 0.185 30 < X ≤ 45 13 13/200 = 0.065 45 < X ≤ 60 9 9/200 = 0.045 60 < X ≤ 75 10 10/200 = 0.05 75 < X ≤ 90 18 18/200 = 0.09 90 < X ≤ 105 28 28/200 = 0.14 105 < X ≤ 120 14 14/200 = 0.07 Total 200 1 12 Probability Density Function (cont.) • In order to make the area under the histogram equal to 1, we need to rescale the vertical axis, by dividing the relative frequency by the interval, 15. This is going to make the area of each bar equal to the relative frequency, and as shown in the table, the sum of relative frequencies is equal to 1. 13 Probability Density Function (cont.) • This is helpful to find probabilities between ranges of values that overlap intervals. Find P (50 ≤ X ≤ 80). 14 Probability Density Function (cont.) • Suppose now that we increase the number of intervals. This will cause the width of each interval to shrink. • This will help us gradually get an idea of a smooth curve overlaying the histogram. This curve is called the probability density function (p.d.f). The area under p.d.f is equal to the probability of an interval. Therefore, the total area under the p.d.f curve is equal to 1. 15 16 Probability Density Function (cont.) • Requirements for probability density function, whose range is from a ≤ x ≤ b. 1. f(x) ≥ 0 for all x between a and b, (i.e. relative frequency/interval width cannot be negative) 2. Total area under the curve between a and b must equal to 1. 17 Normal Distribution • The range of this function is from −∞ to ∞ 18 Normal Distribution (cont.) • The normal distribution is primarily described through two parameters – the mean (μ) and the standard deviation (σ). • The mean, μ, is the value at the center of the normal distribution. Hence, increasing the mean shifts the entire curve to the right, and decreasing the mean shifts the entire curve to the left. 19 Normal Distribution (cont.) • Standard deviation, σ, is the square root of variance, measuring the degree of deviation from the mean. Larger values of σ makes the curve wider, while smaller values of σ makes the curve narrower. 20 Normal Distribution (cont.) • As before, to calculate the probability that a normal random variable falls into any interval, the area under the curve has to be calculated for that interval. • In order to calculate probabilities of random variables following normal distribution, we first need to standardize the random variable. • A variable can be standardized by subtracting the mean from the variable (X) and dividing it by the standard deviation. 21 Normal Distribution (cont.) • When a normal random variable is standardized, it is called a standard normal random variable, and is denoted by the symbol, Z. • Once a normal random variable has been standardized, the mean of the standard normal variable is 0, and the standard deviation is equal to 1. 22 Normal Distribution (cont.) • Example 4: Demand for gasoline is normally distributed with a mean of 1000 gallons and standard deviation of 100 gallons. The manager noted that there is exactly 1100 gallons of regular gasoline in storage. The next delivery is scheduled later today at the close of business. The manager would like to know the probability that he will have enough regular gasoline to satisfy today’s demand. 23 Normal Distribution (cont.) • Through the process of standardization we convert the distribution on the left to the one on the right. 24 Normal Distribution (cont.) 25 Normal Distribution (cont.) 26 Probability ZA • ZA represents the value z such that the area to its right under the standard normal curve is A. • To find A, we use the standard normal probabilities table backwards – we locate the probability from the table and identify the corresponding value of Z for that probability. 27 Standard Deviation • Standard deviation is a measure that is used to quantify the amount of variation or dispersion of a set of data values. A low standard deviation indicates that the data points tend to be close to the mean of the set, while a high standard deviation indicates that the data points are spread out over a wider range of values. • Standard deviation is expressed in the same units as the data. • For example, each of the three populations {0, 0, 14, 14}, {0, 6, 8, 14} and {6, 6, 8, 8} has a mean of 7. Their standard deviations are 7, 5, and 1, respectively. The third population has a much smaller standard deviation than the other two because its values are all close to 7. 28 68-95-99.7 Rule • The 68-96-99.7 rule is a shorthand to remember the percentage of values that lie within one, two and three standard deviations, respectively, of the mean of a normal distribution. • The rule can be used as a simple test for outliers. For example, it might be considered in a study that data points that fall more than 3 standard deviations from the mean are likely outliers. 29 68-95-99.7 Rule (cont.) 30 68-95-99.7 Rule (cont.) • Example 5: Miss N’s scores in Chemistry this semester were rather inconsistent: 100, 85, 55, 95, 75, 100. For this population, how many scores are within one standard deviation of the mean? • Answer: All but 55 lie within 1 standard deviation of the mean. This means 100, 85, 85, 75, and 100 lie within 68% of the mean 31