Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 8 Continuous Random Variables Stat-Slide-Show, Copyright 1994-95 by Quant Systems Inc. Prof. Vinod Introduction 8- 2 Continuum • The previous chapter was primarily devoted to random variables that were counts of some phenomenon that was called a “success.” • These counts could only take on discrete values usually starting at zero. • In this chapter the focus will be on variables that take on any value in some continuum, where a continuum is simply a range of numbers on the real number line. 8- 3 Example 1 • The range of newborn baby weights lies on a continuum between 2 and 13 pounds. [ ] 2 13 pounds • Observations measured in a continuum can be very close together; for example, two weights could be 8.5672538764522134599... inches, and 8.5672538764522134598... inches. 8- 4 Discrete or Continuous? • Many continuous variables are discretized (rounded) by the instruments used to measure them. • It is important to distinguish the nature of the variable from its physical measurement. • Variables like heights and weights are continuous random variables even though they are usually given to the nearest unit. 8- 5 Continuous vs. Discrete • One of the striking differences in discrete and continuous variables concerns the way in which probability is defined. • In a probability distribution for a discrete random variable, each possible outcome of the random variable is assigned its own probability. • However, for continuous random variables, there are infinitely many outcomes, and each has no probability. 8- 6 Outcomes of a Continuous Random Variable • Outcomes of a continuous random variable do not have probability assigned to any one point, because there are too many points. • If we attempted to assign each value, even an infinitesimal probability, the sum of all the probabilities would exceed one. (sum >1?) • Thus, for continuous random variables, probability is only assigned to intervals. 8- 7 Probability is area,(strict=fuzzy) • Since probability is assigned to intervals when x is a continuous random variable (rv), the probability is associated with the area under the pdf along the specified interval. • Bottom line strict inequality and the fuzzy (e.g. ) are equivalent for continuous rv: • P(x<some constant) = P(xthat same constant) 8- 8 The Continuous Uniform Distribution 8- 9 Continuous Uniform Distribution • Both uniform distributions distribute probability evenly across a sample space. • For the continuous uniform distribution, the probability density is spread out evenly over some range a to b as shown in the figure below. 1 b-a a b 8- Probability Density Functions (p.d.f) • Continuous random variables do not have probability distribution functions. • Instead, they have probability density functions which are denoted f(x). 8 - 11 The Uniform Probability Density Function 1 , f(x) = b - a 0 for a x b otherwise The parameters of the density function are the minimum and maximum value of the random variable and are referred to as a and b, respectively. 8- Expected Value and Standard Deviation • The expected value (mean) of the continuous uniform random variable is ab b-a . and = 2 12 • The standard deviation of the continuous uniform random variable is ab b-a an = . 2 12 8- Shape of the Continuous Uniform Distribution • When the uniform density function is graphed, it produces a rectangle or square. • The probability of observing a random variable in some interval is expressed as an area under the density function associated with the interval. 1 b-a a b 8- Calculating the Probability of an Interval Because the density function for the uniform distribution produces a rectangle, calculating the probability of an interval requires very simple geometry. Area of a rectangle =Height * width We want area to be 1, width from a to b is b-a, hence the height must be 1/(b-a) 8- Example 2 (Time to reach the fire) • The fire department records how long it takes each of its trucks to reach the scene of a fire. • Suppose that the distribution of arrival times is uniform with the minimum time being 2.0 minutes and the maximum time being 15 minutes. • What is the probability that a truck will reach a reported fire scene within five to ten minutes? 8- Ex.2 – Solution (5<time<10 minutes) Area = Height Width 1 = 1 = 1 Height = b - a 15 - 2 13 1 13 2 5 Area = 1 13 10 15 (10 - 5) = 5 13 P(5 X 10) = 5 13 8- Ex.2continued expected timeE(X), V(X) • What is the expected time until arrival? E(X) = = a + b = 15 + 2 = 8.5 min. 2 2 • What is the standard deviation of truck arrival times? = b - a = 15 - 2 = 3 .753 min. 12 12 8- What does data from a uniform random variable look like? • While the density function for the uniform distribution has a flat top, a histogram of data from a uniformly distributed random process will not be so perfectly flat. 100 observations 1000 observations 20 130 18 120 16 110 100 14 90 12 80 10 70 8 60 6 50 4 40 2 30 2 3.4 4.8 6.2 7.6 9 10.4 11.8 13.2 14.6 2 3.4 4.8 6.2 7.6 9 10.4 11.8 13.2 14.6 • When more observations are generated, the top of the distribution will begin to level. 8- The Normal Distribution 8- Normal Distribution • The normal distribution was originally called the Gaussian distribution, named after Karl Gauss who published a work in 1833 describing the mathematical definition of the distribution. • Gauss developed this distribution to describe the error in predicting the orbits of planets. 8- Shape of the Normal Distribution • Normal distributions are all bell shaped, but the bells come in various shapes and sizes. • However, since normal distributions are all bell shaped and symmetric, the mean, mode, and median are equal. mean median mode 8- Properties of the Normal Distribution Although the distribution can range in value from minus infinity to positive infinity, red values that are a great distance from the mean rarely occur. Range: -4<z<4 8- Properties of the Normal Distribution One of the more important properties of normal random variables is that within a fixed number of standard deviations from the mean, all Normal densities contain the same fraction of their probabilities. Remember one-sigma or twosigma rules? We now show how they are related to area under the standard Normal density z~N(0,1) curve. 8- Area Under the Curve • In the figure, the shaded area represents the probability of being within 1 of the mean. • The probability of a normal random variable being in some interval corresponds to the shaded area under the curve. 8- Area Under the Curve • The total area under the curve from - to equals one. • The shaded area under the curve and the probability of being within one standard deviation (1) of the mean equals .68. 8- Area Under the Curve 2 The shaded area in the figure above represents the probability of being within 2 of the mean which equals .9475 for every normal distribution. 8- Area Under the Curve 3 • The shaded area in the figure above represents the probability of being within 3 of the mean which equals .997 for every normal distribution. As you can see virtually all of the area under the curve is within three standard deviations of the mean. 8-