Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Continuous random variables I probability density, mean, s.d. I Normal distribution A random variable X is continuous if P(X = x) = 0 for all x uniform random variable I Suppose X is a number picked at ‘random ’ from the interval [0, 1] I P(X = x) = 0 for all x I But Probability X falls in an interval is equal to the length of the interval,and is nonzero. Probabilities involving X can be modeled by a continuous curve 0.0 1.0 2.0 Identifies prob with area under a function −1.0 −0.5 0.0 0.5 1.0 1.5 2.0 Continuous Probability Density Function The graphical form of the probability distribution for a continuous random variable x is a smooth curve Density curves A density curve is a mathematical model of a distribution. The total area under the curve, by definition, is equal to 1, or 100%. The area under the curve for a range of values is the probability of all observations for that range. Histogram of a sample with the smoothed, density curve describing theoretically the population. Density curves come in any imaginable shape. Some are well known mathematically and others aren’t. Our interest is in a special type of density called Normal density or Normal Distribution Normal random variables I Describes many random processes or continuous phenomena I Can be used to approximate discrete probability distributions Example: binomial I Basis for classical statistical inference Normal density A Normal density has two parameters µ, σ N(µ, σ) : f (x) = √ 1 2πσ e (x−µ)2 2σ 2 −∞<x <∞ E(X ) = µ, V (X ) = σ 2 , s.d(X ) = σ Normal Distribution Normal Distribution f(x ) x Mean Median Mode Normal Distribution Normal distributions x x Normal Distribution Here, means are the same (m = 15) while standard deviations are different (s = 2, 4, and 6). 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 Here, means are different (m = 10, 15, and 20) while standard deviations are the same (s = 3) 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 Normal Distribution I How to calculate probabilities? I Suppose we have N(5,3) and want to find P(2 < X < 7), how do we do that? N(5,3)table? I Do we need a table for every possible value of mean and s.d? Standard Normal Distribution I Suppose X is N(µ, σ), i.e a normal random variable with mean µ, and s.d σ I Define the “ standardized variable Z by I Z = X −µ σ I Then mean of Z =0 , s.d (Z ) = 1 and I Z is N(0,1) and is called STANDARD NORMAL I The transformation X 7−→ Z = X −µ σ I converts a N(µ, σ) to a standard normal , i.e N(0, 1) I As a consequence P(a < X < b) = P I Use standard normal tables a−µ b−µ <Z < σ σ Standard Normal Standard Normal Distribution The standard normal distribution is a normal distribution with µ = 0 and = 1. A random variable with a standard normal distribution, denoted by the symbol z, is called a standard normal random variable. Standard Normal table Standard Normal table The Standard Normal Table: P(0 < z < 1.96) Standardized Normal Probability Table (Portion) Z .04 .05 s=1 .06 1.8 .4671 .4678 .4686 .4750 1.9 .4738 .4744 .4750 2.0 .4793 .4798 .4803 m= 0 1.96 z 2.1 .4838 .4842 .4846 Probabilities Shaded area exaggerated Standard Normal table The Standard Normal Table: P(–1.26 z 1.26) Standardized Normal Distribution s=1 .3962 .3962 –1.26 1.26 z m=0 Shaded area exaggerated P(–1.26 ≤ z ≤ 1.26) = .3962 + .3962 = .7924 Standard Normal table The Standard Normal Table: P(z > 1.26) Standardized Normal Distribution s=1 P(z > 1.26) = .5000 – .3962 = .1038 .5000 .3962 1.26 m=0 z Find the probability that a N(µ, σ) variable is I within one standard deviation of the mean I within two standard deviation of the mean I within three standard deviation of the mean Problems 98,107,114,203 Problem 4.98 I Let X be the support. X is Normal with µ = 67.755 and σ = 26.871 I We will write this compactly as X is N(67.755,26.871). I Find P(X < 40) I P(X < 40) = P(Z < 40−67.755 26.871 ) = P(Z < −1.03) 0.1515 −3 −1.03 0 3 Problem 4.98 I X is N(67.755,26.871) I Find P(40 < X < 120) I P(< 40 < X < 120) = P( 40−67.755 26.871 < Z < I = P(1 − .03 < Z < −1.944) 10−67.755 26.871 ) Problem 4.98 0.8225 −3 −1.03 0 1.944 3 Problem 4.98 I Find P(40 < X < 120) I P(X > 120) = P(Z < 10−67.755 26.871 ) = P(Z < −1.944) Problem 4.98 0.0259 −3 0 1.944 3 Problem 4.98 I Find c such that P(X < c)0.25 I P(X < c) = P(Z < I from tables P(Z < −.6744) = .25 I so I c = 26.871 × (−.6744) + 67.755 = 49.63 c−67.755 26.871 c−67.755 26.871 ) = .25 = −.6744 Problem 4.98 0.2502 −3 −0.674 0 3 Problem 4.107 I X : Rating is N(50,15) I Find c such that P(X > c) = .1 I P(X > c) = P(Z > I from tables P(Z > 1.28) = .1 I c−50 15 I c−50 15 ) = 1.28 c = 15(1.28) + 50 = 69.2 = .1 Problem 4.107 0.0999 −3 0 1.282 3 I X : Rating is N(50,15) I Find c such that P(X > c) = .7 I P(X > c) = P(Z > I from tables P(Z > −.524) = .1 I c−50 15 I c−50 15 ) = .7 = −.524 c = 15(−.524) + 50 = 42.14 Problem 4.114 I X: quantity injected in a unit is N(10,.20 I cost of each unit fill $20 I If fill is less than 10, cost of refill$10 I sale price$230 Problem 4.114 I Find the probability that a unit is underfilled, not underfilled I P(X < 10) = P(Z < I Prob not underfilled = 0.5 10−10 .2 ) = P(Z < 0) = .5 Problem 4.114 I A container initially underfilled, now refilled has 10.6 units. What is the profit? I 230 - (10 + 20(10.6)) = 8 Problem 4.114 I Now X is N(10.1,0.2) I What is the expected profit? I for each unit profit is 230- 20 X ( since prob of underfilling is 0) I Expected profit = 230- 20 E(X) =230 - 20 (10.1) =28 Problem 4.203 I X: load is N(20,000,σ), σ unknown I P(10, 000 < X < 30, 000) = .95 I P( 10,000−20,000 <Z < σ I From tables P(−1.96 < Z < 1.96) = .95 I So 30,000−20,000 σ 30,000−20,000 ) σ = 1.96 or σ = = .95 10,000 1.96 = 5102.041 Normal Approximation to the Binomial I Normal distribution can be used to approximate probabilities of a Binomial distribution p np(1 − p) I Recall if X is Bin(n,p) then µ = np and σ = I It can be shown that if n is large then the probabilities of X can be approximated by a N(np, p np(1 − p) Normal Approximation to the Binomial Normal Approximation of Binomial Distribution 1. Useful because not all binomial tables exist 2. Requires large sample size 3. Gives approximate probability only 4. Need correction for continuity n = 10 p = 0.50 p(x) .3 .2 .1 .0 0 x 2 4 6 8 10 Normal Approximation to the Binomial Why Probability Is Approximate p(x) .3 .2 .1 .0 x 0 2 Binomial Probability: Bar Height 4 6 8 10 Normal Probability: Area Under Curve from 3.5 to 4.5 Correction for Continuity 1. A 1/2 unit adjustment to discrete variable 2. Used when approximating a discrete distribution with a continuous distribution 3. Improves accuracy 3.5 (4 – .5) 4 4.5 (4 + .5) problem 104 I X is Bin(100,000,.01) √ P(X < 950) ≈ P Z < 950−.5−(100,000).01 I P(Z < −1.604) = .0542 I 100,000(.01)(.99) Problem 113 I V: The number who carry visa is Bin(100,.5), E(V) = 50 I D: The number who carry Discover is Bin(100,.09) = 9 I P(V ≥ 50) = P(Z > √50−.5−50 ) = .5398 100(.5)(.5)