Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
The normal distribution (Session 08) SADC Course in Statistics Learning Objectives At the end of this session you will be able to: • describe the normal probability distribution • state and interpret parameters associated with the normal distribution • use a calculator and statistical tables to calculate normal probabilities • appreciate the value of the normal distribution in practical situations To put your footer here go to View > Header and Footer 2 The Normal Distribution • In the previous two sessions, you were introduced to two discrete distributions, the Binomial and the Poisson. • In this session, we introduce the Normal Distribution – one of the commonest distributions followed by a continuous random variable • For example, heights of persons, their blood pressure, time taken for banana plants to grow, weights of animals, are likely to follow a normal distribution To put your footer here go to View > Header and Footer 3 Example: Weights of maize cobs Graph shows histogram of 100 maize cobs. Data which follows the bell shape of this histogram are said to follow a normal distribution. To put your footer here go to View > Header and Footer 4 Frequency definition of probability In a histogram, the bar areas correspond to frequencies. For example, there are 3 maize cobs with weight < 100 gms, and 19 maize cobs with weight < 120 gms. Hence, using the frequency approach to probability, we can say that Prob(X<120) = 19/100 = 0.019 The areas under the curve can be regarded as representing probabilities since the curve and edges of histogram would coincide for n=. To put your footer here go to View > Header and Footer 5 Probability Distribution Function f(x) 0.4 Two parameters associated with the normal distribution, its mean and variance 2. 0.2 0.0 x The mathematical expression describing the form of the normal distribution is f(x) = exp(–(x–)2/22)/(22) To put your footer here go to View > Header and Footer 6 Properties of the Normal Distribution f(x) 0.4 0.2 0.0 x • Total area under the curve is 1 • characterised by mean & variance: N(,2 ) • symmetric about mean () • 95% of observations lie within ± 2 of mean To put your footer here go to View > Header and Footer 7 The Standard Normal Distribution This is a distribution with =0 and =1, shown below in comparison with N(0,2), =3. 0.4 f(x) 0.2 -6 -4 -2 0 +2 To put your footer here go to View > Header and Footer +4 x +6 8 The Standard Normal Distribution This is a distribution with =0 and =1. Tables give probabilities associated with this distribution, i.e. for every value of a random variable Z which has a standard normal distribution, values of Pr(Z<z) are tabulated. In graph on right, P=Pr(Z<z). P Symmetry means any area (prob) can be found. 0 z To put your footer here go to View > Header and Footer 9 Calculating normal probabilities Any random variable, say X, having a normal distribution with mean and standard deviation , can be converted to a value (say z) from the standard normal distribution. This is done using the formula z = (X - ) / The z values are called z-scores. The z scores can be used to compute probabilities associated with X. To put your footer here go to View > Header and Footer 10 An example The pulse rate (say X) of healthy individuals is expected to have a normal distribution with mean of 75 beats per minute and a standard deviation of 8. What is the chance that a randomly selected individual will have a pulse rate < 65? We need to find Pr(X < 65) i.e. Pr(X - 75 < 65 - 75) = Pr[ (X – 75/8) < (-10/8) ] = Pr(Z < -1.28) Pr(Z<-1.3) = 0.0968 n) (using tables of the standard normal dist To put your footer here go to View > Header and Footer 11 A practical application Malnutrition amongst children is generally measured by comparing their weight-for-age with that of a standard, age-specific reference distribution for well-nourished children. A child’s weight-for-age is converted to a standardised normal score (an z-score), standardised to and of the reference distribution for the child’s gender and age. Children whose z-score<-2 are regarded as being underweight. To put your footer here go to View > Header and Footer 12 A Class Exercise Similarly to the above, height-for-age is used as a measure of stunting again converted to a standardised z score (stunted if z-score<-2). Suppose for example, the reference distribution for 32 months old girls has mean 91 cms with standard deviation 3.6 cms. What is the probability that a randomly selected girl of 32 months will have height between 83.8 and 87.4 cms? Graph below shows the area required. A class discussion will follow to get the answer. To put your footer here go to View > Header and Footer 13 Depicting required probability as an area under the normal curve 83.8 87.4 91.0 94.6 98.2 Answer = To put your footer here go to View > Header and Footer 14 Is a child stunted? Suppose a 32 month old girl has height-forage value = 82.1 Would you consider this child to be stunted? Discuss this question with your neighbour and write down your answer below. To put your footer here go to View > Header and Footer 15 Cumulative normal distribution Cumulative distribution is given by the function F(x) = P(X ≤a) a x In example above, the shaded area is 0.6, the value of a from tables of the standard normal distribution is 0.726. To put your footer here go to View > Header and Footer 16 a b P(a<X<b) = F(b)-F(a) is the area under the cumulative normal curve between points a and b. To put your footer here go to View > Header and Footer 17 Practical work follows … To put your footer here go to View > Header and Footer 18