Download L13A-Stat-Norm

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
MECH 373
Instrumentation and Measurements
Lecture 13/A
Statistical Analysis of Experimental Data
(Chapter 6)
• Introduction
• General Concepts and Definitions
• Probability
• Probability Distribution Function
• Parameter Estimation
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
1
PDF with Engineering Applications
A number of distribution functions are used in
engineering applications. We will discuss briefly about
some common distribution functions.
►Binomial (values either true/false)
►Poisson
►Normal (Gaussian)
►Student’s t
► χ2 (Chi-squared)
►Weibull
►Exponential
►Uniform
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
2
Probability Distribution Function
 Binomial Distribution
• The binomial distribution is a distribution which describes discrete random
variables that can have only two possible outcomes: “success” or “failure”.
• This distribution has applications in production quality control, when the
quality of a product is either acceptable or unacceptable.
• The binomial distribution provides the probability (P) of finding exactly r
successes in a total of n trials and is expressed as
n r
P(r )    p (1  p) n r
r
where, p is the probability of success which remains constant throughout the
experiment, and  n 
n!
  
 r  r!(n  r )!
is called as n combination r, which is the number of ways that we can choose r
identical items from n items.
• The expected number of successes in n trials for binomial distribution is   np
• The standard deviation of the binomial distribution is   np(1  p)
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
3
Probability Distribution Function
 Poisson Distribution
• The Poisson distribution is used to estimate the number of random occurrences
of an event in a specified interval of time or space if the average number of
occurrences is already known.
• The probability (P) of occurrence of x events is given by
e   x
P( x) 
x!
where, λ is the expected or mean number of occurrences during the interval of interest.
• The expected value of x for the Poisson distribution, the same as the mean (μ),
is given by
E (x)    
• The standard deviation is given by
 
• Probability that the number of occurrences is less than or equal to k is given by
e  i
P( x  k )  
i!
i 0
k
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
4
Probability Distribution Function
 Normal (Gaussian) Distribution
• The Normal or Gaussian distribution is a simple distribution function that is useful for
a large number of common problems involving continuous random variables.
1
 ( x  ) 2 / 2  2
f ( x) 
e
 2
• Symmetric about μ
• Bell-shaped
• Mean μ : the peak of
the density occurs
• Standard deviation σ:
indicates the spread of
the bell curve.
-1
Lecture 14
0
1
2
2
Lecture Notes on MECH 373 – Instrumentation and Measurements
3
4
5
5
Standard Normal Distribution
(mean=0, standard deviation=1)
For a given population, the probability of having a single value of x between a
lower limit of x1 and upper limit of x2 is
x2
z2
x1
z1
P( x1  x  x2 )   f ( x)dx  
1
 ( x  ) 2 / 2  2
e
dx
 2
Since f(x) is in the form of an error function, the integration must be
performed numerically. To simplify the integration process, the integrand is
modified with a change of variable. A non-dimensional variable z is defined
as:
x
z

A function f(z) which is called the standard normal density function is defined as
1 z2 / 2
f ( z) 
e
2
This function represents the normal probability density function for a random variable
z, with mean μ = 0, and standard deviation σ = 1. The function f(z) is plotted below
versus z.
Lecture Notes on MECH 373 – Instrumentation and Measurements
Lecture 14
6
Standard Normal Distribution
(mean=0, standard deviation=1)
x2
z2
x1
z1
P( x1  x  x2 )   f ( x)dx   f ( z )dz
The probability that x is between x1 and x2 is the same as the
probability that the transformed variable z is between z1 and z2.
That is
P( x1  x  x2 )  P( z1  z  z 2 )  P(
Lecture 14
x1  


Lecture Notes on MECH 373 – Instrumentation and Measurements
x


x2  

)
7
Confidence Intervals in Sample Population
It is also of interest to determine the probability that a measurement will fall within one
or more standard deviations of the mean. The values are listed in the following table:
This implies that if a measurement is performed in an environment where the deviation from
the mean value is totally influenced by random variables, we are
• 68.26% confident that the value of the random variable will fall within one standard
deviation (1σ) from the mean.
• 95.44% confident that the value of the random variable will fall within two standard
deviation (2σ) from the mean.
• 99.74% confident that the value of the random variable will fall within three standard
deviations (3σ) from the mean.
• 99.96% confident that the value of the random variable will fall within 3.5 standard
deviations (3.5σ) from the mean.
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
8
Standard Normal Distribution
(mean=0, standard deviation=1)
z
x
1 z2 / 2
f ( z) 
e
2

x2
z2
x1
z1
P( x1  x  x2 )   f ( x)dx   f ( z )dz
P( x1  x  x2 )  P( z1  z  z 2 )  P(
1
Z[1,1]
Lecture 14


x


x2  

)
3
Z [-3,3]
2
Z [-2,2]
68%
x1  
95%
Lecture Notes on MECH 373 – Instrumentation and Measurements
99.7%
9
Normal Distribution Example 1
• The distribution of heights of American women aged 18 to 24 is
approximately normally distributed with mean 65.5 inches and
standard deviation 2.5 inches.
• 68% of these American women have heights between 65.5 –
1(2.5) and 65.5 + 1(2.5) inches, or between 63 and 68 inches.
• 95% of these American women have heights between 65.5 2(2.5) and 65.5 + 2(2.5) inches, or between 60.5 and 70.5 inches.
• 99.7% of these American women have heights between 65.5 3(2.5) and 65.5 + 3(2.5) inches, or between 58 and 73 inches.
68%
Lecture 14
95%
Lecture Notes on MECH 373 – Instrumentation and Measurements
99.7%
10
Normal Distribution Example 2
•
•
•
•
•
•
•
•
•
•
•
Normal distribution mean value u = 10.0, Standard deviation σ = 1.0
A, Probability a single value between 9 and 12?
Z1 = (x – u)/σ = (9 - 10) / 1 = -1
P(-1 ≤ Z ≤ 0) = 0.3413 (Area from 0 to -1)
Z2 = (12 – 10) / 1 = 2
P(0 ≤ Z ≤ 2) = 0.4772 ( area from 0 to 2)
P(-1 ≤ Z ≤ 2) = 0.3413 + 0.4772 = 0.8185 (82%).
B, Probability less than 9?
Z1 = -1
P(-1 ≤ Z ≤ 0) = 0.3413 (Area from 0 to -1)
P (-∞ ≤ Z ≤ -1) = 0.5 – 0.3413 = 0.1587 (from -1 to negative infinity)
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
11
Normal Distribution Example 3
•
•
•
•
•
•
•
•
•
•
Diameter tolerance of cylinders – normal distribution.
Average diameter u = 4in, Standard deviation = 0.002.
A, Probability of D ≤ 4.002.
Z = (D – u) /σ = (4.002 – 4) / 0.002 = 1
P(0 ≤ Z ≤ 1) = 0.3413 (area from 0 to 1)
P (-∞ ≤ Z ≤ 0) = 0.5 (area from 0 to negative infinity)
P(-∞ ≤ Z ≤ 1) = 0.5 + 0.3413 = 0.8413 (84%)
B, D > 2 σ rejected?
P(0 ≤ Z ≤ 2) = 0.4772, also P(-2 ≤ Z ≤ 0) = 0.4772
P(-2 ≥ Z ≥ 2) = 1 – 2*0.4772 = 0.0456 (about 5%).
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
12
Parameter Estimation
In most experiments the sample size is small relative to the
population, yet the principle intention from the statistical
aspect is to estimate the parameters describing the entire
population. The most common estimated parameter is the
population mean and in some cases it is also necessary to
estimate the population standard deviation.
 Estimate population mean using sample mean
x
 Estimate population standard deviation using sample
standard deviation
S 
 However, different samples from the sample population
will generate different values. Therefore, it is necessary
to determine the uncertainty intervals of estimated
parameters. Lecture Notes on MECH 373 – Instrumentation and Measurements
Lecture 14
13
Interval Estimation of Population Mean
• Estimate of population mean
  x 
• Confidence interval
x     x 
• Confidence level (degree of confidence)
P( x      x   )
• Level of significance
Confidence level  1 a
where α is the probability that the mean will fall
outside the confidence interval.
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
14
From Sample to Population
 Choose many samples from population
 Find the mean of each sample
 Determine the uncertainly range of the means
(Sample mean is a variable !!!)
How to estimate the statistics of a population from
only one single sample?
The central limit theorem makes it possible to make an
estimate of the confidence interval with a suitable
confidence level.
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
15
Central Limit Theorem (a)
 Consider a population of the random variable x with a mean of μ
and standard deviation of σ
 Several different samples, each of size n, mean of x
i
 If sample size n is sufficiently large (>30), then

x 
n
(standard error of the mean)
 For the central limit theorem to apply, the sample size n must be
large (typically greater than 30)
 If original population is normal, the distribution for the xi is normal.
We can use the following equation
z
x
x

x
/ n
 If original population is not normal and n is large (n > 30) the
distribution for the xi is normal
 If original population is not normal and n < 30 the distribution for the
is only approximately normal
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
xi
16
Interval Estimation of Population
P[ za / 2  z  za / 2 ]  1  a
P[ x  za / 2

n
  x  za / 2
   x  za / 2

z

n
x
x
]  1a
Area  a / 2
Area  a / 2
n
with confidence level 1-a
Confidence Interval
 za / 2
Lecture 14
x

/ n
Lecture Notes on MECH 373 – Instrumentation and Measurements
z 0
za / 2
17
Interval Estimation of Population
• If z is zero, this means that x  
• However, we only expect the true value of μ to lie
somewhere within the confidence interval, which is  za / 2
• If the sample size is less than 30, the assumption that
the population standard deviation can be represented by
the sample standard deviation may not be accurate.
• In case of small samples, a statistic called Student’s t
distribution is used. That is
x 
t
S/ n
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
18
Normal Distribution Table
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
19
Example 6.11
We would like to determine the confidence interval of the mean of a batch of
resistor made in a certain process. Based on 36 measurements, the average
resistance is 25  and the sample stanadard deviation is 0.5 . Determine the
90% confidence interval of the mean resistance of the batch.





n = 36
Average = 25 
Sample standard deviation S = 0.5 
Determine 90% confidence interval of the mean
Solution:
 1-α=90%, -> α = 0.1
 0.5- α/2=0.5-0.05=0.45
 Table 6.3 -> Z α/2=1.645
S
S
0.5
0.5
   x  za / 2
 25  1.645 
   25  1.645 
n
n
36
36
 (24.86    25.14)
x  za / 2
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
20
Student’s t-distribution (n<30)
In case of small samples, a statistic called Student’s t distribution
is used. That is
 1
x
t
S/ n
(
f (t , ) 

2
 ( )(1 
2
)
t2

) ( 1) / 2
• Student’s t, degree of freedom ν = n-1
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
21
Interval Estimation of Population
Mean (n<30)
P[ta / 2  t  ta / 2 ]  1  a
P[ x  ta / 2
x
t
S/ n
S
S
   x  ta / 2
]  1a
n
n
  x  ta / 2
S
n
with confidence level 1-a
Area  a / 2
Confidence Interval
 ta / 2
Lecture 14
Area  a / 2
Lecture Notes on MECH 373 – Instrumentation and Measurements
t 0
ta / 2
22
Student’s t-Distribution Table
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
23
Example 6.12
•
•
•
•
n=6
1250, 1320, 1542, 1464, 1275, and 1383 h
Estimate population mean and 95% confidence interval on the mean
Solution:
– Mean=(1250+1320+1542+1464+1275+1383)/6=1372 h
– S=[(1250-1372)2+(1320-1372)2+ (1542-1372)2+ (1464-1372)2+
(1275-1372)2+(1383-1372)2]/(6-1)]1/2 = 114 h
– ν=n-1=6-1=5, α=1-95%=0.05, α/2=0.05/2=0.025
– Table 6.6 -> t α/2=2.571
S
S
   x  ta / 2
n
n
114
114
1372  2.571
   1372  2.571
6
6
 (1252    1492)h  1372  120h
x  ta / 2
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
24
Example 6.13
•
•
•
•
•
•
•
•
•
•
Lecture 14
Reduce the 95% confidence interval to ±50 h from ±120 h
Determine how many more systems should be tested
Solution: Assume n>30,



   x  za / 2
n
n
x  50    x  50
x  za / 2
za / 2
n
n  ( za / 2
α=1-95%=0.05, 0.5-α/2=0.5-0.05/2=0.475
Table 6.3 -> z α/2=1.96
n = (1.96x114/50)2=20
<30 => t-distribution
ν=n-1=20-1=19, α/2=0.05/2=0.025
S
x  ta / 2
Table 6.6 -> t α/2=2.093
n
n = (2.093x114/50)2=23
 50

50
)2
   x  ta / 2
S
n
x  50    x  50
Lecture Notes on MECH 373 – Instrumentation and Measurements
25
Chi-squared Distribution Function
n
 
2
2
(
x


)
 i
i 1
ν=4
2
f ( 2 )
n
S2 
2
2
(
x


)
 i
Area=α/2
ν = 10
i 1
Area=α/2
n 1
S2
 ( n  1) 2

2,1a / 2
•Relates the sample variance to
the population variance
2
( )
e
f ( ) 
2 / 2 ( / 2)
2
(  2 ) / 2
2
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
 2 / 2
2,a / 2
for 2  0
26
Interval Estimation of Population
Variance
P[ 
2
 ,1a / 2
 
2
2
 ,a / 2
P[ 2,1a / 2  (n  1)
S2

2
]  1a
  (n  1)
2
S2
2
 2,a / 2 ]  1  a
ν=4
(n  1) S

2
 ,a / 2
2
 
2
(n  1) S
2
Area=α/2
ν = 10

2
 ,1a / 2
With confidence level 1-a
Lecture 14
f ( 2 )
Area=α/2
2,1a / 2
Lecture Notes on MECH 373 – Instrumentation and Measurements

2
2,a / 2
27
Chi-squared Distribution Table
Lecture 14
Lecture Notes on MECH 373 – Instrumentation and Measurements
28
Example 6.14
• n = 20, mean = 0.32500 in, S = 0.00010 in
• obtain 95% confidence interval for the standard deviation
• Solution:
– ν=n-1=20-1=19, α=1-95%=0.05, α/2=0.05/2=0.025, 1α/2=1-0.05/2=0.975
– Table 6.7 -> χ2ν, α/2=32.825, χ2ν, 1-α/2=8.9065
(n  1) S

2
 ,a / 2
Lecture 14
2
 
2
(n  1) S
2,1a / 2
2
(20  1)0.00012
(20  1)0.00012
2
 
32.825
8.9065
0.000076   2  0.00015
Lecture Notes on MECH 373 – Instrumentation and Measurements
29
Related documents