Download Prezentace aplikace PowerPoint

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Distributions Commonly
Used in Statistics
Several types of probability distributions are used in
process of making statistical inferences about a
population on the basis of a sample.
The knowledge in this commonly used distributions and
their properties is needed to use and interpret statistical
methods correctly.
We can differ between two groups of the most often used distributions:
A)
Distributions used for populations
B)
Distributions used for samples
Distibutions most often used in statistics
(Summary):
A) For Populations:
Gaussian normal distribution
Standard normal disribution
Nonnormal distribution
B) For Samples:
t-distribution (Student´s)
Chi-square distribution (Pearson´s)
F- distribution (Fisher-Snedecor´s)
A) Gaussian Normal Distribution
Commonly, most of biological data follow Gaussian Normal Distribution:
bell-shaped curve - the majority of values is located around the mean (centre of
symmetry) with progressively less observations toward the extremes.
A) Gaussian Normal Distribution
The shape of the curve is determined by 2 parameters:
 - defines the centre of symmetry of the curve (middle of the population), indicates location
of the curve on the numerical axis of studied variable
 - defines the spread of the curve in the inflexion point (variability of the population)
A) Gaussian Normal Distribution
Within the range of:
  1 - 68.3% of all values in the population
  2 - 95.5%
  3 - 99.7%
A) Standard Normal Distribution
A) Non-normal Distribution („Irregular, Unknown“)
We can define the median only (as a centre). The spread of the curve
is not possible to assess.
B) Sample Distributions
t – Distribution (Student)
is defined for a theoretical variable t - is calculated from the mean and
SD in a sample.
(was published in 1908 by William Gosset under the pseudonym „Student“)
is used for samples selected from the population with Gaussian Normal
Distribution  there are several (many) curves for samples differing
in number of items (n).
B) t - Distribution (Student)
p(t)
Normal
n=6
n=10
n=
0
t
Height and spread of the curve - according to the sample size:
- the smaller is the sample size the broader and lower is the curve
- the bigger is the sample size the narrower and higher is the curve (in the extreme: n= the curve
joins the normal distribution)
B) t - Distribution (Student)
p(t)
/-2 = spread of the curve
Normal
1
5/3
(=n-1) (DF)
Degree of
Freedom
n=6
n=10
n=
0
t
Spread of the curve is defined by /-2: when n=  spread = 1 (normal)
when n=6  spread = 5/3
t-distribution - Use:
Statistical tables of t-distribution:
 critical values in testing for difference between two means
(Student t-test)
 coefficients used in calculations
B) Chi-square ( 2- Distribution (Pearson)
Asymmetrical curve – different shapes for different sample sizes (=n-1 determines the shape):
-
the smaller is the sample size the higher and more asymmetrical is the curve shape
-
the bigger is the sample size the lower and more symmetrical is the curve shape
Chi-square distribution - Use:
Statistical tables of  2- distribution:
 critical values in testing for difference between frequencies
 coefficients used in calculations
B) F - Distribution (Fisher – Snedecor)
Asymmetrical curve – different shapes for different sample sizes (1,2 determine the
shape):
-
the smaller is the sample size the lower and more asymmetrical is the curve shape
-
the bigger is the sample size the higher and more symmetrical is the curve shape
F-distribution - Use:
Statistical tables of F- distribution:
 critical values in testing for difference between 2
variances (F-test)
Related documents