S519 Statistical Sessions
Wrap up
Things we’ve covered
• Descriptive Statistics
• Normal Distributions
• Z-test
• Hypothesis Testing
• T-test
• ANOVA
• Correlation
• Linear regression
• Chi-square
Descriptive Statistics
• Central Tendency
– Mean
– Median
– Mode
• Variability
– Range
– Standard deviation
– Variance
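As a minimal sketch, the measures above can be computed with Python's standard `statistics` module. The `scores` list is made-up illustrative data, and the variance and standard deviation are the sample versions (dividing by n - 1), which is an assumption about which version is intended.

```python
import statistics

# Hypothetical sample of quiz scores (illustrative data, not from the course).
scores = [2, 4, 4, 5, 7, 8]

# Central tendency
mean = statistics.mean(scores)      # arithmetic average
median = statistics.median(scores)  # middle value of the sorted data
mode = statistics.mode(scores)      # most frequent value

# Variability
value_range = max(scores) - min(scores)  # largest value minus smallest value
variance = statistics.variance(scores)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(scores)       # square root of the sample variance

print(mean, median, mode, value_range, variance, std_dev)
```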
Normal Distributions
• Skewness
• Kurtosis
Z-test
Hypothesis Testing
1. State the hypothesis
– Null hypothesis
– Research hypothesis
• Directional
• Non-directional
2. Set decision criteria
3. Collect data and compute sample statistic
4. Make a decision (reject or fail to reject the null hypothesis)
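The four steps can be sketched with a one-sample, two-tailed z-test. This is only an illustration: the function name `one_sample_z_test`, the numbers, and the 0.05 criterion are assumptions, and the test presumes the population standard deviation is known.

```python
import math
from statistics import NormalDist

def one_sample_z_test(sample_mean, n, mu0, sigma, alpha=0.05):
    """Two-tailed one-sample z-test; returns (z, p_value, reject_null)."""
    # Step 1: the null hypothesis is that the population mean equals mu0
    # (non-directional research hypothesis: the mean differs from mu0).
    # Step 2: alpha is the decision criterion.
    # Step 3: compute the sample statistic from the collected data.
    standard_error = sigma / math.sqrt(n)
    z = (sample_mean - mu0) / standard_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    # Step 4: make a decision about the null hypothesis.
    return z, p_value, p_value < alpha

# Hypothetical numbers: 25 people with a sample mean of 106, compared with
# a population mean of 100 and a population standard deviation of 15.
z, p_value, reject_null = one_sample_z_test(sample_mean=106, n=25, mu0=100, sigma=15)
print(z, round(p_value, 4), reject_null)
```

For a directional hypothesis, the p-value would use one tail instead of two.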
T-test
• Degrees of freedom = n - 1 (one-sample or paired test)
• TTEST(array1, array2, tails, type)
– array1 = the cell address for the first set of data
– array2 = the cell address for the second set of data
– tails: 1 = one-tailed, 2 = two-tailed
– type: 1 = a paired t test; 2 = a two-sample test (independent with equal variances); 3 = a two-sample test with unequal variances
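To show what sits behind the paired case (type 1), here is a rough sketch that computes the t statistic and its degrees of freedom by hand. The helper `paired_t_statistic` and the data are hypothetical. Unlike TTEST, which returns a probability, this sketch stops at the t statistic, which would then be compared with a critical value from a t table.

```python
import math
import statistics

def paired_t_statistic(first, second):
    """Return (t, df) for a paired t test on two equal-length samples."""
    # Work with the difference score of each pair.
    differences = [b - a for a, b in zip(first, second)]
    n = len(differences)
    mean_diff = statistics.mean(differences)
    # Standard error of the mean difference (sample standard deviation / sqrt(n)).
    std_error = statistics.stdev(differences) / math.sqrt(n)
    return mean_diff / std_error, n - 1

# Hypothetical scores for the same five people before and after a treatment.
before = [10, 12, 9, 11, 13]
after = [12, 14, 10, 14, 15]
t, df = paired_t_statistic(before, after)
print(round(t, 3), df)
```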
ANOVA
• Analysis of Variance
• A hypothesis-testing procedure used to evaluate
mean differences between two or more treatments
(or populations).
• Advantages:
– 1) Can work with more than two samples.
– 2) Can work with more than one independent variable
ANOVA
• In ANOVA, an independent or quasi-independent variable is called a factor.
• Factor = independent (or quasi-independent) variable.
• Levels = the individual values (conditions) of the independent variable.
• One factor → “single-factor design”
• More than one factor → “factorial design”
ANOVA
• df for independent ANOVA
– Between-group degrees of freedom = k - 1
• k: number of groups
– Within-group degrees of freedom = N - k
• N: total sample size
• df for dependent ANOVA
– Between-group degrees of freedom = k - 1
• k: number of groups
– Within-group degrees of freedom = N - k
• N: total sample size
– Between-subject degrees of freedom = n - 1
• n: number of subjects
– Error degrees of freedom = (N - k) - (n - 1)
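For the independent case, the degrees of freedom feed directly into the F ratio. The following is a minimal sketch of a single-factor, independent-measures ANOVA. The function name `one_way_anova` and the three groups are made up for illustration, and the p-value lookup is left out.

```python
def one_way_anova(groups):
    """Return (df_between, df_within, F) for independent groups."""
    k = len(groups)                          # number of groups
    n_total = sum(len(g) for g in groups)    # N: total sample size
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Between-group sum of squares: group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Within-group sum of squares: scores around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    df_between = k - 1
    df_within = n_total - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return df_between, df_within, f_ratio

# Hypothetical scores for three independent groups of three people each.
df_between, df_within, f_ratio = one_way_anova([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(df_between, df_within, round(f_ratio, 2))
```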
ANOVA
• Three different ANOVA designs:
– Independent measures design: groups are samples of independent measurements (different people)
• Toolpak: ANOVA: single factor
– Dependent measures design ("repeated measures"): groups are samples of dependent measurements (usually the same people at different times)
• Toolpak: ANOVA: two factors without replication
– Factorial ANOVA (more than one factor)
• Toolpak: ANOVA: two factors with replication
Correlation
• Pearson correlation
– CORREL function or Pearson function
– Toolpak for more than two variables (matrix)
• The correlation represents the association
between two or more variables
• Correlation does not imply causality (a correlation alone does not show that one variable causes the other)
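As a rough illustration of what a Pearson correlation computes for one pair of variables, here is a hand-rolled sketch. The helper `pearson_r` and the `hours` and `grades` data are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of the deviations from each mean.
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable.
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return sp / math.sqrt(ss_x * ss_y)

# Hypothetical paired observations (illustrative only).
hours = [1, 2, 3, 4, 5]
grades = [2, 4, 5, 4, 5]
r = pearson_r(hours, grades)
print(round(r, 3))
```

For this made-up data r is about 0.775, a positive association that by itself says nothing about which variable, if either, drives the other.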
Correlation
rxy value    Interpretation
0.8 ~ 1.0    Very strong relationship (share most of the things in common)
0.6 ~ 0.8    Strong relationship (share many things in common)
0.4 ~ 0.6    Moderate relationship (share something in common)
0.2 ~ 0.4    Weak relationship (share a little in common)
0.0 ~ 0.2    Weak or no relationship (share very little or nothing in common)
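The interpretation ranges can be turned into a small lookup, sketched below. Two choices here are assumptions rather than part of the table: the size of the correlation is taken as the absolute value (so negative correlations are rated by strength), and a value on a boundary such as 0.8 is assigned to the stronger category.

```python
def interpret_correlation(r):
    """Map a correlation coefficient to a rough verbal label."""
    size = abs(r)  # assumption: strength ignores the sign of r
    if size >= 0.8:
        return "very strong"
    if size >= 0.6:
        return "strong"
    if size >= 0.4:
        return "moderate"
    if size >= 0.2:
        return "weak"
    return "weak or no relationship"

# Illustrative values only.
for value in (0.93, -0.65, 0.45, 0.1):
    print(value, interpret_correlation(value))
```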
Linear regression
• Y’ = bX + a
– b = SLOPE()
– a = INTERCEPT()
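The slope b and intercept a of the regression line can also be computed directly with the least-squares formulas. This is a minimal sketch. The helper `fit_line` and the data are hypothetical, not taken from the course.

```python
def fit_line(xs, ys):
    """Least-squares slope b and intercept a for Y' = bX + a."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: cross-products of deviations divided by squared deviations of X.
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    slope = sp / ss_x
    # Intercept: the line passes through the point (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical paired observations (illustrative only).
b, a = fit_line([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
predicted = b * 6 + a  # predicted Y' for a new X of 6
print(round(b, 2), round(a, 2), round(predicted, 2))
```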
Chi-square
• Non-parametric vs. parametric
• $\chi^2 = \sum \frac{(O - E)^2}{E}$
– O: the observed frequency
– E: the expected frequency
• df=r-1 (r= number of categories)
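A one-sample (goodness-of-fit) chi-square statistic can be sketched straight from these definitions. The function name `chi_square` and the frequencies are made up for illustration, and the resulting value would still need to be compared with a critical value for the given df.

```python
def chi_square(observed, expected):
    """Return (chi-square statistic, df) for one set of categories."""
    # Sum of (O - E)^2 / E over all categories.
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    df = len(observed) - 1  # r - 1, where r is the number of categories
    return statistic, df

# Hypothetical frequencies: 60 responses spread over three categories,
# with equal expected frequencies under the null hypothesis.
observed = [30, 20, 10]
expected = [20, 20, 20]
statistic, df = chi_square(observed, expected)
print(statistic, df)
```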