Download statistic - Oklahoma State University

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
ADVANCED STATISTICS
FOR MEDICAL STUDIES
Mwarumba Mwavita, Ph.D.
School of Educational Studies
Research Evaluation Measurement and Statistics (REMS)
Oklahoma State University
Statistics

Set of methods and rules for organizing,
summarizing, and interpreting information.

Two categories of statistical procedures to
organize and interpreting data.
Descriptive and Inferential
Statistics

Descriptive statistics are statistical
procedures that are used to summarize,
organize, and simplify data.

Inferential statistics – techniques used to
study samples and make generalizations about
the populations from which they were selected.
Descriptive Statistics

Descriptive measure computed from the
data of a sample is called a statistic

Descriptive measure computed form the
data of a population is called a parameter
Central Tendency

A statistical measure that identifies a single score
as representative for an entire distribution

The goal of central tendency is to find the single
score that is most typical or most representative of
the entire group
Mean – commonly referred as the average
Mode – most frequent score in a distribution
Median – the middle value in a distribution
Variability






Range - highest score – lowest score
Semi-interquartile range - (Q3 – Q1)/2
Standard Deviation – the standard distance from mean
Variance - the mean of the squared deviations
Coefficient of Variation (CV) - useful for comparing two or
more data with different units of measurement because it is
expressed in percentage (CV = SD/mean x 100%)
Confidence Interval (CI) - is a measure of the precision of
the point estimate
Normal distribution

A bell shape distribution

It is symmetrical
Terms

IV – Independent variable (treatment)

DV – Dependent variable (outcome)
Z- Test

Used in hypothesis testing when a sample
mean is used to test a hypothesis about an
unknown population, generally a population
that has received treatment

Note the parameters of the population that
did not receive treatment are known
T- test
T statistic is used to test hypotheses about
µ when the value for population standard
deviation is not known
 Uses a t-distribution- thus degree of
freedom (number of scores in a sample that
are free to vary)
 Sample size determines use of t-distribution

Independent and Dependent t-test

Independent t-test uses two samples of
the treatment conditions. (rule of thumb at
least 10 subjects per each group)

Dependent also is referred as repeatedmeasure. A single sample of individuals is
measured more than once on the same
dependent variable
ANOVA (Analysis of Variance)

ANOVA - hypothesis-testing procedure used to
1. test hypotheses about population variances
2. evaluate mean differences between two or more
treatments (or populations)

Uses variances to determine if the means are
significantly different.
ANOVA (Analysis of Variance)
1. Single factor (one way) - one treatment under
different levels
2. Factorial designs – involves more than one
factor (treatment)
3. Repeated measures – assess a measurement
on the same participants under different
condition/time
Correlation and Regression
Analysis

Correlation analyses mathematically identify and
describe relationships between variables

Regression analysis attempts to predict or
estimate the value of a response variable form the
known values of one or more explanatory
variables
Factor Analysis

Exploratory factor analysis – used when the
researcher does not know how many factors are
necessary to explain the inter-relationships among
a set of characteristics, indicators, or items
(Reduction)

Confirmatory factor analysis- assess the extent
to which the hypothesized organization of a set of
identified factor fits the data
Survival Analysis

Survival/failure analysis is a family of techniques
dealing with the time it takes for something to
happen: cure, a failure, a relapse, a death and so on

Two major varieties of the technique are life tables,
which describe the course of survival of one or more
groups of cases

The second one encompasses a set of regression
techniques in which the DV is survival time
Nonparametric techniques




Usually do not state hypotheses in terms of a
specific parameter
They make vary few assumptions about the
population distribution- distribution-free tests.
Suited for data measured in ordinal and nominal
scales
Not as sensitive as parametric tests; more likely
to fail in detecting a real difference between two
treatments
Types of nonparametric tests

Chi-square statistic tests for Goodness of Fit
(how well the obtained sample proportions fit the
population proportions specified by the null
hypothesis

Test for independence – tests whether or not
there is a relationship between two variables
More Terms

Type I error – rejecting a true null hypothesis.
(treatment has an effect when in fact the
treatment has no effect)

Alpha level for a hypothesis test is the probability
that the test will lead to a Type I error
Scenario 1

Alcohol appears to be involved in a variety
of birth defects, including low birth weight
and retarded growth. A researcher would
like to investigate the effect of prenatal
alcohol on birth weight.
 How
will the researcher do this?
 D.V.
 I.V.
 Participants
Scenario 2

A researcher would like to know whether
room temperature affects eating behavior.
 Design
 I.V.
 D.V.
 Others
 Participants
Scenario 3

A patient recently visited her physician complaining
of backache. The physician is aware of a new
technique of disc replacement. The physician
would like to test the technique but does not want
to use it on the patient.
What would you advise the physician to
do in this case?
Scenario 4

You notice that students from a nearby
elementary school that you have attended suffer
from the common cold, a disease that has been
at the school for a while. How does this school
compare to an elementary school across town?
How would you go about investigating this
problem?
Scenario 5

Suppose you are interested in finding out
how a new treatment on osteoporosis
among women will work.
 Design
 IV
 DV
 Others
Scenario 6

Using scenario 5, how can we make it a twoway ANOVA?

How could we make it a Repeated-measures
ANOVA?
Scenario 7

Diabetes has been on the increase among
American adolescents. A researcher is
interested in determining factors that
contribute to rise of diabetes among
adolescents
Scenario 8

A physician is interested in finding out the
factors that contribute to lung cancer. How
would you design this study?
Scenario 9

How would you investigate factors that
contribute to high blood pressure among
people?
Summary
 Problem
 Design
issues
 Variables
 Participants
 Sample size