Download Chapter 7 Measuring of data

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Foundations of statistics wikipedia , lookup

History of statistics wikipedia , lookup

Misuse of statistics wikipedia , lookup

Psychometrics wikipedia , lookup

Transcript
Chapter 7
Measuring of data
• Reliability of measuring instruments
• The reliability* of instrument is the consistency
with which it measures the target attribute.
• Example
If a scale weighed a person at 80 kg one minute and 85 kg
the next, we would consider it unreliable. The less variation
an instrument produces in repeated measurements, the
higher its reliability. Thus, reliability can be equated with a
measure’s stability, consistency, or dependability. Reliability
also concerns a measure
Dr. Areefa
VALIDITY
The second important criterion for evaluating a quantitative
instrument is its validity.
Validity is
the degree to which an instrument measures what it
is supposed to measure. When researchers develop
an instrument to measure hopelessness, how can
they be sure that resulting scores validly reflect this
construct and not something else, like depression?
Reliability and validity are not independent
qualities of an instrument. A measuring device that
is unreliable cannot possibly be valid.
Types
Face validity refers to the degree to which an assessment or test
subjectively appears to measure the variable or construct that it is
supposed to measure. In other words, face validity is when an
assessment or test appears to do what it claims to do
example
Lila is a researcher who has just developed a new assessment that is meant to
measure mathematical ability in college students. She selects a sample of 300
college students from three local universities and has them take the test. After
the students complete the test, Lila asks all 300 participants to complete a
follow-up questionnaire.
In the questionnaire, Lila asks the participants what they think the purpose of
the test is, what construct they believe is being measured, and whether or not
they feel the assessment was an adequate measure of their mathematical
ability. After analyzing the follow-up results, Lila finds that most of the
participants agree that Lila's assessment accurately measures their
mathematical ability. Lila's has just demonstrated that her assessment has face
validity.
Content Validity
Content validity concerns the degree to which an
instrument has an appropriate sample of items for the
construct being measured. Content validity is relevant for
both affective measures. content validity (also known as
logical validity). For example,
a depression scale may lack content validity if it only assesses the
affective dimension of depression but fails to take into account
the behavioral dimension. An element of subjectivity exists in
relation to determining content validity, which requires a degree
of agreement about what a particular personality trait such as
extraversion represents. A disagreement about a personality trait
will prevent the gain of a high content validity
Construct Validity
• Validating an instrument in terms of construct Validity is a
challenging task. The key construct validity questions are: What is
this instrument really measuring? Does it adequately measure the
abstract concept of interest? involves logical analysis and tests
predicted by theoretical considerations.
Analyzing Quantitative
Data: Descriptive Statistics
•
•
•
•
Statistical analysis helps researchers make
sense of quantitative information. Without
statistics, quantitative data would be useless
mass of numbers. Statistical procedures
enable researchers
• to summarize, organize, evaluate, interpret,
• and communicate numeric information.
• Statistics are either descriptive or inferential.
• Descriptive statistics are used to describe and
• synthesize data. Averages and percentages are
examples
• of descriptive statistics. Actually, when such
indexes are calculated on data from a
population, they are called parameters.
A descriptive
• index from a sample is called a statistic.
Research questions are about parameters, but
researchers calculate sample statistics to
estimate them, using inferential statistics to
make inferences about the population.
LEVELS OF MEASUREMENT
• Scientists have developed a system for
categorizing measures.
• This system is important because the analyses
that can be performed on data depend on
their measurement level.
• The four major classes, or levels, of
measurement are nominal, ordinal, interval,
• and ratio.
• Nominal Measurement
• The lowest level of measurement is nominal
measurement,
• which involves assigning numbers to
• classify characteristics into categories. Examples
• of variables amenable to nominal measurement
• include gender, blood type, and marital status.
The numeric codes assigned in nominal measurement
do not convey quantitative information.
If we classify males as 1 and females as 2, the numbers
have no inherent meaning. The number 2 clearly
does not mean “more than” 1. It would be perfectly
acceptable to reverse the code and use 1 for females
and 2 for males. The numbers are merely symbols ‫رمز‬
that represent two different values of the gender
attribute. Indeed, instead of numeric codes, we could
have used alphabetical symbols, such as M and F.
• Interval measurement
• occurs when researchers can specify the rankordering of objects on an attribute and can
assume equivalent distance between them.
• Most psychological and educational tests are
based on interval scales.
• Ratio Measurement
they tell us the exact value between units, AND
they also have an absolute zero–which allows
for a wide range of both descriptive and
inferential statistics to be applied. At the risk of
repeating myself, everything above about
interval data applies to ratio scales + ratio scales
have a clear definition of zero. Good examples
of ratio variables include height and weight.
Test your self
What type of measure?
What type of measure?
Temperature, date
interval
Wt. ht.
ratio
Constructing Frequency Distributions
• Frequency distributions are a method of
organizing numeric data.
• A frequency distribution is a systematic
• arrangement of values from lowest to highest,
• together with a count of the number of times
• each value was obtained.
The Mode
• The mode is the most frequently occurring score
• value in a distribution. The mode is simple to
determine;
• it is not computed but rather is established
• by inspecting a frequency distribution. In the
following
• distribution of numbers, we can readily see
• that the mode is 53:
• 50 51 51 52 53 53 53 53 54 55 56
The Median
The median is the point in a distribution above
which and below which 50% of cases fall.
As an example, consider the following set of values:
2 2 3 3 4 5 6 7 8 9 The value that divides the cases exactly
in half is 4.5, which is the median for this set of numbers.
The point that has 50% of the cases above and
below it is halfway between 4 and 5. An important
characteristic of the median is that it does not take into
account the quantitative values of scores. The median is an
index of average position in a distribution. It is insensitive to
extreme values.
• The Mean
The mean is equal to the sum of all scores divided
by the total number of scores. The mean is the
index usually referred to as an average. The
computational