Download Chapter Two: Practice Problem Key

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Psychometrics wikipedia , lookup

History of statistics wikipedia , lookup

Misuse of statistics wikipedia , lookup

Categorical variable wikipedia , lookup

Transcript
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
Chapter Two: Practice Problem Key
Practicing the Concepts—Variables and Scales
1. Here are some variables one might consider in doing research to explore the conditions for
optimal language learning. Be prepared to discuss what type of scale might be used in defining
the levels of each: nominal, ordinal, or interval.
I think you’ll notice when you discuss your responses with others that people define some of
these variables in different ways—the “right” answer for a variable should reflect an individual’s
understanding of what the variable is.
Variable name
Native language
of learner
Language
learning aptitude
Type of scale
nominal
Language ability
interval,
ordinal, or
nominal
Learner’s age
interval,
ordinal, or
nominal
interval,
ordinal, or
nominal
Comments
An individual can be placed into a category representing
his or her native language.
Language learning aptitude is a person’s potential ability
to learn a second or foreign language. I envision this
variable as being interval scale, because I am familiar with
several interval-scale tests of language learning aptitude;
however, others may envision the variable as being
ordinal, particularly if they are aware of how the outcomes
of the Defense Language Aptitude Battery may be used to
place foreign language students into an easier or more
difficult language of study. Some people may even
envision the variable as being nominal—one person told
me he thought of the variable as having two categorical
levels, “able to learn foreign languages” and “not able to
learn foreign languages.” A researcher who includes this
variable in a study should provide a definition that makes
good sense in the context of the study, taking into
consideration the research question as well as features of
the research design.
There are many types of tools for measuring or describing
language ability. Tests like the Internet Based TOEFL
(IBT) yield interval scale data; the American Council for
Teachers of Foreign Language has an Oral Proficiency
Interview that yields ordinal scale data; and some tests for
grades K – 12 provide results that allow teachers to
categorize their learners according to their language
ability—these tests yield nominal scale outcomes. A
researcher who includes language ability as a variable in a
study needs to provide a definition that makes sense in the
context of the study, taking into consideration the research
question as well as features of the research design.
Time, the concept that is typically used to define age, is an
interval scale variable. However, a researcher may decide
to define age as an ordinal variable, placing participants in
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
Learner’s sex
nominal
Teacher’s
experience
interval,
ordinal, or
nominal
Teacher’s
education
interval,
ordinal, or
nominal
Teacher’s
personality
nominal;
perhaps ordinal
or interval?
Class meeting
time
nominal
Frequency of
class
nominal
Duration of class
nominal
Configuration of
seats
nominal
ordered levels. In some studies, age may even be defined
as a nominal variable; for example, in a study in which the
participants are placed into two groups, learners between
the ages of 21 and 35 and learners who are either younger
or older than students in the first group.
Sex is typically defined as a nominal variable with two
levels, male and female, in language education research.
Teacher’s experience could be defined in a number of
ways, including years of experience, which could be
defined on either an interval scale (simply number of
years) or an ordinal scale (with teachers placed into
ordered categories depending on the number of years’
experience they have). Teacher’s experience could be
defined using a nominal scale, too, perhaps as type of
experience.
Teacher’s education could be defined in a number of
ways, too, including years of education, expressed using
an interval scale (simply number of years), or an ordinal
scale (with teachers placed into ordered categories
depending on the number of years of education they’ve
had). Teacher’s education could be defined using a
nominal scale, too, with the levels being something like
high school diploma, associate degree, bachelor’s degree,
master’s degree, doctoral degree.
Teacher’s personality can be defined in many ways, too.
A researcher might choose to use a tool that categorizes or
characterizes individuals according to their personal
traits—a nominal scale. Alternatively, a single personality
trait might be identified, such as degree of extroversion or
introversion—and a data collection tool used that yields
an interval or ordinal degree of extroversion score.
Though time is an interval scale variable, I envision class
meeting time as a nominal variable—the time of day at
which a class meets is simply a category. The categories
might be given numerical labels (8 a.m., 6 p.m., etc.) or
semantic ones (morning, early afternoon, evening, etc.).
Just as classes might be characterized by when they meet,
they might also be categorized by how frequently they
meet (once a week; twice a week; five times a week, etc.).
Classes can be characterized or categorized by how many
hours each session is, or whether the course is a weekend
seminar or a full-length course.
I understand this variable, configuration of seats, as the
manner in which a classroom is set up. I consider it a
nominal variable because I can categorize or characterize
classrooms by the way the furniture is arranged (rows of
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
desks, small tables with seating for 4, conference table,
etc.).
2. No key for this question.
Study A. A Spanish vocabulary test was given to a group of Spanish-speaking first-graders. The
school district requires you to provide the mean and standard deviation for your students.
Complete the chart below to guide your computation of these values.
First the mean score must be determined using this formula: X =
X .
n
The sum of the scores (ΣX) is 326. The number of test takers (n) is 13. Divide 326 by 13 to
determine the mean score; 326/13 = 25.07692. I rounded that number off to 25.08 in my work
below: X = 25.08.
X
(X - X )
(X - X )2
41
41 - 25.08 = +15.92
(+15.92)2 = +253.45
33
33 - 25.08 = +7.92
(+7.92)2 = +62.73
32
32 - 25.08 = +6.92
(+6.92) 2 = +47.89
29
29 - 25.08 = +3.92
(+3.92) 2 = +15.37
27
27 - 25.08 = +1.92
(+1.92) 2 = +3.69
27
27 - 25.08 = +1.92
(+1.92) 2 = +3.69
26
26 - 25.08= +0.92
(+0.92) 2 = +0.85
24
24 - 25.08= -1.08
(-1.08) 2 = +1.17
19
19 - 25.08= -6.08
(-6.08) 2 = +36.97
19
19 - 25.08= -6.08
(-6.08) 2 = +36.97
18
18 - 25.08= -7.08
(-7.08) 2 = +50.13
17
17 - 25.08= -8.08
(-8.08) 2 = +65.29
14
14 - 25.08= -11.08
(-11.08) 2 = +122.77
The sum of the values in the third column, Σ(X - X )2, is 700.97
(X
The standard deviation formula is:
 X )2
n 1
.
The sum of the (X - X )2 values in the third column above is 700.97 and the number of
participants (n) is 13, so n – 13 = 12.
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
(X
So
 X )2
n 1
=
700.97
13  1
=
700.97
12
.
Carrying out the math, I find that the standard deviation is 7.64 points.
700.97
12
=
58.41 = 7.64
3. What is the mean? 25.07692 (which I rounded off to 25.08)
4. What is the mode? This distribution is bimodal; there are two scores that are most frequently
occurring, 19 and 27.
5. What is the median? There are 13 scores. The median is the score in the physical center of
the distribution when the scores are arranged from highest to lowest or vice versa so the median
(which is highlighted in blue in the table above) is 26.
6. What is the range? The range is the highest score minus the lowest score so this distribution
has a range of 27 points (41 – 14 = 27).
7. What is the standard deviation? 7.64 points
8. Here’s my representation of the distribution...
X
... 14
X
X X X
16 18 20 22
X
X X X X
24 26 28 30
X X
32 34
X
36
38
40
42 ...
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
Study B
9. Here are the R commands I used to enter the data and provide the information needed to
answer questions 9 through 14.
Enter the data.
data = c (88, 87, 54, 97, 34, 78, 56, 99, 87, 73,
74, 69, 85, 87, 86, 87)
Use the summary
command to calculate the
mean (blue highlighting)
and identify the median
(green highlighting). (The
minimum and maximum
scores are highlighted in
yellow.)
Use the subset (table)
commands to determine
the mode.
summary (data)
Here’s the output of that command:
Min. 1st Qu. Median Mean 3rd Qu. Max.
34.00 72.00 85.50
77.56 87.00
99.00
subset (table (data),
table(data)==max(table(data)))
Here’s the output—it means that the most
frequent score is 87, and that there are 4 scores of
87.
87
4
To calculate the standard
deviation, I used the
shortcut command sd
(pink highlighting).
I used the hist command
to make a frequency
distribution of the scores.
(I added color and a
label).
To calculate the range, I
retrieve the maximum
and minimum scores
from the output for the
summary command and
find the difference. (The
maximum and minimum
scores are highlighted in
yellow in the summary
output.)
sd (data)
Here’s the output:
17.23163
hist (data, col = "lavender", main =
"Frequency Distribution of Reading Quiz
Scores")
The histogram is presented below in question #9.
99 – 34
R presents the difference like this:
[1] 65
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
9. Make a frequency distribution of the data.
10. Calculate the mean (highlighted in blue in the chart of R commands above). 77.56
11. Calculate the standard deviation (highlighted in pink in the chart of R commands above)—I
round it off to 17.23 here.
13. Report the median (highlighted in green in the R commands). 85.5
14. What is the range? The range is 65 points (the difference between the maximum and the
minimum scores).
Standardized Scores
When scores are in a normal distribution, one can use that fact to ‘regularize’ them; that is, to
convert the scores from the raw score scale to a recognized ‘common’ scale (a standardized
scale), so the scores from different tests (or test forms) can be directly compared even when the
number of points on the different tests, the averages, and the standard deviations are different.
Keep in mind that scores can be standardized only when they are in a normal distribution
because the process relies on the defined characteristics of a normal distribution.
Here is some test performance information.
Midterm test
Mean = 47
s=3
n = 45
k = 100
Rob’s score = 53
Final test
Mean = 51
s=6
n = 43
k = 100
Rob’s score = 53
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
Michael’s score = 39
Michael’s score = 49
15. Calculate the percentages for Rob’s and Michael’s midterm scores and final scores.
Midterm
Final
Rob
53/100 = 53%
53/100 = 53%
Michael
39/100 = 39%
49/100 = 49%
16. Calculate the z-scores for Rob’s and Michael’s midterm scores and final scores.
The z-score formula resets the mean at zero and resets the standard deviation to 1.
The formula looks like this: z =
Midterm
Final
Rob
X  X 53  47
=
=
3
s
X X
53  51
=
=
6
s
X X
s
6
= +2
3
2
= + .33
6
Michael
X X
39  47 6
=
=
= -2
3
3
s
X  X 49  51 2
=
=
= - .33
6
6
s
17. Convert the z-scores to the T-scale or the CEEB scale.
Here’s the formula for converting z-scores to T-scores:
T = 10z + 50
Here’s the formula for converting z-scores to CEEB-scores:
CEEB = 100z + 500
T-scale conversion:
Midterm
Final
Rob
T = 10z + 50 =
(10)(+2) + 50 =
20 + 50 = 70
T = 10z + 50 =
(10)(+.33) + 50 =
3.3 + 50 = 53.30
Michael
T = 10z + 50 =
(10)(-2) + 50 =
- 20 + 50 = 30
T = 10z + 50 =
(10)(-.33) + 50 =
-3.3 + 50 = 46.70
Turner, J. Using statistics in small-scale language education research: Focus on non-parametric data
CEEB scale conversion
Midterm
Final
Rob
CEEB = 100z + 500 =
(100)(+2) + 500 =
200 + 500 = 700
CEEB = 100 z + 500 =
(100)(+.33) + 500 =
33 + 500 = 533
Michael
CEEB = 100z + 500 =
(100)(-2) + 500 =
- 200 + 500 = 300
CEEB = 100z + 500 =
(100)(-.33) + 500 =
-33 + 500 = 467
I believe I’d use the T-scale scores to discuss Rob’s and Michael’s performance with them and
their parents.
For Rob and his parents, I think I’d stress that in comparison to his peers he was doing very well
at midterm—as well as or better than 84% of them. However, on the final he didn’t do so well—
his score on the final test is only slightly above the mean. Unless he was ill when he took the
final test or there was some problem during the test administration, his performance on the final
(in comparison to his peers’ performance) suggests that he’s having difficulty with the material.
He may need some extra tutoring, a remedial summer class, or a curfew!
For Michael and his parents, I believe I’d note that he is working below the level of most of his
peers, but that he had made astounding progress. On the midterm, he performed the same as or
better than 16% of his peers. However, his score on the final test approaches the mean score—he
performed as well as or better than nearly half of his peers. I’d congratulate him on his progress
and encourage him to continue his hard work. He might benefit from extra tutoring and a
remedial summer class.