Download Table 1. Aggregate comparisons of CAOS scores Average percent

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Table 1. Aggregate comparisons of CAOS scores
Average percent correct
Sample size
National sample (NT)
Hope sample Fall 2007 (HT)
Hope sample Fall 2009 (HR)
763
198
202
Pre-test
Mean (SD)
1
44.9 (- )
48.4 (11.2)
44.7 (9.3)
Post-test
Mean (SD)
1
54.0 (- )
57.2 (11.8)
55.7 (11.8)
Difference in average percent
correct
Mean (SD)
9.1 (12.0)
8.9 (9.9)
11.0 (11.3)
1. Standard deviations for the pre and posttest were not available, but the standard deviation of the difference was recorded in delMas et al.
(2007)
Table 2. Items for which students in the HR cohort learned significantly more than the HT cohort
% of Students Correct
Item
number
on CAOS
Item Description (Topic)
1
Cohort
Pretest
Posttest
Difference
McNemar’s
Test p-value
Cohort
p-value2
aOR (95%CI)2
7
Understanding of the purpose of
randomization in an experiment (Data
collection and design)
NT
HT
HR
8.5
4.6
3.5
12.3
9.7
20.8
3.8
5.1
17.3
0.013
0.076
<0.001
0.001
0.5 (0.3, 0.8)**
0.4 (0.2, 0.7)**
1.0
19
Understanding that low p-values are
desirable in research studies (Tests of
significance)
NT
HT
HR
49.9
56.9
56.9
68.5
85.6
96.0
18.6
28.7
39.1
<0.001
<0.001
<0.001
<0.001
0.1 (0.0, 0.2)***
0.2 (0.1,0.5)**
1.0
23
Understanding that no statistical significance
does not guarantee that there is no effect
(Tests of significance)
NT
HT
HR
63.1
66.2
65.2
64.4
72.7
85.1
1.3
6.5
19.9
0.630
0.130
<0.001
<0.001
0.3 (0.2, 0.5)***
0.4 (0.3, 0.8)**
1.0
25
Ability to recognize a correct interpretation
of a p-value (Tests of significance)
NT
HT
HR
46.8
36.1
42.3
54.5
41.0
60.0
7.7
4.9
17.7
0.005
0.402
0.002
<0.001
0.8 (0.6, 1.1)
0.5 (0.3, 0.7)***
1.0
26
Ability to recognize an incorrect
interpretation of a p-value. Specifically,
probability that a treatment is not effective.
(Tests of significance)
NT
HT
HR
53.1
59.8
58.9
58.6
68.6
79.7
5.5
8.8
20.8
0.044
0.085
<0.001
<0.001
0.4 (0.3, 0.5)***
0.6 (0.3, 0.9)*
1.0
37
Understanding of how to simulate data to
find the probability of an observed value
(Probability)
NT
HT
HR
20.4
20.0
20.0
19.5
20.0
32.2
-0.9
0.0
12.2
0.713
1.000
0.009
0.001
0.5 (0.4, 0.7)***
0.5 (0.3, 0.8)**
1.0
1.
2.
NT= National sample with the Traditional curriculum, HT= Hope sample (2007) with the traditional curriculum, HR= Hope sample (2009) with the new
curriculum
Results from a logistic regression model predicting post-test (right/wrong) by curriculum, controlling for pre-test right/wrong. Cohort p-value gives the
overall p-value for the cohort term, and aOR gives the adjusted odds ratio (and corresponding 95% CI) comparing each curriculum to the new
randomization based curriculum. *p<0.05, **p<0.01 and ***p<0.001.
Table 3. Items for which students in the HR cohort learned significantly less than the HT cohort
% of Students Correct
Item
number
on CAOS
Item Description (Topic)
Ability to correctly estimate and compare
standard deviations for different
histograms. (Descriptive statistics)
14
1.
2.
1
Cohort
NT
HT
HR
Pretest
34.3
44.8
36.3
Posttest
Difference
McNemar’s
Test p-value
51.7
70.8
48.5
17.4
26.0
12.2
<0.001
<0.001
0.006
Cohort
p-value2
aOR (95%CI)2
<0.001
1.2 (0.9, 1.7)
2.5 (1.6, 3.9)***
1.0
NT= National sample with the Traditional curriculum, HT= Hope sample (2007) with the traditional curriculum, HR= Hope sample (2009) with the new
curriculum
Results from a logistic regression model predicting post-test (right/wrong) by curriculum, controlling for pre-test right/wrong. Cohort p-value gives the
overall p-value for the cohort term, and aOR gives the adjusted odds ratio (and corresponding 95% CI) comparing each curriculum to the new
randomization based curriculum. *p<0.05, **p<0.01 and ***p<0.001.
Table 4. Learning differences by topic1
Topic
Data collection
and Design
Descriptive
statistics
Total
numb
er of
items
4
New curriculum performed
better than Hope traditional
(Table 2)
New curriculum performed
worse than Hope traditional
(Table 3)
Other significant
differences between
samples
(Table A1)
7
3
No significant differences
between groups
(Table A2)
22, 24, 38
14
15, 18
Graphical
representations
Boxplots
9
6
1, 3, 4, 5, 11, 12, 13, 33
4
2
8,9,10
Bivariate data
3
39
20, 21
Probability
2
Sampling
variability
5
17
16,32,34,35
Confidence
Intervals
4
28,29,30
31
Tests of
significance
6
37
19, 23, 25, 26
1. CAOS item numbers are in the table
36
27,40
Table A1. Other items with significant differences between cohorts
% of Students Correct
Item
number
on CAOS
Item Description (Topic)
1
Cohort
Pretest
Posttest
Difference
McNemar’s
Test p-value
Cohort
p-value2
aOR (95%CI)2
2
Ability to recognize two different graphical
representations of the same data (boxplot
and histogram) (Boxplots)
NT
HT
HR
45.5
49.0
53.0
56.3
66.7
72.3
10.8
17.7
19.3
<0.001
<0.001
<0.001
<0.001
0.5 (0.4, 0.7)***
0.8 (0.5, 1.2)
1.0
6
Understanding that to properly describe the
distribution of a quantitative variable, a
graph like a histogram is needed (Graphical
Representations)
NT
HT
HR
15.1%
9.7%
5.9%
25.2%
15.4%
11.4%
10.10%
5.70%
5.50%
<0.001
0.052
0.061
0.001
2.2 (1.4, 3.6)**
1.3 (0.7, 2.4)
1.0
17
Understanding of expected patterns in
sampling variability (Sampling variability)
NT
HT
HR
42.8
54.1
40.1
50.3
66.2
57.9
7.5
12.1
17.8
<0.001
0.002
<0.001
0.001
0.7 (0.5, 0.9)*
1.2 (0.8, 1.9)
1.0
28
Ability to detect a misinterpretation of a
confidence level (the percentage of sample
data between confidence limits)
(Confidence intervals)
NT
HT
HR
48.4
48.7
52.0
43.2
50.3
58.7
-5.2
1.6
6.7
<0.001
0.679
0.762
<0.001
0.5 (0.4, 0.7)***
0.7 (0.5, 1.1)
1.0
29
Ability to detect a misinterpretation of a
confidence level (percentage of population
data values between confidence limits)
(Confidence Intervals)
NT
HT
HR
32.6
34.9
35.3
67.6
50.8
58.4
35.0
15.9
23.1
<0.001
0.001
<0.001
<0.001
1.5 (1.1, 2.1)*
0.7 (0.5, 1.1)
1.0
30
Ability to detect a misinterpretation of a
confidence level (percentage of all possible
sample means between confidence limits)
(Confidence Intervals)
NT
HT
HR
31.4
35.2
35.1
44.2
26.3
32.8
12.8
-8.9
-2.3
<0.001
0.085
0.665
<0.001
1.6 (1.1, 2.3)**
0.7 (0.5, 1.1)
1.0
36
Understanding of how to calculate
appropriate ratios to find conditional
probabilities using a table of data
NT
HT
HR
52.7
49.7
46.8
53.0
71.8
71.1
0.3
22.1
24.3
0.955
<0.001
<0.001
<0.001
0.4 (0.3, 0.6)***
1.0 (0.7, 1.6)
1.0
(Probability)
39
Understanding of when it is not wise to
extrapolate using a regression model
(Bivariate data)
1.
2.
NT
HT
HR
17.9
9.7
15
24.5
11.8
8.9
6.6
2.1
-6.1
0.001
0.618
0.066
<0.001
3.5 (2.0, 5.9)***
1.5 (0.8, 2.9)
1.0
NT= National sample with the Traditional curriculum, HT= Hope sample (2007) with the traditional curriculum, HR= Hope sample (2009) with the new
curriculum
Results from a logistic regression model predicting post-test (right/wrong) by curriculum, controlling for pre-test right/wrong. Cohort p-value gives the
overall p-value for the cohort term, and aOR gives the adjusted odds ratio (and corresponding 95% CI) comparing each curriculum to the new
randomization based curriculum. *p<0.05, **p<0.01 and ***p<0.001.
Table A2. Items without significant differences between cohorts
% of Students Correct
Item
number
on CAOS
Item Description (Topic)
1
Cohort
Pretest
Posttest
Difference
McNemar’s
test p-value
Cohort
p-value2
aOR (95%CI)2
1
Ability to describe and interpret the overall
distribution of a variable as displayed in a
histogram (Graphical Representations)
NT
HT
HR
71.1
75.9
68.3
73.6
78.5
80.2
2.5
2.6
11.9
0.291
0.597
0.012
0.088
0.7 (0.5, 1.0)
0.9 (0.5, 1.4)
1.0
3
Ability to visualize and match a histogram
to a description (negative skewed
distribution for scores on an easy quiz)
(Graphical representations)
NT
HT
HR
56.7
71.3
60.9
73.2
86.7
76.7
16.5
15.4
15.8
<0.001
<0.001
<0.001
0.019
0.9 (0.6, 1.3)
1.7 (1.0, 3.0)
1.0
4
Ability visualize and match a histogram to a
description of a variable (bell-shaped
distribution) (Graphical Representations)
NT
HT
HR
48.0
53.6
41.3
63.1
63.1
60.9
15.1
9.5
19.6
<0.001
0.027
<0.001
0.931
1.0 (0.7, 1.4)
1.0 (0.6, 1.5)
1.0
5
Ability to visualize and match a histogram
to a description of a variable (uniform
distribution) (Graphical representations)
NT
HT
HR
55.9
68.6
55.9
71.1
81.5
68.3
15.2
12.9
12.4
<0.001
<0.001
0.004
0.066
1.2 (0.8, 1.7)
1.8 (1.1, 3.0)*
1.0
8
Ability to determine which of two boxplots
represents a larger standard deviation
(Boxplots)
NT
HT
HR
54.7
52.8
56.7
59.2
62.6
48.0
4.5
9.8
-8.7
0.068
0.025
0.082
0.004
1.6 (1.2, 2.2)**
1.9 (1.2, 2.8)**
1.0
9
Understanding that boxplots do not provide
accurate estimates for percentages of data
above or below values except for the
quartiles (Boxplots)
NT
HT
HR
23.3
19.6
10.0
26.6
23.1
23.4
3.3
3.5
13.4
0.114
0.360
<0.001
0.742
1.0 (0.7, 1.5)
0.9 (0.5, 1.4)
1.0
10
Understanding of the interpretation of a
median in the context of boxplots
(Boxplots)
NT
HT
HR
19.6
21.0
17.3
28.3
33.8
33.2
8.7
12.8
15.9
<0.001
<0.001
<0.001
0.197
0.8 (0.5, 1.1)
1.0 (0.7, 1.6)
1.0
11
Ability to compare groups by considering
where most of the data are, and focusing on
distributions as single entities (Graphical
representations)
NT
HT
HR
88.0
93.3
89.6
88.2
94.9
92.5
0.2
1.6
2.9
0.928
0.629
0.362
0.027
0.6 (0.3, 1.1)
1.4 (0.6, 3.2)
1.0
12
Ability to compare groups by comparing
differences in averages (Graphical
representations)
NT
HT
HR
85.3
89.2
85.1
85.8
88.7
85.6
0.5
-0.5
0.5
0.804
1.00
1.00
0.716
1.0 (0.6, 1.6)
1.2 (0.7, 2.3)
1.0
13
Understanding that comparing two groups
does not require equal sample sizes in each
group, especially if both sets of data are
large (Graphical Representations)
NT
HT
HR
61.8
63.1
55.2
73.5
79
71.8
11.7
15.9
16.6
<0.001
<0.001
<0.001
0.302
1.0 (0.7, 1.4)
1.3 (0.8, 2.2)
1.0
15
Ability to correctly estimate standard
deviations for different histograms.
(Descriptive statistics)
NT
HT
HR
38.3
42.3
41.8
46.9
51.3
48.5
8.6
9.0
6.7
<0.001
0.053
0.203
0.688
1.0 (0.7, 1.3)
1.1 (0.7, 1.7)
1.0
16
Understanding that statistics from small
samples vary more than statistics from large
samples (Sampling variability)
NT
HT
HR
22.8
23.1
21.8
31.9
42.1
29.4
9.1
19.0
7.6
<0.001
<0.001
0.026
0.008
1.1 (0.7, 1.6)
1.9 (1.2, 2.9)**
1.0
18
Understanding the meaning of variability in
the context of repeated measurements, and
in a context where small variability is
desired (Descriptive statistics)
NT
HT
HR
80.6
86.7
82.6
80.6
87.7
78.7
0.0
1.0
-3.9
1.000
0.856
0.280
0.084
1.2 (0.8, 1.8)
1.9 (1.1, 3.3)*
1.0
20
Ability to match a scatterplot to a verbal
description of a bivariate relationship
(Bivariate Data)
NT
HT
HR
90.5
95.4
92.1
92.5
92.8
94.5
2.0
-2.6
2.4
0.159
0.383
0.541
0.644
0.7 (0.4, 1.4)
0.7 (0.3, 1.6)
1.0
21
Ability to correctly describe a bivariate
relationship shown in a scatterplot when
there is an outlier (influential point)
(Bivariate data)
NT
HT
HR
73.6
80.5
83.7
89.7
10.1
9.2
<0.001
0.010
0.004
0.165
1.1 (0.7, 1.6)
1.7 (0.9, 3.1)
1.0
71.1
82.7
11.6
22
Understanding that correlation does not
imply causation (Data collection and
design)
NT
HT
HR
54.6
52.1
44.1
52.6
54.4
61.9
-2.0
2.3
17.8
0.404
0.640
<0.001
0.011
0.6 (0.4, 0.8)**
0.7 (0.4, 1.0)
1.0
24
Understanding that an experimental design
with random assignment supports causal
inference (Data collection and design)
NT
HT
HR
58.5
64.6
56.3
59.5
65.5
59.4
1.0
0.9
3.1
0.731
1.000
0.505
0.441
0.9 (0.7, 1.3)
1.2 (0.8, 1.8)
1.0
27
Ability to recognize an incorrect
interpretation of a p-value. Specifically, as
the probability a treatment is effective.
(Tests of significance)
NT
HT
HR
42.3
37.1
52.7
47.7
10.4
10.6
<0.001
0.027
0.073
0.128
1.3 (1.0, 1.8)
1.1 (0.7, 1.6)
1.0
35.8
44.6
8.8
31
Ability to correctly interpret a confidence
interval (Confidence intervals)
NT
HT
HR
47.1
46.2
41.8
74.3
80.5
67.8
27.2
34.3
26.0
<0.001
<0.001
<0.001
0.017
1.4 (1.0, 1.9)
2.0 (1.2, 3.1)**
1.0
32
Understanding of how sampling errors are
used to make an informal inference about a
sample mean (Sampling variability)
NT
HT
HR
16.9
14.4
17.4
17.1
8.2
13.4
0.2
-6.2
-4.0
0.941
0.073
0.302
0.009
1.3 (0.8, 2.1)
0.6 (0.3, 1.1)
1.0
33
Understanding that a distribution with the
median larger than mean is most likely
skewed to the left (Graphical
representations)
NT
HT
HR
41.5
42.6
39.7
43.6
-1.8
1.0
0.511
0.941
0.755
0.312
1.2 (0.8, 1.6)
1.4 (0.9, 2.1)
1.0
37.6
35.8
-1.8
34
Understanding the law of large numbers for
a large sample by selecting an appropriate
sample from a population given the sample
size (Sampling variability)
NT
HT
HR
55.3
65.6
53.5
65.2
70.3
55.9
9.9
4.7
2.4
<0.001
0.368
0.649
0.020
1.5 (1.1, 2.0)*
1.7 (1.1, 2.6)**
1.0
35
Ability to select an appropriate sampling
distribution for a population and sample
size (Sampling variability)
NT
HT
HR
34.5
37.6
30.5
44.2
50.5
43.1
9.7
12.9
12.6
<0.001
0.013
0.010
0.341
1.0 (0.7, 1.4)
1.3 (0.9, 1.9)
1.0
Understanding of the factors that allow a
sample of data to be generalized to the
population (Data collection and design)
38
40
1.
2.
NT
HT
HR
26.0
25.1
20.8
37.9
34.4
29.5
11.9
9.3
8.7
<0.001
0.038
0.033
0.143
1.4 (1.0, 2.0)
1.2 (0.8, 1.9)
1.0
Understanding of the logic of a significance
NT
41.9
52
10.1
0.820
0.9 (0.7, 1.3)
<0.001
test when the null hypothesis is rejected
HT
36.4
53.3
16.9
1.0 (0.7, 1.5)
<0.001
(Tests of significance)
HR
40.6
53.5
12.9
0.010
1.0
NT= National sample with the Traditional curriculum, HT= Hope sample (2007) with the traditional curriculum, HR= Hope sample (2009) with the new
curriculum
Results from a logistic regression model predicting post-test (right/wrong) by curriculum, controlling for pre-test right/wrong. Cohort p-value gives the
overall p-value for the cohort term, and aOR gives the adjusted odds ratio (and corresponding 95% CI) comparing each curriculum to the new
randomization based curriculum. *p<0.05, **p<0.01 and ***p<0.001.
Related documents