Download Statistical Analysis of Data: Detection Thresholds for 0.025 Contrast

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Frances Wu
22S: 30 Spring 2004
Statistical Analysis of Data: Detection Thresholds for 0.025 Contrast Level
When collecting data, statistical analysis is necessary to determine the
significance of the data. This is used to establish whether or not the data holds
for a certain hypothesis. For example, data is being collected in an observational
study for Ergonomics 56:147 and must undergo statistical analysis. This study,
“Detection Thresholds for Rectangular Targets with Different Aspect Ratios and
Orientations,” is trying to determine if a difference exists for the detection of
stimuli with different contrasts, lengths, widths, and orientations. The research
question proposed in this paper is: Given two lines with the same contrast level,
is there an effect on the detection threshold if the overall luminance is higher or
lower?
Data collection for the study is an ongoing process and has only resulted in
eight sets of data at this point. The testing procedure calls for an individual to
walk towards a computer screen that is presenting a line at a very low contrast.
At the point where the subject can identify the orientation (horizontal, vertical, or
diagonal), the distance from the screen is recorded. A total of 81 runs are given
from a combination of three widths, three angle orientations, and nine
combinations of background and target luminance levels. The data has been
dramatically cut back for use in this statistical analysis to only two combinations
of variables. Two lines were chosen from the test to be used as the two
treatments and the distances from which each subject detected the line as the
1
response variable. The lines are both horizontal and have a contrast level of
0.025. One set of data is for the line with a background luminance (BG Lum) of
12 cd/m2 and a target luminance (TG Lum) of 12.3 cd/m2. The other line has a
background luminance of 4 cd/m2 and a target luminance of 4.1 cd/m2. The
contrast is determined by the equation:
Contrast 
BG Lum  TG Lum
BG Lum
(Equation 1.1)
By using Equation 1.1, the contrast for both lines is determined to be 0.025. The
eight sets of data collected are presented in Table 1.1.
TABLE 1.1: Collected Data for 0.025 Contrast Levels
BG Lum 12 (ft)
BG Lum 4 (ft)
Subject 1
13.50
4.00
Subject 2
3.50
6.75
Subject 3
7.50
1.00
Subject 4
10.00
1.75
Subject 5
12.75
9.25
Subject 6
1.25
5.50
Subject 7
10.00
2.75
Subject 8
4.00
1.75
This is the raw data that was used for further statistical analysis. For simplicity,
the distance observations will be further referred to as BG Lum 12 and BG Lum 4
for background luminance 12 cd/m2 and background luminance 4 cd/m2,
respectively.
From a comparison of a variety of statistical tests such as z-tests, t-tests, and
X2 tests, a paired t-test was chosen to be the best to analyze the data. The data
is self-paired because the measurements for BG Lum 12 and BG Lum 4 were
taken for each subject. t-tests allow for inference about the mean without
2
knowing the population standard of deviation. The paired t-test further allows for
a comparison of the difference in the distances recorded for each subject. This is
beneficial because eyesight abilities vary greatly for each individual and the
detection at an overall brighter luminance can be compared with a lower
luminance target. The difference was calculated by subtracting the distance for
BG Lum 4 from the distance for BG Lum 12. The mean of these distances were
then used for calculating the t-statistic. A two-sided test is completed because it
is not initially evident whether detection is easier at a higher or lower luminance
level. The null hypothesis used was that there exists no difference in the
detection thresholds for the BG Lum 12 and BG Lum 4.
H 0 :  BG Lum12   BG Lum 4
H a :  BG Lum12   BG Lum 4
The robustness for t-tests does not hold for data sets that contain outliers. The
effects of outliers are enhanced even more when the sample size is small. By
the central limit theorem, distribution approaches Normal for sample means as
the sample size increases. In this case, the sample size is only 8. More data is
undeniably needed for this analysis but is not available at this time due to the
concurrent progress for both projects. A graphical representation of the data
collected is shown on the following page in Figure 1.1. As a rule of thumb, t-tests
should only be conducted for sample sizes less than 15 if the data is near
Normal, contains no outliers, and is not skewed. For sample sizes greater than
15, the test still cannot be used with outliers. Because more data is not available
at this time, t-tests were still conducted as the best test for this situation.
3
Detection Thresholds for 0.025 Contrast Levels
16.00
14.00
Distance (ft)
12.00
10.00
BG Lum 12 cd/m^2
8.00
BG Lum 4 cd/m^2
6.00
4.00
2.00
0.00
1
2
3
4
5
6
7
8
Subject
Figure 1.1: Detection Thresholds for 0.025 Contrast Levels
SAS was used for the statistical analysis of the data, testing for a 0.05
significance level. The following program shows the input used to run the paired
t-tests:
options linesize = 79;
data contrast;
input twelve four 8.;
diff = twelve - four;
datalines;
13.50 4.00
3.50 6.75
7.50 1.00
10.00 1.75
12.75 9.25
1.25 5.50
10.00 2.75
4.00 1.75
;
run;
proc print data = contrast;
run;
4
proc means data = contrast n mean stdev stderr clm alpha = 0.05;
var diff;
run;
proc univariate data = contrast;
var diff;
run;
Appendix A shows the complete output generated from this program. The tstatistic can be calculated from the data produced by the proc means function
using Equation 1.2.
t
d  0
(Equation 1.2)
sd / n
The mean calculated by SAS is 3.71875 with a standard deviation of 5.1883067.
The t-statistic then calculates to be 2.0273. The degrees of freedom for this set
of data is (n-1) = 7. From Table C, the 95% confidence level for 7 degrees of
freedom has t = 2.365. Because the 2.0273 < 2.365, the data is determined to
not be reliable enough for the 95% confidence level. Therefore, the null
hypothesis cannot be rejected. The information generated from proc univariate
provides the same information just calculated. The t-statistic is given as
2.027292 and the p-value is already calculated as 0.0822. Because 0.0822 >
0.05, the significance level being tested, the hypothesis (as already determined)
cannot be rejected.
A simple graph generated in Excel appears to relay convincing data. The top
line portrays the mean distances taken for a series of increasing contrasts with a
background luminance of 12 and the bottom line has the same levels of
increasing contrasts with a background luminance of 4.
5
Rectangular Detection Thresholds
20
18
Distance (ft)
16
14
12
BG Lum 12
10
BG Lum 4
8
6
4
2
0
0
0.01
0.02
0.03
0.04
Contrast (cd/m^2)
Figure 1.2: Detection Thresholds for Increasing Contrast Levels
The data analyzed in this paper was 0.025 and the mean is represented as the
two points where x = 0.025. By just looking at this graph, it appears that a higher
background luminance increases the distance from which the stimuli can be
detected. More generally, the brighter the overall picture, the further away a line
can be seen. However, from the statistical analysis, the null hypothesis cannot
be rejected for a 95% confidence level. This is most likely due to the very small
sample size, which does contain outliers. This graph is a testament to the need
for both an appropriate sample size and statistical analysis.
6
Related documents