Download IS IT POSSIBLE TO FIND A GOOD POINT ESTIMATE OF A

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Probability wikipedia , lookup

Statistics wikipedia , lookup

Transcript
RADIOCARBON,
Vol 49, Nr 2, 2007, ρ 3 9 3 - 4 0 1
© 2 0 0 7 by the Arizona Board of Regents on behalf of the University of Arizona
IS IT POSSIBLE TO FIND A GOOD POINT ESTIMATE OF A CALIBRATED
RADIOCARBON DATE?
Adam Michczynski
Silesian University of Technology, Institute of Physics, Radiocarbon Laboratory, GAD AM Centre of Excellence, Boleslawa
Krzywoustego 2, 44-100 Gliwice, Poland. Email: [email protected].
ABSTRACT. The result from probabilistic calibration of a radiocarbon date is given in the form of a probability density
function. Consequently, reporting a 68% or 95% confidence interval has became a commonly accepted practice. However,
many users of C dates still try to present the results of calibration as a single point. This manner of presentation is often
applied during the construction of age-depth models due to its convenience and simplicity. In this paper, the author tests
whether it is possible to find a good point estimate of a calibrated C date. The idea of the tests is to compare, using computer
simulation, the true value of the calendar age with the age calculated based on the probabilistic calibration of the C date and
the method of finding the point estimate. The test is carried out for the following point estimates: mode, median, average, the
central point of the confidence intervals, and the local mode inside the confidence intervals. The results show that none of
these may be considered as a good estimate.
14
14
l 4
INTRODUCTION
The procedure for a probabilistic calibration of radiocarbon dates was described about 20 yr ago
(Aitchison et al. 1989; Stuiver and Reimer 1989) and is now widely known and universally applied.
The result of probabilistic calibration of a C date is given in the form of the probability distribution
(actually, the probability density function); therefore, reporting 68% and 95% confidence intervals
has became a commonly accepted practice. However, many users of C dates still try to present the
results of calibration as a single point, e.g. the center of the 95% confidence interval or date corresponding to the maximum of the probability density function. This manner of presentation has been
applied especially often during construction of age-depth models because of its convenience and
simplicity. Some users of C dates still apply the intercept method, which is not probabilistic.
14
, 4
14
The problem of point estimation for calibrated C dates has been discussed on various occasions,
but it is still an object of discussion. Telford et al. (2004) have analyzed how different estimates of
the calibrated C date behave as a consequence of small changes of the C date. They assume that
the value of a good point estimate should change only a little with a small change of the C date.
They conclude that there is no point estimate that can adequately describe the calibrated C date,
and wherever possible the full probability distribution should be used; however, when the point estimate must be used, they propose to apply the mean or median.
14
14
14
14
14
This paper tests another method to see whether it is possible to find a good point estimate of a calibrated C date. The idea of the test is to compare the true value of the calendar age of a sample (i.e.
the value we want to estimate) with the age calculated based on probabilistic calibration of the C
date and the method of finding the point estimate. Because we usually do not know the true value of
the calendar age of the sample, this comparison must be done using computer simulation.
14
14
METHODS
The idea behind the computer simulation, which allows us to compare the true value of the calendar
age with the estimated value of the calendar age, is shown schematically in Figure 1. It may be
described by the following steps:
1. We assume the value of the true calendar age of the sample, X .
T
© 2007 by the Arizona Board of Regents on behalf of the University of Arizona
Proceedings of the 18th International C Conference, edited by C Bronk Ramsey and TFG Higham
RADIOCARBON, Vol 49, Nr 2, 2007, ρ 3 9 3 - 4 0 1
14
393
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
394
A
Michczynski
2. The value X is transformed by the calibration curve in order to find the C date X corresponding with X .
3. We assume a value of uncertainty (error) of the C date, σ.
4. The C date X ± σ described by the Gaussian probability distribution is calibrated, and we
obtain the probability distribution of the calibrated C date.
5. We estimate a few statistical parameters of the distribution, which may be useful to estimate the
calendar age of the sample, and calculate the difference between the estimated value of the
parameter and the true calendar age.
14
T
R
T
14
14
R
14
3500
3400
3300
3200
C a l e n d a r years BP
1
3100
Figure 1 Schematic presentation for the computer simulation (detailed description in
text). The true calendar age is marked with a gray circle. A black triangle marks the
mode of probability distribution of the calibrated C date; gray rhomb: the median;
white square: the mean value.
14
In order to check which of the estimated parameters best estimate the true calendar age, the procedure is repeated for an assumed true calendar age that was changed every 20 yr in the range ΟΙ 4,000 cal BP. Calculations were carried out using the IntCal04 calibration curve (Reimer et al.
2004) and the revised and updated calibration module of the Gliwice Radiocarbon Laboratory calibration program GdCALIB (Aitchison et al. 1989; Pazdur and Michczynska 1989; Michczynska et
al. 1990).
RESULTS AND DISCUSSION
Mode, Median, and Mean Value
The parameters that were tested initially were the mode, median, and mean of the probability distribution because these parameters are often used for estimating the calendar age. The mode is a value
corresponding to the maximum of the density function, and the median is the value that divides the
graph of the density function into 2 parts with the same probability (= 0.5). The mean (or expected)
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
A Good
Point
Estimate
of the Calibrated
14
C
Date?
395
value is calculated as a weighted average of all possible age values given by the density function,
weighted by the probability that corresponds to that age. These 3 parameters have the same value for
the unimodal Gaussian distribution, but in other cases their values become different (see Figure 1).
The results of the simulation are presented in Figure 2. Black triangles denote the difference
between the true calendar age and mode of the probability distribution; gray rhombuses denote the
difference between the true calendar age and median; and white squares indicate the difference
between the true calendar age and the mean value. Figure 2 shows the results obtained for an
assumed uncertainty value (σ) equal to 25 yr (Figure 2A), 50 yr (Figure 2B), and 100 yr (Figure 2C).
It is clearly visible that for all tested parameters, large differences occur between the true calendar
age and the value of the parameter, i.e. none of the tested parameters is a very good estimate of the
true calendar age. We may also note that the values of calendar age estimated using the median and
mean value almost always differ from the true calendar age, while the values of the calendar age
estimated using the mode very often agree with the true calendar age. However, just for the mode we
can sometimes observe the largest differences between the true and estimated calendar age.
Figure 3 shows the frequency of appearance of particular differences between the true and estimated
calendar age of the sample in the range 0-14,000 cal BP for an assumed C uncertainty equal to
25 yr (Figure 3A), 50 yr (Figure 3B), and 100 yr (Figure 3C). The black line represents the mode,
dark gray line is the median, and light gray line is the mean value. We may observe that the differences for the median and mean value depend on the uncertainty of the C date; they become smaller
when the uncertainty decreases and larger when the uncertainty increases. For example, the differences between the true calendar age and the mean value are less than 35 yr for 66.2%, 54.9%, and
48.3% of the cases if the uncertainty is equal to 25, 50, and 100 yr, respectively. However, the differences for the mode are the same regardless of the uncertainty of the C date. This shows that the
reason for the differences is the shape of the calibration curve and the non-Normality of the resulting
calibrated age density.
14
14
14
Figure 3 confirms that good concordance between the true and estimated calendar age appears most
often when we estimate the calendar age using the mode of the probability distribution. The difference between the true calendar age and the mode of the probability distribution is equal to zero for
47.5%, and less than 35 yr for 7 3 % of the cases.
Figure 4 answers the question of where the largest differences occur and shows how these differences
may affect the chronology or time-depth model. We notice that the largest differences occur for those
calendar years where the calibration curve has large wiggles and rather flat areas with steep parts at
the ends. The probability distributions of calibrated C dates are in these cases multimodal or flat
and significantly differ from the Gaussian distribution. An influence of these differences on inference concerning chronology may be shown by the example of 2 samples with calendar ages equal
to 2700 and 2400 cal BP (gray circles on the χ axis in Figure 4). Depending on which estimating
parameter we choose, for a sample with the true calendar age equal to 2700 cal BP we obtain the
result shifted about 110-140 yr towards smaller age values, while for a sample with true calendar age
equal to 2400 cal BP the results are shifted about 50-100 yr towards greater age values (shifts are
marked in Figure 4 with arrows). Therefore, the estimated calendar age for both samples is almost
the same (see Figure 4), and if we use these results for constructing a chronology or age-depth model
we may draw wrong conclusions. The explanation for this effect is the flat area of the calibration
curve occurring between 2400 and 2700 cal BP (Hallstadt period). It results in probability distributions for C dates related to this period that are very wide, rather flat, and cover almost the whole
period 2400-2700 cal BP, whereas the parameters that were applied here place themselves in the
middle part of the distribution, even if the true calendar age is located at the end of the distribution.
14
14
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
396
A
Michczynski
250
200
150
2
>«
1 0 0
~
x
50
ϊ
ο
I-
υ
I
i
-50
"° -100
-150
-200
10000
-250
8000
6000
4000
X - true calendar age of sample (yrs BP)
T
10000
8000
6000
4000
X - true calendar age of sample (yrs BP)
T
12000
10000
8000
6000
4000
X - true calendar age of sample (yrs BP)
T
Figure 2 The differences between the true (X ) and estimated (X ) calendar age of a sample for the true
calendar age of a sample changing in the range 0-14,000 cal BP. Results are obtained for the following
values of the assumed uncertainty of a C date: Α: σ = 25 yr, Β: σ = 50 yr, C: σ = 100 yr. Black triangles
denote the difference between the true calendar age and the mode of the probability distribution; gray
rhombuses are the difference between the true calendar age and the median; and white squares are the difference between the true calendar age and the mean value.
T
E
, 4
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
A Good Point Estimate of the Calibrated
14
C Date?
" T-Mean
—T-Med
—T-Mode
Ν
ο
-250
-200
-150
-100
-50
0
50
difference X - X
T
100
150
200
250
E
«T-Mean
—T-Med
—T-Mode
-250
-50
0
50
difference X - X
T
E
Figure 3 The frequency of appearance of particular differences between the true and estimated calendar age of a sample in the range 0-14,000 cal BP. The black line represents
the mode; dark gray line: median; light gray line: mean value. The assumed uncertainty
of C date σ is equal to 25 yr (A), 50 yr (B), and 100 yr (C).
, 4
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
398
A
Michczynski
3500
ι
ι
ι IntCal 04'
3250
ι
CL
*
I
I
3000
&
2750
I
2500
A
ι — V J
-t * -* - - Î V r r
ÄJi"
1
F
" I " ~~
A
1
ι
^ t
— -f — — ~ —H
ι
-I
«
1
- *
- Α -
8
I
ι
I
Ι^
_l_
I
I
2250
.4 _
2000
1750
3750
3500
3250
3000
2750
2500
2250
X - true calendar age of sample (yrs BP)
2000
1750
T
Figure 4 The differences between the true (X ) and estimated (X ) calendar age of a sample for the true calendar age of
sample changing in the range 1750-3750 cal BP and the assumed uncertainty of C date σ = 50 yr. The black line shows
the calibration curve IntCal04 (Reimer et al. 2004). Black triangles denote the difference between the true calendar age
and the mode of the probability density function; gray rhombuses are the difference between the true calendar age and the
median; and white squares are the difference between the true calendar age and the mean value. The meaning of the symbols on the χ axis is explained in the text.
T
E
14
Other Point Estimates
There are also other point estimates that are used often and should be discussed (see Figure 5):
• The local mode (maximum), i.e. age value, which corresponds to the maximum of the probability distribution included in the 68.2% or 95.4% confidence intervals (marked in Figure 5 with
triangles);
• The central point (middle) of the confidence interval (marked in Figure 5 with asterisks).
3500
3400
3300
Calendar years BP
3200
3100
Figure 5 An example of 2 parts of the 68.2% confidence interval. Symbols on the χ axis show the following: gray
triangle: the mode of the first part of the interval; white triangle: the mode of the second part of the interval; black
asterisk: the central point of the first part of the interval; and gray asterisk: the central point of the second part of
the interval.
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
A Good Point Estimate of the Calibrated
C Date?
14
399
300
T-68Max1
T-68Max2
250
200
1
Δ *
Δ
t
-50 -100 -
%*• i
-
*
Δ
Δ
Δ
Α '
.
δ
4
Δ
>}
*·
4, A
δ1 Δ1δ
δ
£
ΔΑΔ
Δ
4Î J
δ δ
δ ι
Δ
ί
Al
-S-
£
i
> W
V
Î;-
**ΔΔ
1 δ
'
_4_δ_
*
A t **
α Α
- 1 ^ -ft-^
*
α
δ
*
.-^
-
_ __ __
I
Α*^
Δ
^ δ / | *
Ι**
1
δ Ι δ
Α
L
Λ
****
#ï - -
-200 -
'
Δ Δ Δ
_
γΗ"^-γ
1 -
-150 -
é
-I
'
Δ
A :
^
i..».'*.*-
4
Δ_
Δ
'
10000
8000
6000
4000
Χ - true calendar age of sample (yrs BP)
τ
T-95Max1
T-95Max2 ;
250 1A_
200 -
f|AAA
——
-A.A. .
A
4
150 -
- -±2Γ - | -
Ή -
4 t /
100 -
Δ
*
- Δ - - ^
Δ
if
' Δ "I
- - Δ Δ
a — 4A_?|a_2-*J. I
Α Δ'
*
*•
Α Α
.
*
A. A * ' A
δ
tf>
A
. *
'AJ*
*
1
* AIA *
Δ
Δ
1
1
Δ
Δ
A
*
A
A
V\.JAA
a\MV
δ
/I
J
-δ
'
A_'
*»
*AAI
V*î
-50
j
-100
_ _f_ - * -
j _ _
U
Δ
-150
_ A.
'
Δ
%
Δ| Δ
Δ
*
Δ
Δ
-200
"ä
-250 •
_
Δ
I Δ
7
Δ -
Δ
iL
Δ_Δ_
'·
Δ
_A_AJ
Δ
V
^
Α " Ϊ '
10000
8000
6000
4000
Χ - true calendar age of sample (yrs BP)
τ
0.5
— T-68Mode1
— T-68Mode2
—T-95Mode1 •
—T-95Mode2
0.45
0.4
0.35
0.3
0.25
0.2
" 0.15
0.1
0.05
-300
-250
-200
-150
-100
-50
0
50
difference X - X
T
100
150
200
250
300
E
Figure 6 The differences between the true calendar age and the calendar age estimated using the mode of the first (gray triangles) and the second part (white triangles) of the 68.2% (A) and 95.4% (B) confidence intervals, assumed uncertainty of
C date σ = 50 yr. C) The frequency of appearance of particular differences. The thin black line represents the mode of the
second part of the 68.2% interval; thick gray line: the mode of the first part of the 95.4% interval; thick black line: the mode
of the second part of the 95.4% interval. The thin gray line, which should present the graph for the mode of the first part of
68.2% interval, is not visible because this graph is identical with the graph for the mode of the first part of the 95.4% interval.
14
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
400
A
Michczynski
14000
12000
10000
8000
6000
4000
Χ - true calendar age of sample (yrs BP)
2000
0
γ
Figure 7 The differences between the true calendar age and the calendar age estimated using the central
point of the first (black asterisks) and the second part (gray asterisks) of the 68.2% (A) and 95.4% (B)
confidence intervals, assumed uncertainty of C date σ = 50 yr. C) The frequency of appearance of particular differences. The thin black line represents the central point of the first part of the 68.2% interval;
thin gray line: central point of the second part of the 68.2% interval; thick black line: central point of
the first part of the 95.4% interval; thick gray line: central point of the second part of the 95.4% interval.
14
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326
A Good
Point
Estimate
of the Calibrated
C
14
Date
?
401
Because confidence intervals for calibrated C dates are often divided into several parts, the abovementioned point estimates were then calculated separately for each part. During the calculation,
only 2 of the most important parts were taken into account: the most probable part of the confidence
interval (which we term "the first part") and the second most probable part ("the second part").
14
Analysis was done for 68.2% and 95.4% confidence intervals and the assumed C date uncertainty
of 50 yr. The results are presented in Figures 6 and 7. We can see that the differences for the mode
of the first part of the interval (Figure 6) are almost the same as the differences for the main mode
(see Figure 3), because these modes are usually equivalent. The mode of the second part of the
68.2% confidence interval estimates the true calendar age of the sample for usually incorrectly, but
sometimes it gives a correct value for the true calendar age of the sample (see Figures 6A and 6C).
Figures 6B and 6C show, however, that the mode of the second part of a 95.4% confidence interval
estimates the true calendar age incorrectly. The results presented in Figure 7 show that the central
point of the first part of the 68.2% and 95.4% confidence intervals is a poor estimate of the true calendar age (due to the rather large differences between the true and estimated calendar age of the
sample), and the central point of the second part is completely wrong as an estimate of the true calendar age.
14
CONCLUSIONS
The results of the simulation unambiguously show that a very good method for point estimation of
the true calendar age of a sample does not exist. For all the tested parameters, one may observe differences between the true calendar age of a sample and the value of the calendar age estimated using
the parameter. Therefore, the best method for presenting the calibrated C dates is to use the whole
probability density function.
14
If it is really essential to use a point estimate for the calendar age of the sample, then the mode (the
value of the calendar age that corresponds to the maximum of the probability distribution of a calibrated C date) may be accepted as a point estimate, but we should remember that important differences between the mode and the true calendar age of the sample appear for some periods, which are
characterized by a specific shape of calibration curve (see Figure 4). The use of other parameters for
the point estimation of calendar age may lead to wrong conclusions.
14
ACKNOWLEDGMENTS
This work was supported by statutory funds of the Silesian University of Technology BK-2006.
REFERENCES
Aitchison TC, Leese M, Michczynska DJ, Mook WG, Otlet RL, Ottaway BS, Pazdur MF, van der Plicht J, Reimer PJ, Robinson SW, Scott EM, Stuiver M,
Weninger B. 1989. A comparison of methods used for
the calibration of radiocarbon dates. Radiocarbon
31(3):846-64.
Michczynska DJ, Pazdur MF, Walanus Α. 1990. Bayesian approach to probabilistic calibration of radiocarbon ages. In: Mook WG, Waterbolk HT, editors. Proceedings of the 2nd International Symposium C and
Archaeology, Strasburg. PACT 29: 69-79.
Pazdur MF, Michczynska DJ. 1989. Improvement of the
procedure for probabilistic calibration of radiocarbon
dates. Radiocarbon 31(3):824-32.
Reimer PJ, Baillie MGL, Bard E, Bayliss A, Beck JW,
14
Bertrand CJH, Blackwell PG, Buck CE, Burr GS, Cutler KB, Damon PE, Edwards RL, Fairbanks RG,
Friedrich M, Guilderson TP, Hogg AG, Hughen KA,
Kromer B, McCormac G, Manning S, Bronk Ramsey
C, Reimer RW, Remmele S, Southon JR, Stuiver M,
Talamo S, Taylor FW, van der Plicht J, Weyhenmeyer
CE. 2004. IntCal04 terrestrial radiocarbon age calibration, 0-26 cal kyr BP. Radiocarbon 46(3): 1029-58.
Stuiver M, Reimer P. 1989. Histograms obtained from
computerized radiocarbon age calibration. Radiocarbon 31(3):817-32.
Telford RJ, Heegaard E, Birks H JB. 2004. The intercept
is a poor estimate of a calibrated radiocarbon age. The
Holocene 14(2):296-8.
Downloaded from https:/www.cambridge.org/core. IP address: 88.99.165.207, on 12 Jun 2017 at 00:03:29, subject to the Cambridge Core terms of use, available at
https:/www.cambridge.org/core/terms. https://doi.org/10.1017/S0033822200042326