Download Fergany, Nader.; (1970)On the macro-dynamic stochastic treatment of the size and age structure of a human population."

Document related concepts

History of statistics wikipedia , lookup

Probability wikipedia , lookup

Statistics wikipedia , lookup

Transcript
I
I.
I
I
I
I
I
I
I
,_
I
I
I
I
I
I
I.
I
I
ON THE MACRO-DYNAMIC STOCHASTIC TREATMENT OF THE SIZE
AND AGE STRUCTURE OF A HUMAN POPULATION
By
Nader Abd-Elmaksoud Fergany
Department of Biostatistics
University of North Carolina at Chapel Hill
Institute of Statistics Mimeo Series No. 666
February 1970
I
'.
I
I
I
,
I
I
I
,_
NADER ABD-ELMAKSOUD FERGANY. On the Macro-Dynamic Stochastic Treatment
of the Size and Age Structure of a Human Population. (Under the
direction of H. B. WELLS.)
Macro-dynamic stochastic formulations of the size and age structuIe
of a human population are considered.
No restrictive assumptions are
placed on the physical process of population change and demographic variables are assumed to have the simple stochastic structure of a constant
depending on time plus a stationary random error component.
Probabilistic characterizations are obtained for the population
size at a point in time under discrete and continuous probability structures, and the probability distribution of the number of individuals in
two different age groups at a certain point in time is described by its
lower moments in general and in the special case of an underlying multi-
I
I
I
variate normal distribution.
I
of a sample of independent observations are biased and that it is possible
I
I
'.
I
t
The theory developed is illustrated using
data for the population of Egypt.
The problem of estimation of a certain function of joint moments
of a multivariate distribution using a sample of correlated observations
is discussed briefly.
It is shown that the usual estimators in the case
to arrive at unbiased estimators in the case of m-dependent processes.
The complexity of the variances of estimators considered prevented a
more conclusive study based on the mean square error criterion.
A Monte
Carlo study of a special case in small samples indicates that the degree
to which an estimator uses the sample information is the major factor in
determining how well the estimator performs in terms of mean square error.
I
I .•
I
I
ON THE MACRO-DYNAMIC STOCr~STIC
TREATMENT OF THE SIZE A~~ AGE
STRUCTURE OF A Hill-1AN POPULATION
I
t
,
by
I
I
,_
I
I
I
I
I
I
I.
I
I
Nader Abd-Elmaksoud Fergany
A thesis submitted to the faculty of
the University of North Carolina at
Chapel Hill in partial fulfillment of
the requirements for the degree of
Doctor of Philosophy in the Department
of Biostatistics
Chapel Hill
1970
I
Ie
I
I
t
,
I
I
,I
I_
I
I
I
I
I
I
Ie
I·
I
BIOGRAPHY
The author was born August 20, 1944, in Gieza, U.A.R.
He
graduated from the Saidia high school in Gieza in 1959.
He received the Bachelor of Science degree with a major in
applied statistics from the Faculty of Economics, Cairo University
in 1963.
He then joined the Faculty as an instructor.
Since
October, 1965 he has been on leave from his position pursuing
graduate studies in statistics at the University of North Carolina.
The author is married to Moushira Khattab of Cairo, U.A.R.
J
.'I
ACKNDtVLEDGME1TTS
The author expresses his appreciation to members of his advisory
committee, Professors A. Hawley, D. Quade, M. Sheps, H. Van Der Vaart
and H. B. Wells.
Professor Wells endured the arduous task of advising the author
for more than three years and all members of the Committee gave valuable
assistance and counsel.
The financial support of the Population Council and the Carolina
Population Center
~s
gratefully acknowledged.
Mrs. Gay Goss typed the dissertation beautifully and efficiently.
Finally, it gives the author
~reat
pleasure to acknowledge the
significant contributions of the many wonderful people who were associated with
the Faculty of Economics at Cairo University during his years
of study and work there to his education and edification.
I
I
t
I
,
t
••J
I
,
I
I
I
.1
I
.,
TABLE OF CONTENTS
Page
LIST OF TABLES .
vi
LIST OF FIGURES.
vii
Chapter
I.
INTRODUCTION AND REVIEW OF LITERATURS. .
1
1.1.
1.2.
1
2
2
8
1.3.
II.
POPULATION SIZE . •
2.2.
2.3.
2.4.
III.
Theoretical Formulation, Discrete Probability
Structure. . . • • • • . • . • • •
A Discrete Error Probability Law . .
Theoretical Formulation, Continuous
Probability Structure. •
An Example
.•••
AGE STRUCTURE. •
3.13.2.
3.4.
IV.
Introduction. • . . . • • • • .
..• .
Review of Macro-Analytic ~fude1s in Demography.
1.2.1. Deterministic Models • • • • • • • •
1.2.2. Stochastic Models • • . • • • • • • • •
Objectives of the Present Study and Approach
Adopted. •
• . . .
Introduction
Statistical Formulation.
3.2.1. Introduction ••
3.2.2. The Multivariate Normal Case •
A Discrete Approximation to the Human
Survivorship Function.
An Example . . • • • • •
SOME ASPECTS OF THE ESTIMATION PROBLEM •
4.1.
Estimation of a Certain Function of Joint
Moments of a Multivariate Distribution Using
the Same Function of the Joint Moments of a
Sample of Dependent Observations • • •
13
15
15
20
27
30
37
37
42
42
45
46
48
52
52
I
v
Page
Chapter
4.2.
A Monte Carlo Study of Some of the Properties
of Five Estimators for the Variance of the
Product of the Two Components of a Bivariate
Normal Time Series in Samll Samples . •
4.2.1. Introduction
4.2.2. Procedure ••
4.2.3. Conclusions.
.
.
V.
....
60
60
63
66
SUMMARY AND SUGGESTIONS FOR FURTHER RESEARCH .
69
5.l.
5.2.
69
71
SlJlTIIO.ary • • • • • • • • • • • • •
Suggestions for Further Research .
Appendices
A.
DATA AND INTERMEDIATE CALCULATIONS FOR SECTION (3.4) .
73
B.
COMPUTER PROGRAM . .
78
C.
NUMERICAL RESULTS OF THE MONTE CARLO STUDY
87
LIST OF REFERENCES • • • • • • • • • • • . .
92
-'I
I
t
I
I
I
I
-.
J
I
I
I
I
I
_I
.-
I'
I
I
Ie
f
I
I
LIST OF TABLES
2.2.1.
I
2.4.1.
I
3.4.1.
I
·A-l.
I
,-
A-2.
A-3.
t
A-4.
I
I
A-5.
I
I
I
Ie
I
I
Page
Table
C-l.
Comparison of the Discrete Error and Syrr~etric
Binomial Distributions for k = 1,2,3 . • . .
26
Estimated Probability Distribution of the Population
of Egypt in Thousands on 9/30/1960 . . • • • •
35
Estimated Male Births in Thousands and Survival
Probabilities. • • • •
50
Birth Rate per 1000 Population Corrected for
Under-Registration, Egypt (1940-1959).
73
Estimated Mid-Year Population in Thousands,
Egypt (~940-1959). • . • •
• ••••
74
Reported Death Rate per 1000
(0-4), Egypt (1940-1959) .
75
}~les
in the Age Group
. .•.•.
Average Ratio of Single-Year Death Rates to the Death
Rate of the Age Group (0-4), Egypt, 1937, 1947 and
1960 Life Tables .
• • • • •
••.•• • • .
76
Ratio of Crude Death Rates Corrected for UnderRegistration to Reported Rates, Egypt (1940-1959).
77
Relative Mean Square Error of the Five Estimators ••
87
I
.'I
LIST OF FIGURES
Figure
2.4.1.
Page
Annual Rate of Natural Increase,
Egypt, 1947-1959 . . . . • • . . • • • . • . . . • . . .
34
I
I
I
,I
I
I
••1
I
I
I
I
I
eI
I
I
I
'.
I
I
CHAPTER I
II
INTRODUCTION A1'n REVIEH OF LITERP.TURE
I
J
I
I
J
--
I
I
I
I
I
I
1.1.
IntroG'Jctic.d
The use of
math~natics
as a
mediu~
of expression in a certain
branch of human knO\vlcdgc. is considered by some to
of an eXact f.:ci.ence. g::owing out of that brallc.:h.
b(~
a sign of the davm
Vie,vcd in that light,
the incre.:'sed use of mathematical models in dCinography in recent year" -
as means of both enr:tching the theory and deriving sound and ,\\lorkable
methodologies - is a definite manifestation of the advent of an era of
sc:ienU.E:ie 8cphisticLdon in th:L:; arCH.
That consideration of stochastic varie.tJons in a Tllodel improves
its
resem'l:llanct:~
to reality and thus increases its utility is indicate!'!
by Mindel C. Sheps [27J in the special case of the study of patterns of
human reproduction:
"\\hile
wan or expf:c:tcd v2.1ucs given by proh2hiJ··
j1
istic model.s agree with those of deterministic
n~dcls
based on suffi-
ciently simile.T assumptions, stochastic models offer insight into the
effects of chance and into the variations that occur natu'rally in the
biologicaJ system."
However, we must not lose sieht of the fact that
the role played by pure chance variations in the problems of human
population c1Jsn.ge tends to be small since the number of individuals
considered is usually very large.
J.
I
I
Unfortun.1.tely, each of the existing m;1cro-analytic stochastic
models of population change suffers one or more of the following
comings:
~;hort­
I
.1
I
I
it,
2. couplicGl t e.d
of needed
p<'4r<,.mc,t~:rs
li,O th'l1T,,'.~ U.cal
d~,.t:iv~1tions,
[rom ac:tual data \vhich
81"C
:1
,I
both
ji:1P(11.tWlt
I
I
and
chal1engil1g.
.
~I
.,
1.2. .P..c'yjE::z..s~LJ1f.9:Q=AT:"Q.:1:.Y.t1_c:-11o d f;:2.::s jE__ Q~'Jg<?Zl.:~~.rhy
O~·
continuo'ls time and age
variabl(~~;.
The cle.ssic of discrete time deterudI1istic fornl'..llntions of t:kc
problem is that of
b~;cat1se
briefly
]~es15.e
[l9] and
it .serves as the
v.T€~
shall
st~1.rti!lg
):C.ViE··....'
it!::
ba~;ic [.,t·rl:cL.;re
pDint of alT'lost all ege-
specific discrete time stochastic treatments of the problei 1 of populn1
tion change.
Leslie considered only the female popt1letiOll and assu,n2d th;::L Dge-·
specific rates are independent of time.
He also made thz follcMing
definitions:
l1
xt
- the number of femaler; alive
hl th(~
age group (x, x+l]
at t.ime t)
P
x
"" the probability that a female in the ;:-.go group (x, x+ 1]
I
I
I,
,
I
I
.1
I
I
I
3
I.
at time t will be alive in the age group (x+l, x+2]
I
at time t + 1, 1
I_
I
I
I
I
I
I
I.
I
I
Px
~
0,
F = the number of daughters born in the interval (t, t+1]
x
I
I
I
I
I
I
~
per female alive aged (x, x+l] at time t, who will be
alive in the age group (0,1] at time t + 1, F
> 0,
x-
and
F
M=
o
o
o
...
m-1
F
m
o
o
o
o
P
o
m-1
where (m, m+l] is the last age group attainable in the population.
It then follows that
~+1 =~
where
Et
• • • (1.2.1.1)
is an (m+l) column vector giving the female age distribution
at time t, or
n
where t
=0
t
= Mt --0
n
• • • (1.2.1.2)
stands for some initial time period.
Leslie then notes that if (k, k+l] is the last age group in which
reproduction occurs, F
x
= 0 for x > k and M may be partitioned symmet-
rically at this point as
whence
4
+
[-f(;'::cl J ~j
t
=
M
t
But, since C
=0
for t ~ m - k, he concluded that it is the submatrix
A which is of primary importance and gave his mathematical treatment
in terms of it.
The prinicpal continuous time deterministic formulation is due
to Alfred J. Lotka [21], [22], [23].
Since it will serve as the basis
for our stochastic treatment, some aspects of Lotka's theory that are
relevant to this treatment will be reviewed here.
Let x denote age, x
~
0,
t denote time,
N(t) be the number of individuals in the population at time t,
c(x,t) be the proportional age distribution function at time t;
a
i.~., J
2
c(x,t) dx
= the
proportion of individuals in the age group
00
J
c(x,t) dx
= 1,
o
B(t) be the number of births in the population at time t and
bet)
= B(t)/N(t)
be the corresponding birth rate,
D(t) be the number of deaths in the population at time t and
d(t) = D(t)/N(t) be the corresponding death rate, and
p(x,t) be the probability that an individual born at time
(t-x) will survive to be of age x at time t.
Considering a closed population,
1
and using the terminology
1A closed population is one that increases only through birth and
decreases only through death.
.
I
.1
I
I
I
I
I
I
I
_I
J
I
I
I
I
I
.'I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
'I
J
I
J.
I
I
5
presented above, Lotka derived the fo11mV'ing relations
co
N(t) =
J
B(t-x) p(x,t) dx
• • •(1.2.1.3)
o
B(t-x) p(x,t)/N(t)
c(x, t)
• • .(1. 2.1. 4)
He then remarked that if the survivorship function p(x,t) and
the proportional age distribution function c(x,t) are mathematically
independent of time, both the birth and death rates of the population
will be constant and so will be the rate of change in the size of the
population.
If the birth and growth rates are denoted by band r
respectively, it follows that
N(t) = N(tt»
e
rt
.B(t) -- B(ta) e
rt
D(t)
e
rt
• (1.2.1.5)
where to is some initial time point.
It also follows, from relation
(1.2.1.4) above, that
c(x)
b p(x)
erx
• • • (1. 2.1. 6)
We observe that all functional forms of the age structure presented above depend fundamentally on the survivorship function p(x,t).2
In the following discussion, we shall be concerned with one generation real or synthetic - survivorship function, or a p(x,t) that is constant
2It should be noted that p(x,t) is rather complex. It is not a
"generation" survivorship function in the sense of a pure death process
describing the mortality experience of a cohort of births; nor is it a
"period" survivorship function describing the age-specific mortality
experience of a certain population at some point in time applied to a
synthetic cohort of births. Actually, p(x,t)V x at tDne t describes
segments of the mortality experience of as many birth cohorts as there
are ag~ points at time t, with the mortality experience of younger ages
represented more often than that of older ages.
6
over time; hence the argument
will not be displayed.
t
In addition to the prominent role the survivorship function plays
in det.ermining the age structure, it determines a life table completely.
For example, the life expectancy at age x, e
x
say, is given by the
expression
ClO
ex = (
J p(z)
• . . (1.2.1. 7)
dz)/p(x)
x
Consider a pure death process in which a cohort of N(O) births
goes through life until the last member of the cohort dies.
Let N(x)
denote the size of the cohort at age x (in such a process, age and time
since the start of the process coincide, of course) then the survivorship function is given by p(x)
= ~~~~
If the instantaneous death rate at age x,
~
~(x),
is a constant,
I
.1
I
I
I
I
I
I
I
_'I
say, then
-d
a~x)
= -
~ p (x) which gives the well-kno\im negative exponen-
tial relation
p(x)
on using the initial condition p (0) = 1. [7]
• • • (1.2.1.8)
The last relation defines
what might be called a constant-attrition survivorship function.
However, a survivorship function that describes human mortality
conditions cannot be expected to follow the constant-attrition survivorship function, for the risk of human mortality is a function of age.
The most general approach to a human survivorship function is to consider the probability of death as a function of age,
~(x),
in which case
I
I
I
I
I
I
.1
I
I,
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
7
we have
E.-Pd;Z-(x)
condition p(O)
=
"
' il.la
' 1
= - II (x) P (x) glv1ng
- on ut1'1"1Z1 ng t Ile 1n
1,
x
p(x)
=
expo (-
I ll(~)
ds)
• • • (1.2.1.9)
o
It remains, of course, to determine the mathematical form of ll(x).
The earliest attempts to find a general mathematical expression
that
i·~·,
descri~es
the human survivorship function aimed' at it directly;
considered p(x) or N(x) rather than ll(x).
De Moivre conjectured
in 1725 that p(x) decreases in an arithmetic progression.
Gompertz introduced his famous "Laws of Mortality." [8]
which was later modified by Makeham in 1860 to ll(x)
In 1825,
ll(x):: 13yX
= a + Byx. [7]
However, this law was not applicable at all for ages before tr.alurity.
Later developments - surveyed by Frechet [13] - started with ll(x) or
a variation thereof.
Two remarks are in order here.
First, most later attempts did
not aim at a law of mortality that is based on a general theory regarding the underlying biological process but rather at fitting mathematical expressions to observed age specific probabilities of death.
This resulted in expressions that were too complicated formally and/or
too specific to yield the analytic advantages that would result from
a general mathematical expression.
Second, although the reasons are
clear - namely, the complexity of a general expression that describes
human survivorship for all ages - the shift of the attention of
researchers in this area from p(x) to ll(x) and from starting with an
I
8
und~rl yi Db theory to fit Un:", appropri ate fune tional [orw:; to observed
lo,;~;
p(x) values is nnftn-tuDc.te and represents
[l
in the dt'gl'('(' of
generali ty of the re.sultinG exprcs,d ons.
1'or not much faith
('.pn
~IC
placed i.n an empi.d cal fitting that is not backoG by an acc(1"ta1110
theory; and - clearly .- if
).1
(x) represent" I-he probability of deatll in
an interval 6x starting at age x, then Jl ();) is a monotonicall y nondecreasing function of t..x and the fitted form - although it may not
chDuge basically - will depend on the age units used in the fitting
process.
Nore recently, typical patterns of mortality have
be~~ll
through the presentation of serie's of "model" life tables.
.
1. 2.2. Stochastic Nodels.
c):ist for
clD~;sical
debc,ihed
[8]
Although mathCrfi3 tical fonaulnti
birth and dc::ath processu; tk; t
ghTt'~
the population as a ftlnctton of ti.me, the syt';1:eJ11s of
or,~.:
the size of
equati(lm~
resul t--
ing \olhen birth and de.ath rates are allowed to he ge.neral functions of
time are not easy to solve and practical applications an. c:umberse.·l'1c.
[3L
[12]
The first age-specific stochastic model of population change is
ascribed to Bartlett. [4]
age groups.
Bartlett used discrete time and discrete
He considered females only, assuTIied fertility and mortalit.y
rates to he constant over time -and wi.thin each age group} and derived
asymptotic forms for the linear and quadratic moments of thc num1H'r of
females in an age group when the age groups and time intervals \-.'ere
both made small.
Three years later, Kendall
[17] discussed an age-specific con-
tinuous time stochastic model of population changr~.
Kf";nc1all formulated
.1
I
I
I
I
I
I
I
-,
I
I
I
I
I
I
.1
I
I
I
9
I.
the problem using N(x,t) to describe the state of the population at
I
time t in the sense that the Stieltjes integral'
J
I
I
I
I
I
'I_
x
I
xl
the number of individuals in the age group (x ,x ].
l 2
2
dN(x,t) is
He admitted that
distribution problems are extremely difficult but found it possible to
arrive at some of the moments.
Under the following assumptions:
1. one sex
2. the subpopulations generated by two co-existing individuals
develop
in complete independence of each other
3. the birth rate varies with age but not with time
4. the death rate does not vary,
either with age
or with
time, Kendall developed equations for
E (dN (x, t) ) ,
Var (dN(x,t», and
I
:1
:1
:1
'I
I
I.
I
I
Cov (dN(x,t), dN(y,t».
The equations are difficult to
solve, except in some unrealistic special cases.
For example, in his
paper, Kendall solved the case in which both the birth and death rates
are independent of age and time.
Simpler methods exist for solving
this problem now. [12]
More recently, age specific stochastic models of population
change were formulated in terms of discrete time and corresponding discrete age grouping.
Two formulations are worth mentioning here.
The
first, due to Pollard [24], revived interest in this type of model and
the second, due to Sykes [28], contains one of the most general treatments of the problem available.
10
Pollard considered females only, restricted his age distribution
to prercoprodl1ctive and reproductive ages
Cllld
DssUlUed thnt
1. fertility rates and survival probabilities in Leslie's
formulation represent binomial probabilities that are independent of
time,
2. a certain individual's death or procreation is independent
of other individuals' death or procreation.
Then using Leslie's notation, he derived an expression for
e
the vector of expected number of females in the (k+l)
-t
age groups (0,1], (1,2], •.. (k,k+l] at time t and
c(t) the vector that has as its elements the variances and
,
covar1ances
0
f t h e num b er
0
f f ema 1 es i n t1e
1 1. th and'J th age groups at
time t, c (t) say, l'1ste d in a dictionary order according to the subij
scripts i and j
(for i f: j,
ci~)
and
c;~)
are both listed).
I
.1
I
I
I
I
I
1
I
_I
This expression is
(~n)) - (-:-
I
1
0
I
I
I
A 9
=(~A-::A)n~O- :(0)-+- ;nj=l
• • • (1.2.2,1)
where B is a (k+l)2 x (k+l)2 matrix whose elements are functions of
F
x
and P , x
x
= 0,
1, 2, ••• , k, and A
matrix A by itself.
moments of '!!'t'
e.
A is the direct product of the
These results were generalized to higher order
I
I
I
I
I
_I
I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
11
It is in order here to point to the tremendous computational
effort involved in applying (1.2.2.1) for a moderate number of age
groups, if ,,-,e decide that the simplifying assumption on which the above
results are basod are tolerable.
Sykes also restricted attention to (k+l) age groups at or below
the limiting age of reproduction and defined stochastic versions of
Leslie's model by assuming respectively that:
1. the deterministic model is subject to additive random
errors,
2. the elements of the transition matrix A are binomial
probabilities,
3. the transition matrices are random variables.
If A
t
is the' Leslie transition matrix for the time interval
(t,t+l], Sykes shows that in all three cases considered
n ) = ( 1T
E(=t
t-l
A.)
i=O].
Eo
= fl
-t
• . • (1.2.2.2)
We give below some of hi.s further results together with the assumptions
required for them to hold in the three above mentloned cases.
• •• (1.2.2.3)
(1)
where {£ } is a sequence of random vectors with
-t
E(_€t)
=0
and Cov (£
-s , -£t)
= r s,t
s, t
0, 1, 2, ...
whence
Var (n
)
-t
=
L
t-l
j=O
( 1T
t-l
k=t-i
• • . (1. 2 •2.4)·
12
Further, if {~1:} is a normal stochastic process,
as a multivariate normal with mean
Qt
_
~ -~)
,
-1
Vt
}l-t
<It-t; -
3
t.hen !It is distributed
and variance Vt.
~)
Hence
2
is distributed as Xk+l
· • . (1.2.2.5)
which may be used to place confidence intervals on
(2) Assume that individuals in the i
th
~.
age group give birth cmd
die with probabilities tbi and (1 - tSi) respectively at time t and that:
all events result from mutually independent binomial trials.
This is
siulilar to Pollard's formulation except that Sykes allo'lt!s the transi-·
tion matrix to depend on time.
The variance matrix of
~
is given by
the relation
• • • (1. 2 . 2 .6)
I
.-I
I
I
I
I
I
I
_I
where
= (t a .. (1 - ta..»
~J
~J
(3)
· • . (1.2.2.7)
where At is a known Leslie matr~x and' {6
t
matrix random
variabl~s
with E (6 )
t
}
is a sequence of independent
= 0 and Var (6 t ) = St.
case, Sykes gave the variance matrix of
~
3i.~., for any finite subset (t l , t z'
In this
as
... ,
sequence of nonnegative integers, the vector ~'
has the n(k+l) - variate normal distribution.
t ) of indices in the
n
=
,
~
1
E'
-t ,
2
£'
••• , --1:
)
n
I
I
I
I
I
I
.1
I
I
I
13
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
vt =
t-1
L
t=O
t-.Q,-l
( 'IT
i=O
t-t-l
Ai) Pt (
'IT
i==O
A.) ,
1.
. • • (1.2.2.8)
where
P
t
is a (k+l) x (k+l) matrix whose ijth element is given by
and
Again, we can easily see that, even if we accept the various
restrictive assumptions of the above formulations, it is very difficult
to put their results to reasonably detailed practical applications.
As a case in point, in the example Sykes gave in his paper, only three
age groups were considered and the transition matrix did not vary with
time.
1.3. Objectives of the Present Study and Approach Adopted
The purpose of this study is to investigate some aspects of the
mode of change in the size of a human population and its age structure
under simple and realistic stochastic set-ups with the hope of producing
practical methodologies for the demographer who wishes to incorporate
stochastic variations in his calculations.
The line of development of
the stochastic formulations employed, which is a stochastic extention
of some of Alfred Lotka's work [23], is not directly related to any of
the existing macro-analytic stochastic models and the resulting formu1ations do not suffer from most of the shortcomings of the previous ones
summarized in (1.1).
There is some conceptual difficulty in a stochastic treatment of
14
lnnnan population change in
3
certain society since in reali.ty we have
only one such society and it i.s difficult to imagine hm,T that society
could be subject to chance variations.
HOHever,
observed pattern of population change as a
from a
tlniv.~rse
of possible patterns.
VIC
s~uple
may consider the
I
.-I
of one observation
I
I
This is the approach adopted
I
here.
The unit of analysis considered in this treatment is a human
I
population possessing certain characteristics and subject to vital
change continually affecting the number and characteristi.cs of its
members.
This vital change is effected through two
proces~'es,
il1cremen-
tation by the birth of new individuals and imnligration, and decreruentation by the death of living individuals and emigration.
Such a popula-
tion may be described by three types of random variables expressing
forces of incrementation, decrementation, and the resulting population
as a function of time.
I
I
I
_I
Thi.s way of describing the dynamics of hUlr.an
population change is most natural and could be considered as an attempt
to theorize the underlying concepts of population change in classical
(numerical) demographic analysis.
In the present formulation, demographic variables are assumed to
have the stochastic structure of a constant depending on time plus a
stationary random error component.
Although the exposition of the
present formulation is set forth entirely in terms of human populations,
the principles involved are applicable to all other types of populations, and so are the results.
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
CHAPTER II
POPULATION SIZE
2.1.
Theoretical Formulation, Discrete Probability
Consider the probability space·
en,?,
p) where
St~~cture
n =. {w l 'w 2 ' ... }
is a set of a countable number of elements,~ is the power set of
and P is a probability measure on
Elements of
n
:P.
n are identified here with conglomerations of
relative socio-cultural conditi.ons - in the broadest sense of the word
whose effect on the factors affecting change in demographic variables
is to introduce stochastic variations as a component of their total
variation.
Although the theory presented here does not depend on the
nature of the probability
as normal-type
l
m~sure
P, stochastic variations are treated
discrete errors around an expected value determined by
overall socio-cultural cllange expressed in terms of time trends in
demographic variables.
n are
Accordingly, elements of
having a one-to-one correspondence with the class
~f
conceived of as
integers - or a
symmetric subset of it - such that they could be arranged as
n = L .. ,w_2 ,
w_ , w ' WI' w ' ••• } where Wi' i
I
o
2
=
0, ±l, ±2,
••• ,
denotes an i-unit deviation. from the expected value; and the probability measure P is conceived of as assigning probabilities to elements
li.~., symmetric around the expected value which is also the
mode of the distribution.
16
of
jJ in
the fashion of an error probability distribution about some
expected value.
It is probably more realistic to think of relative sociocultural conditions as forming a continuum, but the case where st is
countable has a simplified probability structure and for this reason is
presented first.
A continuous probability structure version will be
considered in (2.3) below.
st -+J:lwhere 0 < T < t and']{is the set of non-
Let Nd("r,e)
=
negative integers such that Nd(T,W)
n
d
(T) is the size of the popu1aW
tion at time T corresponding to wEst, which is a discrete function of
time.
For the sake of mathematical convenience, we shall approximate
Nd(T,e) by the differentiable function of time N(T,e).
st -+ A, where 0 < T < t and A is a countable set of
Let R (T, e) :
points on a finite real
inte~"Va1
such that R (T,W)
=
r (T) is the rate
W
of change in the size of the population of time T corresponding to
WEst defined by
. (2.1.1)
-1
such that R
(T,e) exists.
From (2.1.1), it follows that
nw(t)
=
nw(O) expo
~ f
t
• . . (2.1.2)
o
where t
=0
stands for some initial time point.
In order to consider the probability structure while allowing
the relative socia-cultural conditions to change with time we define
'W (e)
: (O,t] -+ st
I
.-I
I
I
I
I
J
I
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
11
such that~(T)
= WT
existing at time
functions W.
'l" E:
is the set of relative socia-cultural conditions
(O,t].
Let-;t (T) be the set of all possible such
We may now think of R(T,·) T
T
into a space of functions.:J (T, .) T
R(T,W J
1
= rW
(O,t] as mappingdt (T)
E:
(0, t] whose typical element
E:
(L) represents the functional dependence of the rate of
L
change in the population size on time and the path of relative sociocultural conditions prevailing in the time interval (O,t].
also be redefined in a similar manner.
Note that r
W
T
N(T,·) may
(T) and n
W
(T)
L
are "sample functions" in the ordinary sense of stochastic processes i f
we take our underlying sample space to be
Q.
7f
O<T<t
Now, equation (2.1.2) may be written as
n
* (t)
=
(0) exp.
n
(1.)0
(J
t
o
r
W
(L) dL)
T
¥wL E: 1: (T)
. . . (2.1.3)
I
I
I
I
I
I
I.
I
I
for relati.on (2.1.1) holds for any path W
L
replacing W at T.
(Note that r
w
s"£
(L) L E: (O,t] with III
T
(L) is a function of T only).
1"
Clearly, the sample space to consider in discussing the probabili.ty structure of r 1" (1") and n (1") is ~ (T).
W
w
However, any probability
1"
measure on &
(ct
(1"»
- the 0- field on i.. (L) - is of limited practical
interest in the problem of human population change of one society since
we would not be able to estimate it.
All the data normally available
represent only one element i~ (f (T).
A more useful probability structure results from considering
equation (2.1.2) in conjunction with the probability measure P
when it is assumed to renmin stationary over time.
~n jQ
Given the initial
18
= 0,
population size at t
the foun of the time-trend in
R(T,") - its
expected value as a function of time - and the probability law P, the
probability distribution of N(t,") could be determi.ned - or approxi-mated - making use of remarks (1) and (2) below.
Remarks:
n
(1) Let X
Al
y
where
n,
~
AI' and A
2
Al
~
X(w) = x,
A :::,;) y (x) ::: y
2
-+
1
are countable sets elements such that P(X- ) and
P(X- l y-l) are defined.
In other words,
~
~-l
X :
(n,
y
(Al ,6!I' P X ) ~ (A2'~' P X
P) ->- (AI' \..:;t-l' P X
L:::V,
.
-1
),
-1
-1
Y )
1
where P is a probability measure on the cr-field~, (p X- ) is the
probability measure on ~l induced by X, and (p X-I y-l) is the probability measure on ~2 induced by Y.
The probability law of the random variableY is determined completely by the probability law of X, or by P on ~, in a straightfoward fashion.
For, clearly,
::: p' {w}
(2) A discrete probability distribution may be regarded as an
approximation to a continuous (or a finer discrete) one with the probability mass points of the approximation located at 'some well-defined
I
.-I
I
I
I
I
I
I
_I
I
I
I
I
I
I
.-I
I
I
Ie
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
19
position within intervals of the continuous distrihution
(CJi
o
g1:0U}'S
of
mass points for the finer discrete one) and carrying the probability
measure associated with the corresponding intcl"vdls (or group of mass
points).
(3) In practice, annual rates of natural increase would be the
most reasonable estimates of rates of change in the population size
for they are usually available on a yearly basis and honce W'ould provide a sample of reasonable size over a relatively short period of
time.
But the rate of natural increase is not the same as the rate
of change defined by (2.1.J.).
For w
~
n,
let n(t+l)
= net)
r
e .
(Note that the subscri.pt
u)
is suppressed); i.£., the rate of population change is asswned to be
.
constant throughout the time unit (t, t+l].
Assuming a closed popula-
tion, the rate of natural increase for the same time period is defined
by
r'
- net)
= n(t+l)
n(t;J-~)
.(2.1.4)
=
r/2 ]
[net) e r - n(t)]/[n(t) e
=
e
r/2
-r/2
- e
= 2{ (r)
2
= r+
1
+ 3!
3
(1:)
{l
r
2 3: ('2)
2
3
5
1
+ 5: (L)
2
1
+5T (r)
2
+ ••• }
5
+ ... }
i..~.
r' > r
for
r > 0
and
r' < r
for
r < 0
• . • (2.1.5)
20
and
The absolute difference
bet,~een
rand r' is very small in the
customary range of values for these variab1es t but when we consider the
app1icati.on of these rates to relatively large
popu~ations
for a large
number of intervals such that r is constant within each interval but
may vary from one to another, we might do well to make the necessary
adjustment.
Tables of hyperbolic fUllctions would be of use in numer-
.-I
I
I
I
/1
I
I
ical conversions.· [14]
If the population is open to migration t the above4nentioned
relations hold with the only difference that r' represents the balance
of natural increase and net migration.
2.2. A Discrete Error Probability Law
From the properties of the probability measure P described at
the beginning of (2.1), it seems natural that the probability distribution of a variable like
I
R(T,·) at a certain point in time would take
the form of an error probability distribution around an expected value
determined by the time trend in
R(T,·).
An appropriately-labeled normal distribution or a binomial distribution with probability parameter
~
would serve this purpose.
How-
ever, the data we usually encounter in demographic estimation are not
commensurate with high degrees of accuracy in the sense of probability
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
21
mass concentration around the mean, which makes an alternative to the
normal di.stribution in this regard desirable.
The same observation
applies to the synnnetric binorlinl distribution.
The purpose of this section is, taking the norm;)l distribution
as a standard, to devise a simple discrete error pro1nbility di.stribution that does not display a high degree of accuracy in the sense
described above.
It is well
k.110\vn
that the sum of independent con-
tinuous uniform random variables approaches the normal distribution as
the number of components increases, hence it was reasonable to expect
that the sum of independent discrete uniform random vari.ables would
I
I
I_
approach the form of a discrete error distribution as the number of
components increases.
The probability mass function (p.m.f.) of the
sum of three sucre random variables was judged to achieve a balance
between simplicity and closeness to the desired fonn.
This p.m.f. has,
in addition, the advantage of simple calculations of probabilities and
I
I
I
I
I
I
I.
I
I
the disadvantage of a restricted number of mass points.
Consider the probability distribution
x
= -k,
-k + 1, ..• , 0, ••. , k - 1, k
• • . (2.2.1)
Lemma (2.2.1)
It follows from the relation (2.2.1) that the probability
generating function (p.g.f.) of X is
G (5)
X
= E(sx)
=
-k
2k
('t' sx)
2k+l
'-'
s
x=o
• • • (2.2.2)
22
clearly,
° and
E(X) -
2
Vsr(X) =, k 3+k
may be calculated directly or as (;~ (1) •
I
.-I
Leruma (2.2.2)
I
I
I
I
I
I
Let Xl' X be independently distributed according to (2.2.1),
2
pr{y :: y} ::
.::.J..yJ + 2k + 1
y :: -2k,-2k+1, •.. ,O, ... ,2k-l,2k
(2k + 1)2
. (2.2.3)
. The proof is immediate from first principles.
This gives the p.m.f.
of a discrete triangular distribution and may be used as the building
block for the next step ..
Theorem (2.2.1)
Let Xl' X , X be independently distributed according to (2.2.1),
2
3
m
1
=-
Iz I + 3k + 1,
ID
2
=-
Iz I +
Pr{Z
and
k, then
= z} = ~(z)/(2k+l)3 where
, Iz I ::
~(z)
=
ID
1 (m1 + 1)/2 - 3m2 (m2 + 1)/2,
3k
2
+ 3k + 1
Izl ::
z
k, k+l, ""
1, 2, ""
=°
3k
k-l
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
.23
Proof
Let
~ (z) = number of \,'ays in \·:h:!_ch xl + xl + x 3 = z, from first
principles - using Lemma (2.2.2), or expanding the p.g.f. for Z,
2k
( L
4k
(x+l) sX +
x=o
2k
(4k-x+l) sx)( L sx)
x=2k+l
x=O
L
we get the following table
~(z)
Izl
+
+ (k+2) + (k+l)
+ (k+l) +
+ (lk+l) +
+ (k+3) + (k+2)
+ (2k+l) +
+ (k+4) +
+
(k+l) + (k+2) +
o
k
I
2
(k-l) +
k
+
(lk+l)
k-2
3
+
4
+
+ (2k+l) +
k-I
2
+
3
+
+
k
1
+
2
+
+ (2k-l) +
1
+ 2
+
k+l
(2k)
(2k)
-I-
(2k)
+ (lk+l)
(2k-l) +
I
3k-l
+ (2k-l)
+ (2k+l) +
(2k)
(k+3)
+
(2k)
2
I
3k
Clearly, for Z
=0
~(O) =
2
(2k+l) + (2k ) + 2
= 3k2 +
3k + 1
k(~+ll
24
Iz I =
and for
k, k+l, ""
3k
$(2) == m
(m +l)!2
1
l
Finally, for
Izi
1,2, " ' , k-l, after some algebraic monip01a-
=
. Further,
.-I
I
I
I
I
I
I
tion we get
==
I
m (m +1)/2 - 3 m (m +1)!2
1
2
2
l
3k
3k
L:
~(Z) == (3k2+3k+1) + 2 L: i3k-z+.l)?k-Z+~.l
z=-3k
z=l
k-1
- 2 L: 1 (k-z) (k-z+l)
z==1 2
_I
3 2 3
+ 12k + 6k + 1 ;: (2k+l)
== 8k
i.e., (2.2.4) is a proper p.m.f.
Q.E.D.
We note the following:
(1) For Z == 0, m (m +1)/2 - 3 m (m + 1)/2 == 3k
2
2
1
1
2
+ 3k + 1
and hence Pr{Z == z} may be thought of as composed of three second
degree polynomials in Z as it is the case with the continuous uniform
variates.
(2) It is clear, from the symmetry of the p.m.f. around the
mean, that all the odd moments of Z vanish.
Using the p.m. f. or the
p.g.f. to evaluate the even moments directly is straightforward but
tedious.
However, it is easy to arrive at the variance as
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
25
3
Var (z) ~ L Var (X.) by virtue of independence
~
. 1
~=
(3) The constant k determines both the number of probability
mass points, (6k+1), and the variance of the distribution
(4) lVar(Z)
.!...~., k < 0z <
k +
=
0 = ;k2+k
Z
~ since
k
+ k < (k +
t) 2
which means that all the
probability mass of the distribution of Z is concentrated in the range
. (5) The probability mass concentrated in the interval (-O'z ,oz)
cannot exceed
=
P
k-l
(2k+1)-3 {(3k2+3k+l) + 2 L [(3k-z+l) (3k-z+2) - 3(k-z) (k-z+1)]/2
~l
c
I
I
I
I
I
I
I.
I
I
2
+
k+l
(3k-z+l) (3k-z+2)/2}
L
z=k
...
l6k 3 + l4k
2
3 (2k+1)
- 2k - 3
3
< 2/3
• (2.2.5)
which means that the p.m.f. for Z allows for more uncertainty around
the mean than the normal distribution.
Alternatively, we may compute the kurtosis of Z
2
+ 13k
= 13k5k(k+1)
- 1
which lies in the interval
[2.5, 2.6) compared to the normal kurtosis of 3.0.
20
(6) T<lh1e
2.2.1
comp<1rc~
the proh,.l1d lily
llI:1SS,'~>
[1ssocL·,t.c·J
with (6k+l) mass poh1Ls for th(~ SY"1~'l;;c·tric tinonial ,mel tl iscret c en.o)"
distributior,s for k == 1, 2, 3 \·Jit.h the (Gk-Il) maSs points nu~,lbunc.!
1,2, ••• , 6k+1 in order, sbO\.Jlng that the:
di~;cr8te
I
.-I
I
I
I
I
I
I
error disl:-il),llio;1
a11o\Js for more uncertainty tban the synunctri.c binomi;Jl.
TABLE 2.2.1
COMPARISON OF TIlE DISC1\ETE ElumR A!\D SYH:fETRIC Ell';OHIAL
DISTlaBUTIO:iS HJR 1<. :. 1, 2, 3
k
Mass Poi.nt
Prob3 bi}o!-_!:'y_l~:J ~'!___ ._.. __._
Q:Ls..£E.~.~e En:..or
1
2
3
13 i n..llli!.:1.0J_.CJ.__
1 or 7
.037
.0] 6
2 or 6
.111
.094
3 or 5
.222
.234
4
.259
1 or 13
.008
.0002
2 or 12
.024
.0029
3 or 11
.048
.0161
4 or 10
.080
.0537
5 or 9
.120
.1208
6 or 8
.144
.1934
7
.152
.2256
1 or 19
.0029
.000 004
2 or 18
.0087
.000 069
3 or 17
.0175
.000 584
4 or 16
.0292
.003 113
5 or 15
.0437
.011 673
6 or 14
.0612
.032 684
7 or 13
.0816
.070 816
8 or 12
.0962
.121 399
9 or 11
.1050
.166 924
10
.1079
.185 471
.313
---_._-
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I
I
I
27
Theor..c~i~?l.1'.QP!lt!lation, Continuous _troba~:iJ.it.Y....§!.rucJ:ur,=­
2.3.
Consider the probability space
line,
13
(:R,
15 ,
P) where ~ is the rea.l
is the Borel field on ~ , and P is an a.bsolutely continuous
probability measure on
J3 .
Although the theory presented here does not depend on the nature
of the probability measure P, we think of points on ~ as having the
same physical meaning as elements of
n
in (2.1) above with x
~
E:
expressing an x-unit deviation from the expected value of the random
variable X defined by P on
13
with probability density function
(p.d.f.) p(x); c;ind of P as an error-type absolutely conti.nuous probability measure, the prime example being the normal probability
measure on the Borel line.
For the purposes of this section we consider R(T,X) L
x
E:
1("
E:
(O,t],
to he a measurable stochastic process, and require that any
two distribution functions F(r
(x» and F(r (x» corresponding to
L
2
1
R(T ,X) and R(T ,X) L ' L E (O,t] differ only in location.
2
1
2
1
L
It is well known that if A
parameter set and if
£
(O,t] is a Lebesgue measurable
JE /R(L,x)/d T <
00
then - since the value of
A
an absolutely convergent iterated integral is independent of the order
of integration E{
JR(T,X) d~} = J E{R(T,X)} dT,
A
Hence, if we define V(t,x)
_
[10J
t A t
=
I rx(T)
dT, and if
o
we have
J IE(R(T,X»ldT < 00,
0
t
E(V(t,x»
=
J E(R(T,x»dT
o
. • • (2.3.1)
28
Also,
rO(T) =
~
where, we may note) E(R(T)X»
~(T)
say.
Y·le hav~,
= E(V(t,x)
Var (V(t,x»
- E(V(t,x»)2
t
f (rx(T)
= E(
o
= E( 'IjJ
(x) t)2
where 'IjJ (x) is the difference between r ('r) and r (T) "Thich is
x
O
constant
fOl:
all
Further, 'IjJ (x) is symmetric around zero
(0, t].
T €:
and has the simple form 'IjJ (x)
Var (V(t,'»
=
=>
!Y-X,
a. > O.
Hence
Cut cr )2 where 0 2 is the
x
x
• • • (2.3.2)
variance of the random variable X with p.d.f. p(x).
Now, let aCT) and b(T) have continuous derivatives in (O,t],
Pr{a(t) < V(t,x) ~ bet)}
t
=>
Pr{
J
t
a'(T)dT
~ J rx(T)
o
dT
~
0
= pr{ a' (T) -<
r (T) < b' (T)
x
-
t
f b'(T)
dT}
0
¥ T €:
(0 , t] }
since, because of the parallelism of R(T,X ) and R(T,X 2 ) xl' x 2
l
T
€:
€
(O,t] resulting from the properties of R(T,X) given earlier,
aCt) ~ V(t,x) ~ bet) holds if and only if a'(T) ~ rx(T) ~ b'(T)
¥
T €:
(O,t],
Further, if we choose
a'(T) = ~(T) - a
a, b > 0, then
and
b'(T) = ~(T) + b where a, b
€
~
,
.-I
I
I
I
I
I
I
_I
then
~t =
I
lG,
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
29
aCt)
~
EeV(t,x»
- at Bnd bet)
and, becnuse of the fact th2L F(r
T
1
~
(x»
E(V(t,x»
b~
i
diffllr only" i.n
and F(r. (x»
1:
2
location, we find that
b (1")
a'lT
Pda I Cr) -<
:.--:
l'
X
(1:) _.
< b ' (1")}
b '(1")
f
:=
dF(r " (x»
1
b
a' (-r)
= pd-a ..$. x ~ b}
f
-a
""
p(x) dx
does not: depend on '[.
To establish tp.nt 'jr
t
;:: a'lT
b
"le deULc the pr-ob':lbility space
(R(.1e,t), RC"B,t), R(P»by the follow::llg one·-to···one conespoucbncc
relations.
x E:
B
E.:::B
X"",>
{R(l,X)
-=>' {R(T,X) J.j x
R(P)(R(B,t»)
VT
£ E,
E;
V-
= Pl-{R(l,X) V
=
(O,t]
E.
R(J(. ,t)
T £ (O;t]}
x
£.
E,
V-
R(B ,t) and
£.
1" £(O,t.]}
PCB)
=
J p(x)
dx
B
Finally it should be clear that the transformation V(t,x) preserves
probability structure in the sense that
Pr{V(t,x) : x £ B}
or, more specifically,
~
Pr{R('T,x) : x
€
B, T
£.
(O,t]}
30
TI
= Pr{a(t)
t
< V(t,x) < b(t)} =
-
.(2.3.3)
TI
a b
-
For example, if p(x) is the standard normal p.d.f., then R(T,·) is
2 2
distributed as normal (rO(T), a). It then follows that V(t,·) is
2 2
distributed according to normal (E(V(t,·», a t ), and, finally, N(t,·)
is distributed lognorma11y with density
1
f(n(t»
=
---=-- expo
{_~[
n (t) v'27i-at
In n(t) -
~
(t) - In n(O) 2
v
] }
at
and
• • . (2.3.4)
variance
median
=n
2
(0) expo (2 ~ (t)
= n(O)
v
expo
2 2
2 2
+ a t )[ea t
- 1]
(~v(t»
I
.-I
I
I
I
I
I
I
_I
~
where
v
(t) = E(V(t,·).
Also, f is skewed to the right and has positive kurtosis compared
to the normal distribution. [1],[16],[18]
2.4. An Example
Data on the population of Egypt were used to illustrate some
aspects of the previous theory.
The 1947 census gave the country's population as of the 31
2
st
of
We should actually use a standard normal variate truncated to a
finite interval, ~.z., (-4,4) or (-5,5), if R(T,·) is to be considered
finite.
..
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
31
March as 18967 thousands.
This figure was cOMdclerec1 t.o hove suffered
frorn an overall over-enumeration stimulated by a rationing census "'hich
was taken in 19Q5, and a sizeable under-enumeration of children less
than ten years of age.
A correction for over-enumeration carried out
by El-Badry [llJ resulted in an estimated population of 17907 thousands.
Another correction for under-enumeration of children less than ten
years of age - using the standard demographic technique of surviving
estimated births in the ten years preceding the census date - inflated
that estimate to 19619 thousands.
This last figure.was used here for
illustration purposes.
Egypt's is virtually a closed population - 1..£., external migration is a negligible component of population change - hence the annual
rate of natural increase is a proper measure of population change.
The
series of rates of natural increase in the period (1947-1959) was taken
to estimate the rate of population change defined by (2.1.1).
The
conversion of rates of natural increase to rate of change in accordance
wi.th remark (2) at the end of section (2.1) above was not deemed necessary since the difference did not exceed 1 x 10- 6 in any year (it is
implied in this comparison, of course, that the rates of change remain
constant for a whole year).
E1-Badry [1l], gave the £0110\-1ing series
of rates of natural increase, adjusted for under-regi.stration of births
and deaths, per 1000 population.
Year
1947
1948
1949
1950
1951
1952
1953
Rate
22.5
21.3
20.4
23.8
24.6
27.0
22.5
Year
1954
1955
1956
1957
1958
1959
Rate
24.4
23.8
23.7
19.1
23.3
25.0
32
A polynomial in time was used to estimate the time trend in the
rates of natural increase.
The least squares method was employed to
estimate the coefficients of this polynomial.
An assrnnption needed for
the optimum properties of least squares to hold, namely the independence
of the stochastic elements of R(T,") over time, would not, in general,
be met by the data of the type considered here.
are, in general, serially correlated.
These random variables
In this case, least squares
esti~·
mators are still unbiased but not guaranteed to be minimum variance.
It was decided to approximate the time trend with a quadratic function of time.
The estimated least squares function was
E(~» =
with T
=0
.021531 + .0006466 T - .00004501 T
at the 1947 census date;
l.~.,
2
1946.25
• • • (2.4.1)
The mean accounted for 99.3% of the total sum of squares.
Although the linear and quadratic terms contributed only .025 and .079
respectively to the square of the multiple correlation coefficient, it
was decided to keep them in the prediction equation.
Higher degree
polynomials were excluded from consideration because the resulting time
patterns were judged to be inappropriate in terms of projecting the trend
eVen to the near future; the cubic and quartic coefficients contributed
.093 and .355 to R2 respectively.
Although the choice of the second
degree polynomial is rather subjective, it was deemed to both improve
the predictive value of the prediction equation and provide a more
interesting example than the caSe of a constant trend.
We consider next the probability structure of the rate of growth
of the population in the time period under consideration.
Under the discrete probability structure, a constant multiple of
I
.-I
I
I
I
I
I
I
_I
I
I
I
I
I
I
.-I
I
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
.I
33
a random variable that has a probability distribution of the type
described in section (2.2) above HaS superposed on the variation of the
growth rates that was not explained by the quadratic function of time.
A 19-·interval probability laY],
2-_•.£., k
= 3, \-7as used, this only detcr-
mines the degree of detail desired in the probability distribution.
The sample variance about the regression line,
u
i
.
1S
TI
2
13
2
1: u. where
1.
i=l
.
t h e 1. til res1'd ua 1 , was tak en to est1.mate
t h e vari.ance o.f the sto-
chastic component of R(T,·)
=a2
(3
2
+ 3)
= l2a 2 ;
and the step size for
the translated random variable R(T,·) was estimated by equating
TI
2
= 45956
-10
2
A
x 10
to 12a as a
=
.000619.
The fitted regression curve was taken as the center of a grid of
19 such parallel curves each .000619 units above or belm.,]' the next in
either direction (if there is one) with associated probability masses
determined according to the probability law considered.
It should be
noted that it is implied here that the random mechanism introducing the
stochastic variation around' the mean does not change with time, although
the mean itself does and that this probability structure amounts to
approximating the structure resulting from considering (p, r (T)) over
W
~he
time period (O,t] by continuous grouping of elements of
n
into 19
groups in a nlanner consistent with the structural characteristics of P.
If we are willing to project the time pattern of growth rates and
the estimated probability structure into the future, it could be used
to determine an approximate probability distribution for future population estimates making use of remark (1) in (2.1).
On the other hand,
we could stipulate future changes in the functional form of the time
·028
.026
.024
.022
.020
.018
.016
1947
1949
1951
1953
1955
1957
1959
V=I-1\R
'l... \
FIGURE (2·4· I)
ANNUAL RATE OF NATURAL INCREASE
EGYPT- (1947-1959)
•
•
w
.::--
•
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
eI
I
35
pattern of
gro~.,th
rates and retain the estimated probability structure.
As an examplc-! of the first case, the population of Egypt at the time of
the 1960 census was estimated using the 1947 adjusted census figure and
the information provided by the rates of natural increase.
of the fact that the 1960 census is referred to the 30
th
Making use
of Septenilicr,
equation (2.4.1), and the above theory and discussion, the following
results were obtained.
TABLE 2.4.1
ESTIMATED PROBABILITY DISTRIBUTION OF THE POPULATION OF EGYPT
IN THOUSANDS ON 9/30/1960
Interval
Prob~bilic!Y..
24774 - 24981
.002 915
24982 - 2519b
.008 746
25191 - 25402
.017 493
25403 - 25615
.029 155
25616 - 25830
.043 732
. 25831 - 26047
.061 224
26048
26265
.081 633
26266 .- 26486
.096 210
26487 - 26708
.104 956
26709 - 26932
.107 872
26933 - 27158
.104 956
27159
27386
.096 210
27387 - 27616
.081 633
27617 - 27847
.061 224
27848 - 28081
.043 732
28082 - 28317
.029 155
28318 - 28554
.017 493
28555 - 28794
.008 746
28795 - 29037
.002 915
I
I
36
vlith
= 26832 thousands and
mean
standD.rd deviation == 777 thousands
Under the continuous probability fonnulation, the following
results were obtained, in the nonna1 case
~
v
(13.5)
E(N(l3.5,·»
= 0.312
676
- 26832 thousands
Also, properties of the normal p. d. f. may be used to make appl.·oxi.mate probability statements about V(t,·) and, consequently, N(t,·).
In our example,
P~{25312 ~
N(13.5,·)
~ 28419} =
0.95
The 1960 census count was 25984 thousands.
When corrected for
under-enumeration of children less than ten years of age, the population
figure rose to an estimated 26896 thousands.
-.
I
I
I
I
I
I
el
I
I
I
I
I
I
I
-.
I
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
eI
I
CHAPTER III
AGE STRUCTURE
3.1.
l
Introduction
Let N(t) and ret) have the same definitions as N(t,-) and R(t,-)
respectively in (2.1.1),
B(t), bet) and c(x,t) have the same definitions as in Lotka's
formulation in (1.2.1),
N(x,t)
= c(x,t)
N(t),
m(x,t) be the birth rate for individuals of age x at the time
epoch t, and
ex t)
p,
= number
of individuals of age x at time t in the population
number of births at time (t-x) in the population
This generalized formulation of survivorship incorporates the
balance of forces of migration affecting the population in addition to
mortality conditions.
In this case, p(x,t) may, conceivably, exceed 1,
and "generalized" death rates computed in such a situation may become
negative.
However, such oddities do not introduce much logical diffi-
culty and it is in this generalized sense that death and survivorship
will be considered here.
Hyrenius.
The roots of this formulation are ascribed to
[29 J
lAll variables considered in this chapter are random variables.
The explicit dependence of these functions on elements of the underlying
sample space is suppressed in the interest of clarity because it does
not serve a significant purpose.
I
I
38
Using
th(~
ahove notation, tIle folloHj rLg relations cbscritle thl' age
structure of the population at time t:
N(x, t) = B(t--x) p(x, t)
..
. (3.1.1)
..
• (3.l.2)
-.
I
I
I
I
I
I
ClO
c(x,t) = B(t-x) P (x, t) /
J
B(t-x) P (x, t) dx
t
0
'" b (t--x) P (x, t) expo (-
J
r(t) dt)
t-x
making usc of the relation N(t)
= N(O)
t
exp (
J
reT) dT).
o
Equati.on (3.1.1) is fundamentally simpler, conceptually easier and
more a1l1enable to statistical manipulation than (3.1.2) \lhich is a generalization of the familiar fomulation of Latka and probably more
advantageous from t!he demographic analysis point of vieH since for its
purposes it is both easier and more meaningful to speak in terms of
rates of occurrence rather than the number of occurrences.
Of course,
if the total age span possible at time t is considered, then c(x,t) is
only a constant multiple of N(x,t).
The last observation points out the major source of difficulty
in determining the complete age structure of a population at a point
in time under the present formulation if all the demographic variables
are allowed to be general functions of time.
If w is the highest
t
attainable age at time t, then to determine the complete age structure
at this point in time, the history of birth and survival in the population under consideration must be available for the time period
({t-w ), t].
t
This ambitious requirement - as well as mathematical con-
venience - has led students of demography pursuing this formulation to
consider special cases in which some of the demographic variables are
el
I
I
I
I
I
I
I
-.
I
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
.e
I
39
mathematically independent of time.
If p(x,t) and bet) arc mathematically independent of time in such
a way that ret) docs not depend on time in the interval «t-b-a),(t-a)]
a
~
0, b > 0, then c(x,t) must also be mathematically independent of
time for a < x < (a+b) and we have
c (x, t) = c(x)
=
bp (x) e
-rx
a
which is Alfred Latka's celebrated equation.
shown to hold at t
-+
00
~
x :5.. (a+b)
The same form may be
if p(x,t) and m(x,t:) do not deper..d on time. [21]
A slightly modified version of this formulation requires knowledge
of the age distrib'4tion function at a point in time prior to t,
N(x,t-a),a> 0 say, and the course of birth and survival in the popu1ation in the intervening time period «t-a), t] only in order to determine the complete age structure at time
*
p (x, a, t)
=
t.
For this formulation, define
number of individuals of age x at time t in the~~pulati~
number of individuals of age (x-a) at time (t-a) in the population
x > a > 0
and note that p * (x,a,t)
and
p * (x,x,t)
= p(x,t)/p(x-a;t-a)
= p(x,t).
Then, we can write
N(x, t) = N(x-a, t-a) p * (x,a, t)
= B(t-x)
p(x,t)
x > a
x < a
}
.(3.1.3)
The last relation is exactly the same as in the previous formu1ation and is written in the usual demographic practice as
40
00
=(
J m(s,
t-x) N(s, t-x)
d~)
p(x,t).
o
This is the model of component proje.ctions which is clearly qui.te
practical in a deterministic set-up.
This is so because it does not
require hard-to-get historical data.
But, on the other hand, si.nce
estimates of the age structure are available only through rather
infrequent surveys and censuses, the estimation of the underlyi.ng stochastic mechanism - in a stochastic treatment of the age structure is severely hindered.
Consequently, the distribution of N(x,t) is
usually given conditional on the initial N(x,t-a).
Data on birth and
survival are available much more frequently.
In actual applications, irrespective of the formulation used, we
are usually interested in age groups rather than age points;
i.~.,
in
the age distribution function as a set - (rather than a point) - function of age.
Hence we consider
c
bNc (t)
f
= J
= f
=
N(x,t) dx
b
c
b
B(t-x) p(x,t) dx
c.:::,a
c
N(x-a, t-a)p*(x,a,t) dx b > a
b
.(3.1.4)
Since all the functions involved in the above relations are
always non-negative, then - by the second mean value theorem - we have
c
bNc (t) = K1
f
B(t-x) dx
x > a > c > b
b
=~
B
(t-c) (t-b)
• • . (3.1.5)
I
I
-.
I
I
I
I
I
I
el
I
I
I
I
I
I
I
-.
I
I
I
.I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
.I
41
where
p(x,t) < Y
min
~l
-
b < x < c·
max
<
- b
p(x,t) and
< x <
is the
B
(t--c) (t-b)
C
number of births in the time interval «t-c), (t-b)]; or
bNc(t)
~
c
f .N(x-a,
K2
t-a) dx
b
~
K
~
K
2
• . . (3.1.6)
N(t-·a)
(b-a) (c-a)
2
where
min
b < x <
p * (x,a,t)
C
~
max
p * (x,a,t) and
N (t-a)
b < x < c
(b-a) (c-a)
is the number of individuals in the age group «b-a), (c-·a)] in the
population at time. (t-a).
K and K are usually approximated by some sort of average value.
l
2
For example, if survivorship conditi.ons are constant in the time pe.riod
«t-a),t] for birth cohorts of the time period «t-a-c), (t-a-b)] and
survivorship probabilities from age (x-a) at time (t-a) to age x at
time t are weighted by the cohort size at time (t-a), we have
c
K
2
~
~
f
b
c
P* (x,a) p(x-a) dx/
f
b
=
p(x) dx/
I
p(x-a) dx
b
L
(c-b) b
/
L
(c-b) (b-a)
x+n
where
L
n x
~
f
x
p(x-a) dx
b
c
c
I
p(x) dx
which is the regular demographic practice.
..
. (3.1. 7)
42
3.2.
Statistical Formulation
3.2.1. Introduction
In this section we discuss the naturc of the prok1hili ty c1istdl.ution of the age structure at a point in time.
For a fixed age point and a fixed timepolnt, xO' to respectively
- under both thc origi.nal Lotkian formulation and the component proj ections set-up - the problem of determining the probability dis tri.lmtion of N(x ' to) is one of either finding the distribution of the pro-'
O
duct of two random variables that cannot, in general, be assumed sto-chastically independent, or fi.nding the probabil:t ty distri.bution of a
constant multiple of a random variable.
2
Further, the probability distribution of N(x, to) is the joint
distribution of N(x ' to) for all xO' and, in general, the marginal
O
distributions - for fixed age and time points - are not expected to be
those of independent random variables.
Also, implementing the approximations discussed in (3.1), it is
clear that the stochastic treatment of the age distribution function ?s
a set-function of age is essentially the same as its stochastic treatment as a point function of age.
We shall consider the joint distribution of the number of individuals in two different age groups in the population at some point in
time.
Demographic variables will be assumed to have the simple stochas-
tic structure of a constant depending on time (a deterministic component)
2Note that in the case of using m(x,to-x ) to arrive at B(to-X O)'
O
the probability distribution of B(tO-x ) is a transformation of that
O
of m(x,to-x ) since i t is usually not possible to treat N(x,tO-x O) as
O
a random variable.
I
I
-.
I
I
I
I
I
I
el
I
I
I
I
I
I
I
-.
I
I
I
.I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
•I
43
representing the expected value of the variable plus a stationary - in
time - stochastic error component.
This restricted form will not be
required in the theoretical development but is suggested as the basis
for estimation in practical application.
The joint distribution referred to in the preceding paragraph is
a function of the joint distri.bution of a number of demographic variables that d.epends on the underlying formulation (Lotkian, component
projections or a combination of both).
This number of component vari-
ables ranges from two to four depending en the situation.
In the most
general case, Lotkian formulation, namely,
N(x1,t)
= B(t-xl )
p(x1,t)
N(x ,t) :;: B(t-x } p(x ,t)
2
2
2
we have an initial joint distribution of four random variables expressing
demographic phenomena, two for number of births and two for survival
probabilities.
These four random variables are not - in general -
stochastically independent.
This set of four components is divided into
two subsets of two random variables each,. (one for the number of births
and the other for the corresponding survival probability),- and the finaL
probability distribution is the joint distribution of the procfucts of.'
the two random variables in each of the two subsets ~
All. ather: cases.
may be deduced from this by considering degenerate. random variables •.
The distribution of the product
of two stochastically dependent
random variables is usually difficult to obtain:t let alone- the hivariate
distribution of the two two-variate products of four stochasti.c:aIly
dependent random variables.
.-
Because o:f this difficul.ty',. we shall. be'".
I
I
44
satisfied with describing the last distribution by its lower moments;
!.~.,
the means, variances and covariance of its two components.
covariance is probably of the least practical importance.
The
These moments
-.
I
I
I
I
I
I
are functions of the joint moments of the original four-variate distribution.
An interesting and important statistical problem arises in the
estimation of the joint moments of the original four-variate distribution.
The usual methods of estimation are not strictly applicable
here since the basic requirement of a sample of n independent obscrvations is not met in time series data.
However, we shall use the same
n
functions of sample moments (1
r
4
r.
Xji) r j
4 or
n
n i=l j=l
= 0,
1, 2 to estimate
j
functions of population moments E( n x. ) and the usual unbiased
j=l Ji
estimators of covariances in the normal case.
In general, let
Then
i
=
1, 2
• • • (3.2.1.1)
In actual applications, we may set out to estimate these quantities without specifying a parent distribution.
Alternatively, we may
_I
I
I
I
I
I
I
I
-.
I
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
•e
I
45
stipulate an underlying probability distribution as a basis for the
stochastic variation in the demographic variables, and use its pro··'
perties to arrive at expressions for the required litOnlents and then
estimate these, hoping that the latter procedure, being more specific,
may prove to be easier.
One such parent distribution, the multivariate
normal,3 is considered here as providing the random error componE:nt
in the variation of the demographic variables.
3.2.2.
The Multivariate Normal Case
If Xl and X are distributed identically and independently as the
2
standard normal variate, then the distribution function of their product is
; KO(Z) where KO(Z) is the Bessel functior; of the second kind
of a purely imaginary argument of zero order.
Further, if Xl and X are
2
not stochastically independent, the distribution function of their product is available in a closed form. [9]
Unfortunately, this form is
too complicated to be of much analytical use.
Limited tabulations of
the distribution function are ·available in the case of independence. [2]
Consider the four dimensional normal variable X' = (Xl X X X )
2 3 4
with mean vector ~' - (~l ~2 ~3 ~4) and covariance matrix V
i, j
= 1,
variables.
2, 3 ,4 •. Let !.'
=
(t
l
t
2
t
3
=
(Oij)
t ) be a vector of mathematical
4
Then the moment generating function of X is given by
~x(t)
== expo (1!.'!.
+ ~ !.' V !.).
Making use of the properties of ~ X <.!), we get
3 In an exact treatment, the multinormal distribution would have
to be truncated to a finite rectangular region in the four dimensional
space •
46
E (XiX. )
J
Var (X.X. )
1 J
.. + ll.1 llj'
= o1.J
2
2
+
= O..
1J +Oii llj
2
20 .. lli llj + o .. p.
1
1J
JJ
• . . 0.2.2.1)
+
0 .. 0 ..
11
JJ
i,j
and
which are sufficient to determine the lower moments of the distribution
3.3. A Discrete Approximation to the Human
Survivorship Function
An approximation to the survivorship function p(x) based on a
finite number of age groups is motivated by the following consideration.
In practice we can only observe a finite approximation to p(x) and
for most practical purposes we do not need more than that.
One such
approximation may be arrived at as follows:
Divide the possible age span (O,w] into n intervals of the form
(ai' a i +l ] ,
i = 1, 2, ••• , n where a l = 0 and a n+l = w such that ll(x)
can be considered approximately constant within each of the n age
groups.
Let the probability of death per age unit within the i
th
age
interval be approximated by qi say.
Clearly, the assumptions needed for a constant - attrition survivorship function are approximately satisfied within each of the n
age intervals considered and we have the original p(x) function approximated by n negative exponentials for
I
I
-,
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
-,
I
I
I
.e
I
I
I
I
I
I
47
n
1J (x)
where
,e
I
(a i ,a i +1]
n
= 1
i=l
a.1. < x
~
qi 1 (ai,a
Hl ]
• • • (3.3.1)
a i +1
= 0 otherwise is the indicator function of the
It then follows - from (1.2.1.8) - that
p(x)
~
m-1
exp.{-( ~ q1..(a + - a.) + (x - a )q )}
i 1
i=l
1.
m m
p (x)
n
m-1
=
1T
where am < x
~
exp.{- qi.Q,.}
i=l
= km e
e
-q (x-a )
m
m
1.
-~(x-am)
a m+1 , .Q,i = a i +1 - ai' and
• . . (3.3.2)
~
is the probability of
th
survival from birth to the beginning of the m age group.
Ie
I
I
I
I
I
I
I
1
~
1J (x) =
Note that the conditional probability of death in the i th age
interval given survival to the beginning of the interval is equal to
. • • (3.3.3)
which bears a strong resemblance to the Reed and Merrell formula
resulting from the direct assumption that In p(x)
=a +
Sx within the
age interval. [25]
Further, it turns out that qi is equal to the central death
rate
4
of the i
th
age group, d. say, which is the measure of age-specific
1.
mortality usually available, since
4Relating the number of deaths in a certain age group over a
certain period of time in a population to the number of person-years
lived in this population by individuals in those age groups and time
period.
I
I
48
-q.9J.
k. - k
1.
k.
1.
The
approxim~.tion
e
i
1
--
q.
1
-,
1. 1.
(1 - e
-a.£.
'1. 1.
)
I
I
I
I
I
I
(3.3.2) may be used to calculate life expect··
ancies rather easily, for
.
e(a. )
:;:
00
1
- .
• k·
1.
1.
p (x) dx
n
f
a.
1
n
1
= -
k.
1
L
j=i
k.
J
(j+l
a
e
-q (x-a.}
j
J
dx
j
. • • (3.3.5)
3.4. An Example
In this section we apply some aspects of the previous theory to
estimate the number of males in the t\170 age groups (0) and (1-4) ..
using the conventional census age grouping - in Egypt on January 1,
1965.
Males in these two age groups are the survivors of male births
in the period January 1, 1960 - December 31, 1964.
Yearly male births \\rere estimated using a series of crude birth
rates corrected for under-registration by El-Badry [11], a sex ratio
at birth of 105 males per 100 females and a series of estimated mid-year
population figures - using census figures adjusted for under-enumeration
of children less than ten years of age and assuming exponential grm\Tth
between census counts.
_I
I
I
I
I
I
I
I
-,
I
(
I
I
I·
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
49
Approximate single-year central death rates were derived from thE'
reported' death rate for the age group (0-4) using divisors computed from
complete life tables for the years 1937, 1947 and 1960. [6]
were multiplied by correction factors computed by relating crude death
rates adjusted for under-registration to reported crude death rates. [1]]
Finally, the resulting rates were transformed to survival probabilities
using the method described in (3.3) and separation factors of 0.75, 0.7,
0.65, 0.6 and 0.55.
Appendix (A) includes the original data and some of
the intermediate calculations.
Table 3.3.1 shows the resulting births
and survival probabilities estimates.
The next step is to estimate the time trend in the four demographic
variables under consideration and then use deviations from the trend to
estimate the moments of the underlying distribution.
We shall confine
ourselves to polynomials and use least squares to get unbiased estimates
of their coefficients.
For Xl' X and X , we fitted straight lines.
2
4
The square of the
multiple correlation coefficient was 0.845, 0.907 and 0.991 for Xl' X
2
and X respectively, and the inclusion of a quadratic term in the
4
2
regression equation did not contribute to R more than .01 in any of
the three cases.
The fitted regression lines were
----
E(X )
l
= 409.86 +
8.9I5t
-
= 0.8585 +
0.00386t
E(X )
2
E(X ) = 0.5939 + 0.01057t
4
with t
,.
I
These rates
.-
=0
at mid-year 1940;
i.~.,
1939.5.
• • • (3.4.l)
I
I
TABLE 3.4.1
ESTE1ATED
Calendar
year
~1ALE
BIR'nIS IN n!OUSANDS 1',T,;]) SUlzVl\'j\t rnor,ABIL UIES
Male births
in thous~mds
in year t
Probability
of survival
from birth
to be in
the age
group (0)
Hale llirths
in thou[;,'mds
in the fOl1l"year period
starti.ng with
year t.
at the
start of
_.
bjrth in a fouryear period
E:tiJrting \dth
year
t to be in
thQ age group
(1-./1) at the
start: of year
(t+l)
(t+5)
__._--- -----t
Pl.-oh::lhih.Ly of
survivul fro~l
X,q
Xl
1940
432
.861
1656
.597
1
426
.8611
1656
.605
2
391
.856
1702
.617
3
407
.876
1764
.635
4
432
.877
1852
.638
5
472
.871
1899
.641
6
453
.889
1907
.654
7
495
.897
1964
.661
8
Il79
.870
2002
.672
9
480
.894
2071
.685
1950
510
.901
2125
.697
1
533
.896
2159
.708
2
548
.906
2185
.723
3
534
.905
2165
.738
4
544
.918
2142
5
559
.912
2157
6
528
.932
7
511
.921
8
559
.927
9
598
.930
••
I
I
I
I
I
I
_I
{J
I
I
I
I
I
I
.,
I
I
51
t
II
I
I
I
t
For X , it is important to realize that yearly births determine
3
period births completely, and since we have expressed Xl as
=
1
I
i = 0, 1, ••. , 19
£.
1
It follows that
j+3
j = 0, 1, ••• , 15
L
i=j
= (4a + 6S) + (48)
j
+
j+3
L
E:. )
1
i=j
=
where E(e:~)
J
= 0,
E(X )
3
Ie
.-
O.
where E(E:.)
I
I
I
I
I
I
I
I
a + Si +
a' + 8' j +
e:j
therefore (3.4.1) implies
= 1692.93 +
• • • (3.4.2)
35.66t
If we assume that the observed time trends will continue for a
period of five years (which we do not have to do if we want to stipulate
different trends), and that the underlying stochastic structure about
these trends is stationary, and using the expected values of the four
demographic random variables on 1/1/1965 predicted from the estimated
regression lines in conjunction with the deviations from the lines, we
get the following results for the number of males in Egypt as of
January 1, 1965.
Age
Group
Method
0
distribution free
593.3
21.5
normal
593.3
22.1
distribution -120.2
free
distribution free
1939.5
36.3
normal -132.6
1939.5
37.6
1-4
normal
(*) in thousands.
Mean(*)
Standard (*)
Deviation
Covariance(*)
I
t
••I
,
I
I
t
I
el
I
I
I
I
I
I
I
.,
I
"
'.
I
I
I
I
I
I
I
I_
I
I
I,
I
I
I
'.
I
I
CHAPTER IV
SOME ASPECTS OF THE ESTIMATION PROBLBf
4.1. Estimation of a Certain Function of Joint Moments of a Multivariate Distribution Using the Same Function of Joint ~1oments of a
Sample ~f Dependent Observations
Consider the k-variate random variable Y' = (Y
c.d.f. Fy (')'
1
Y ••• Y ) with
2
k
Suppose we have a sample of n observations from this
parent distribution,
I 1 , Y2'
Y where
... , -n
i = 1, 2, ... , n
such that the n observations are not independent.
k
r
r
t
Let X denote the product (TI
Yo.),
in chapters 2 and 3, we used
i
t= 1 Nl.
n
k r
n
t
1
r
1
and
say, where r = 0, 1, 2,
=
m = r (TI Yti) n r Xi'
t
r
n
i=l t=l
i=l
...
t = 1, 2,
... , k as an estimator for
used (m - m mt ) to estimate
r
s
(~r
-
~
k r
r
t
= E( TI YR;") = E(X ).
r
i
t=l il-
~s ~t)'
We also
In this chapter, we inves-
tigate some of the properties of these estimators.
In the course of such an investigation, we may want to consider
i
. of _Z' = (Y'
y' ..• -n
y')
t he kn-variate distribution of the samp 1 e,. _.~.,
-1 -2
with c.d.f. F {')'
Z
Unfortunately, however, we end up with a sample
of one observation only from FZ (')'
It is easy to show that m is an unbiased estimator for
r
~
r
since
53
E(m )
r
1
r
= -
n
:;
n
J
i=l
1
r
--
n
J
x
xII
n
Y
1.
kn
fxli... xf
i=l
dF
X:
X: d
1.
ki
J...
Xu
r
x
I
F
Y
kn
t
I
except
1
r
=n
n
1
E(X:)
r
n
1.
i=l
n
i=l
...
xli
x ki
lJ r = lJ r •
Also, the calculation of the covariance of two stich estimators
m and m is straightforward.
r
s
1
Cov(m ,m ) = E{[r
s
n
=
r
n
i=l
We have
r
(X. - E(X:»]
1.
l2 {n[E(X~+S)
1.
[1:.n
n
L::
i=l
1.
1.
- E(X~) E(X~)]
n
1
r
s
+ [r
E(X~A~) - n(n-l) E(X ) E(X )]}
i;j =1
J
n
r ,X.)
S
+ 1:.2 r
s
= -1 Cov (X.
Cov (X.r ,X.)
n
1.
1.
n i;j=l
1.
J
1
r s
1
r s
= - Cov (Xi ,Xi) + (1 - _..) Cov (X. ,X.)
n
n
1.
J
1
n
rs
1
+ (1 - -) .. -a
n 1.J rs
say
l
.
.(4.1.1)
where
Cov (Xi,X j ) = ij 0rs denotes the arithmetic mean of the n(n-l)
covariances.
r
s
k
'IT
,Q,=l
r,Q,+s,Q,
~i
f
I
I
(X~ - E(X~» ]}
n
=-<1
,
.1
,
-,
1
I
,I
I
I
"
.'I
t
,I,
t.
I
J
I
I,
I
I;
I
I_
I
I
I
I
I
I
'.
I
I
54
If the n observations are independent, the second term of the
right hand side of (4.1.1) vanishes, and if any two observations have
the same dependenc.e struc.ture, hence the same distribution, then i'O-
J rs
is equal to the common covariance, 'jO
1
rs
As a special case of (4.1.1) when r
say.
1
= sl'
1 = 1, 2, •.. , k we
get
Var (m )
r
= 1:.n
=
1:.
n
Var (X:) + (1 1
0"
rr
+
(1 -
1)
n
Cov (x:,X:)
1
J
1)
.' (1
n 1J rr
• • • (4.1.2)
This is a restatement of the fact that if a set of n random variables
, are distributed such that they all have the same variance, thp.n their
.
. 2
2
average internal covariance must not be less than CO~l) where 0 is
n-
.
It follows that, in the limit, 'jO
their conW-lon variance.
1
rr
is non-
negative.
T~
situation gets more complicated >"hen ,ve consider (m 'm ) as
s t
an estimator of (~s'~t)'
To start with, the estimator is - in general -
biased for
E(m 'm ) =
s
t
11
11
t-'st-'t
+ {l
0"
+
n st
(1 -
1:.)
n
jj
0st }
• . • (4.1.3)
as may be shown either directly or as a corollary of (4.1.1) in conjunction with the relation E(XY)· = Cov (X,Y) + E(X) E(Y).
of fact, (m 'm ) cannot even be asymptotically unbiased for
s t
unless ijO"st
As a matter
(~s~t)
o.
Further, the bias term in (4.1.3) is not estimable by the usual
estimators of sample covariances.
It is easy to see that
55
-8
,.,here X = m
s
Hence
'I
e-I
I
I
n
E( 1:
i=l
I
. . • (4.1.4)
and
I
"
n
--~
(X~
= E {L
i=l
n
=~=1
L
- lls -
+~
{ast
n
S
X
-
lls)
n
(X~
- llt -
t
X
- llt) }
I
I
n
(L a
+ L
'j,a)
j=l st· jfj'=lJ
st
e,
I
I
I
I
Clearly, the bias term in (4.1.3) cannot be' expressed as a func-
We also observe that
n
E(
t
r
ifj=l
X~ X ) = n(n-1) <. . cr
~
~J s
j
t
+ II llt)
,
• , ,(/+,1.6)
s
and
n
E(
r
S
t
X X )
i=l i i
= n(a st +
~ II )
I
, , ,(4,1.7)
s t
.1
"
I
I
I.
I
1
I
I)
I
t
,'\_
S "f
lience
c:ons:ider
\';C. inDy
n
--~--- L
n(n-l)
n
1
E
n 1"'1
esti~ators
o~o
~rJ::=
1
X~
~
X~~ X~1 as
X7J
and
\18 11
no;
8
TIl
3
s
m
t
as
bia::~('.d
11 ) .
for 01~ s"'t
An a1tenlative approach to "-That He have been considcri Ilg so f;n-
stipulates certain dcpendenc'3 patterns among the. n 0bser:voUons
A pa:rticuli3rly appealing pattern \·ias considered by P. K.
sample.
[26J
or
the
Serlo
Sen extended the U"statistics method of csti.l:l;:::t:ion of functio:1al,,;
of distribution func.tions [l~] to the case of m-dependent 8tEJtiC'!~;:;yy
stochastic processes.
Let {X } be a seqUf'.nce of vector~·v<,llJed );'i'mc1om v<1ri ...bles) Sen
Il
called such a sequence I;1-dependent if the randOI'l vector (X.l ,X 21 ••• ,Xl)
is stoch2stica11y inde:pendent of the random vector (X , X '1' ... )
I
I
I
I
I
I
I.
I
I
S
S-r
whenever s-r > m and it is sai.d to be stationary if the iuint: cH:·trihc,tion of Xi' •.. , X + is independent of i for 8.11 r.
i r
Let f(X
, X
o.~
0.1
,
.... , x )
~
be a
statistic syul;netric in its nybu·
If E(£(X
0.
,
... ,
g,
1
we call f
<it
kernel for g.
Defiue h
and the
= E(f(X(Xl ,
sj'!"~£t:etric
n > m (k-·J.)
+
... , X )\0.0+1 - a. > m for j = l,2, ... ,k-l)
(ik
J
. J
estimator based on a sample of n observat ions)
k,
... ,
E f(X
S
0.1
, ""
where the s",:,mntion extends ove.r all possible (
X )
ak
• . . (4.. 1.8)
n -om (l~ ~ 1) )
sets of
k
I
57
(a ,
l
•
t
•
,
a k ) satisfying
The estimate U(X ,
l
(Ct j +l
... ,
-
Ct
j
) > m.
X11, ) is unbiased for h.
Sen sho\ved that
.t
I
i t is also consistent and, under the conditions of existence of the
second and third order moments of f(X , ... "
I
X ), that U(X ," ... ,
n
l
is asynptotically normally distributed with mean h and variance k
X
Z
n
I
)
z;
I
,I
z; •
and gave a consistent estimator for
Under the m-dependence set-up, we may formally generalize c
get an unbiased estimator for (]..I s]..I)
t·'
1
to
Consider
I
t
11,
=
1
L
X~ Xt
(n-m)(n-m-l) i~j=l 1 j
• • • (4.1.9)
Ii-jl >m
Clearly mel i.s unbiased for (]..Is]..lt) and Oe l = el'
We may also want to
consider the estimator
• • • (4.1.10)
Recapi.tulating, we have five estimators that we may use to
estimate (11 l.l ).
s t
Let
c1
=
c'
I
=
(n-m) (n-m-l)
n
Z
en-m) (n-m-I)
11,(11,-1)
'
C
z=
m(Zn-m-l)
2
n·
m(Zn-m-1)
, c'Z = n(n-l)
are related by the following relations.
1
, c 3 = -11,
then these estimators
t
••I
I
I
t
t
I
eI
I
I
I
I.
I
I
I
I
58
c'
m
m
where ijOst is the average of the (n-m) (n-m-l)
s
covariances of (X., X~) such that Ii J.
J
jl
> m
(m)
+ ij 0 st
...s ....t
-
11
11
(m)
where ijOst
E(e )
;:::
1
E(e
I
I
e
(m) 1
and, in general,
e
I
I
I
I.
I
+ c'2
• • • (II .1.1J )
I
I
I
f
I
I
e
1 m 1
z) ;: :
covariances of (X si, X:) such that Ii J
m
- (m)
c'
+
c'
+
,
0
llsllt
1 J.'jO S.t
2 J.j s t
11
11
....s .... t
= "11
....s .... t
E(e )
3
is the average of the m(2n-m-l) • • . (4.1.12)
j/
< m
-
+ 0 st
m
+ c1
,,0 t
J.J s
+
C
z
+ c3
0
st
m
under m-dependence
c Z> c
2'
C
3
,,0
~
s
t ;::: 0 and asymptotically c ; ci ~ land
l
n~
--... 0, and we can distinguish two subsets of five estimators
n-+oo
accordingly.
On one hand, we have a fundamental estimator, namely mel'
on which two other estimators, e
l
and e , are based.
3
estimators are asymptotically equivalent.
m-dependence, e
l
and e
3
These three
Since mel is unbiased under
are asymptotically unbiased under m·-dependence
59
and have the same variance as mel.
and e
2
On the other hand,
~"e
have (m) ('1
which have the undesirable property that they are ahm)'s biased;
i .. e. for all sample sizes, unless - of course -
o(m): 0 or a
ij st
st
: 0
respectively which is very unlikely •.
Clearly \>Je should be more interested in small sample results
rather than asymptotic results because the length of time series of
demographic data j.s usually restricted.
Unfortunately the nature of
the bias terms of the five estimators does not allow preci.se compari'sons.
Only general tendencies can be indicated.
For the first subset of the five estimators, mel seems to be the
choice estimator for m-dependent processes and n > m(k - 1) + k,
otherwise mel
= O.
from the sample.
However, we need to either know m or estbuate it
Even if m-dependence does not describe the dependence
structure of the process under consideration adequately, mel may still
be preferred to (m)e
l
or e
l
if there is reason to expect that
or
respectively, as is usually the case with time series.
Between the two
groups of estimators, mel may-be preferred to (m)e l for the same reasons
that is may be preferred to el' and it would probably be less biased
m
than e
Z since it is reasonable to expect that lastl >
within the second subset of estimators, (m)e
(Ul)
if
I.J.j crs t I
I
I
I
I
I
I
I
.,
I
m
I.·cr
J.J s t I
J
.1
l
I.1J,0S t I.
Finally,
would be better than e Z
< last I which is likely to be the case.
However, we must remember that in estimation from small samples,
the relative merit of an estimator is determined to a great extent by
how much use it makes of the sample information, and the five estimators
I
I
1
I
I
eI
I
I
I
I.
I
I
I
I
I
60
under consideration here use various portions of the sample.
example
For
e , e and e make increasingly more use of the sample informal
m l
3
tion, in that order.
Also, the amount of relative use of the sample
information for mel and (m)e l depends on the relation of m to n.
A still more conclusive comparison among the five estimators con-sidercd would be based on a more decisive criterion than unbiascdness
only, like the mean square error of an estimator. All indications arc,
however, that such a criterion would not be feasible analytically extrapolating from the experience with unbiasedness.
As a matter of fact the
variance ternlS needed for a study based on M.S.E. are too general and
I
I
Je
I
too complicated to be useful analytically.
Obviously, the problem gets to be considerably more complex when
we consider estimators for
t
-
~
s
~t)
based on m and the five estir
) discussed above.
In this section we use the Monte Carlo method of statistical
experimentation to estimate the mean square error of five estimators
for a special case of the estimation problem discussed in (4.1).
4.2.1. Introduction
Let
I
I.
I
I
s
r
4.2. A Monte Carlo Study of Some of the Properties of Five Estimators
for the Variance of the Product of the Two Components of a BivariateNormal Time Series in Small Samples
I
I
I
I
(~ ~
mators for
(~
2.
{~}
.!..~.
be a (p x 1) vector-valued normal time series
with
, for any subset of indices (t l , t 2 , ••• , t n ) the vectors
X
-t
.-
2
) have the np-multivariate normal distribution.
n
(,1
and
l:h
;::
Cov(X
x +1)
-t , -t
1
(1 OJ .)
1
J
i,j ;:: 1, 2,
where
La
;::
1
P 12
P 2I
1
... ,
p,
Vt
PIp
...
P 2p
1
• • • (4.2.1)
Further, let
l:
such that
ea (m+1) 9
h
a.
ea h
;::
L
ct
a
>
a
• • • (4.2.1.2)
This defines a special case of an m-dependent
multi-normal time series.
Then a subsequence of n consecutive elements of {X
t
}
variate normal with
0'
(1 x np)
and
X
-t+1
Var
X
....t+2
X
-t+n
;::
...
...
L
n-l
L
;::
n-2
l:a
l:
(np x np)
has the np-'
I
.1
I
I
·1
I
I
I
I
••I
I
I
I
I
I
.-I
I
I
62
I.
I
I
I
I
I
I
I
I_
I:
=
(n-l) a
I
-a
e
e
-a.
e
I
e
(n-Z)a.
_(n-l)a.
e
~
L for all t.
O
(n-2)a.
e
I
. . . (4.2.1.3)
In the most general problem considered in Chapters II and III, we
had p
=4
for the estimation of
components of
~,
note that this parameter is independent of t.
This
function of moments represents the covariance of the number of individuals in two different age groups.
Unfortunately, a complete investiga-
tion of this case using Monte Carlo methods would be both cumbersome
and of questionable utility silnply because of the large number of correlation patterns that we have to consider among the four components of
I
~.
ance will not be of much practical importance.
I
I
I
I
I
I.
I
I
Furthermore, as indicated earlier, it is likely that this covari-
A special case that is of considerable practical importance refers
to the problem of estimation of the variance of the number of individuals
in a certain age group.
In this case, p = 2 and the parameter is
This case will be considered here.
Equation (3.2.2.1) gives
.-
I
63
in our special case, using the conditions included in (4.2.1.1), this
reduces to
2
• . • (4.2.1.4)
V = P12 + 1
1
n
2
2 2
Let eO = -- L (X X ) be the unbiased esti.mator for E(X X ),
2i
li
n i=l
1 2
and mel' (m)e l , e , e 2 and e be defined from the corresponding formulae
1
3
in (4.1) by putting
t
= si = 1,
i
i
= 1,
2, and X.~
= X- .X2 ..
-~J..J.
The fol-
lowing estimators for V were considered
where
A
51
= eO
- mel
52
- (m)e l
53
= eO
= eO
54
= eO
55
=
• • • (4.2.1.5)
- el
- e3
A2
A A2
A
A
A
A A2
A A
°12 + °Il P2 + 2°12 Pl P2 + °22 PI + °11°22
We note that (eO - e )
2
= 0,
estimator for V although eO and e
which does not make it a useful
2
may be - under certain conditions -
2 2
2
reasonable estimates of E(XIX ) and E (X X ) respectively
2
1 2
4.2.2. Procedure
The following values of n, m and P12
n
= 5,
= P,
say, were considered
10, 15
m= 0, 1, 3, 5, 10, 15
P = -.75, (.25) .. 75 .3
.-
I
I
I
I
I
I
I
_I
denotes the usual unbiased estimator of the parameter in the
case of a sample of independent observations.
3p
.1
=±
1 were not included to keep L positive definite.
J
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
6!;
for (:;lch cc,.:·.ir13Li.on of n, m nnd p, 1000 si"rpL':'i
dent nonn".] deviates
\>/eYC
I
I
I
I
I
I.
I
I
£
;~n
in(kp~'l'-
genC'l'.'ltcd using the }Ji1:1 E:uln.ot'tinc R?NliL: La
gcnerat c untform ra'ldoLl numbcn; [30J and J. Be)]' s wn.1ificn.t ion of the
r.ox··Hullcx routjhe to transfonn them to il'.dcpc;Hc1ent
nnn~:.11 dl~vi~'.t(.s. [:i]
This siOl'nple of 2n ohservations pas tran,;fo:-nwd to a Edu;;J C' rondo:;, obs.rvatioa frori>. a 2n'-variate T!ol-ul,:11 \'lith weaD ,Q ilnd
detc:rniDcd by n,
The follm,ing
(1) l.et
J"l,
t\o70
v~n'L1fiC(' n~c,t:r.:ix
::
p and the cov3d.ance structure c1.:,sc:db8U ill (li.2.J).
remQrks "Tere use.rl· in the trans fone"i. ie"i.
Y.--.... N(O, 1 )
(1'5(1)
~,. (Pxp)
and
z "" C Y then
(pxl) (pxp) (px'I)
Z-~/ N(Q, C C')
(2) Choleski's Fact.ori:<:ation
If
L
is a symmetric positive definite matrix, thcn
(f>xp)
1: = T T' ".There
I
C
T
is a lOHet tIHtngulcr In;,tr:ix.
(pxy) )
elements c,f Tare detennined' by the follol-dng relations.
t, .
J.J
t
--
i < j
°
n =-~ Vall
= ail/t n
t'l
1.
too
J.1
=Va
ii
-
i-I 2
t ik
k=l
= 1, 2, 3,
i
L
j--l
-. (00 0 J.J
L
k=l
t, k t Ok) It.
J.
J
0
JJ
i:!-j,i,.i
... ,
P
T11,;
I
65
Obviously. the one observation from the 2n-variate normal may be
considered as a sample of n correlated observations from a bivariate
normal distribution determined by P , whose correlation structure is
determined by m, which is what we actually need.
For each of the 1000
samples. the five estimators Sl' S2' 53' 54 and 55 were computed and
then their mean and variance over the 1000 resulting values were used
to estimate their mean square error (M.S.E.).
In computing the estimators use was made of the following forms.
= 1,
i
2, ... , n, then
mel = (X 'E1 Y)/ «n-m)(n-m-l»
(m)e1
e
8
1
3
=
cr.'
=
<X' E2 Y)/n(n-l)
=
(X' (E1 + E2)Y) !n(n-l)
=
(I' J Y)/n
E2 Y)/(n(2n-m-l»
n>m+2
n <m
+
E2
ij =
2
ij
=
1~
• • • (4.2.2.1)
li-jl > m
li-j I
~m
0
i=j
0
li-j I > m
1
li-j I ~ m, i " j
I
I
I
I
I
I
I
_,
2
where J is an (nxn) matrix of l' s,
El
.1
We may point out here that it would have been possible - but
vezy:arduous - to avoid the sampling experiment performed to arrive at
numerical values for the M.S.E. of the first four estimators for the
I
I
I
I
I
I
.1
I
I
I
I.
I
I
I
I
I
I
I
Je
I
I
I
I
I
I
I-e
I
I
combinations of n,
that can
b~~
TIl
and p considE.Ted here using the g8Dcr.ll
laborious} y obtained.
foYl-=-;~;
This proc.edurc would have the
advantage of the absence of sampling errors from the H.S.E. estimates.
However, such a procedure \-1aS cor,sidered prohibitively tedious for the
fifth estimator.
Consequently, in order to enhance comparability and
save time and effort,
vIC decidf~d
•
on the Monte Carlo sinm1ation pro'-
cedure fOT the five estimators.
4.2.3.
Conclusiol~.§.
The variance -Has invariably the major component of M. S. E. of the
five estil;:ators.
study.
Table
C.l
sU1T'lnarizes the results of the Monte Carlo
It gives the Relative Mean Square Error, defined as the ratio
of the N.S.E. to the parameter value being estimated and henceforth
denoted by R.N.S.E.
This will be our criterion for comparing the fi'l'::
estimators.
The main conclusion to
be-dra'~
out of the results of the study
is that for small sample sizes (up to 15 observations) making maxhlun
use of the sample information is mOTe important than avoiding corrda··
tion structures by basing estimators on the subsets of uncorrelated
observations in the sample that form kernels for the parameter being
estimated.
This is indicated by the facts that 54 had, un i f O"!:mly,
thc~
lowest R.M.S.E. of all estimators and 52 had lower R.M.S.E. than
Sl when m was relatively large compared to n.
Naturally, the performance of the five estimators, in terms of
R.H.5.E. --, improved as the sample size increased; also, the differences
among the estimators decreased considerably.
\\Cas 14.6, 3.8 and 2.7 for n
=
The range of the R.H.S.:E.
5, 10 and 15 respectively.
In particular,
I
67
the magnittide of the relative superiority of 54 with respect to the
rest of the estimators declined as n i.ncreased.
Fut"thar, the uni.form
ordering pattern of the five estimators that held for n
~
5
ov~r
all
.-I
values of the correlation coefficient and time lag - with the exception
I
I
I
I
I
I
of 8 and 8 2 exchanging relative positions as m increased - was replaced
1
by a criss-crossing pattern that was much in effect for n = 10, 15.
The fifth estimator, based on normal theory, had - consistently _.
the highest R.M.S.E. - and with a wide margin - except for m : 15 when
Sl came close to it, for n ::: 5.
Evidently .this wns clue to the large
number of component parameters that had to be estimated in the c:ourse
of computing S5 and the non-linear form in which they were used.
How-
ever, it showed the largest proportional decline in R.M.S.E. as n
increased.
Further, it was not uniformly the worst estimator starting
with n ::: 10.
for n
= 15,
Actually, it competed with 8
moderate correlation
~nd
4
for the lowest R.M.S.E.
intermediate time lags.
l.Je ll!ay
even speculate that for larger samples, 55 will probably compare more
favourably or have the lowest R.M.S.E. as the disadvantages of a large
number of component parameters and non-linear forms get overwhelmed by
sample size as the present results indicate.
For m ::: 0,
e
(m) l
==
0 and for m relatively small compared to n,
(m)e l makes relatively little use of the sample information; hence,
(m)e
l
had a relatively high R.M.S.E. as may be expected.
for m + 2 > n
mel
==
Similarly,
0 and for m relatively large compared to n mel
makes relatively little use of the sample information.
Clearly, 8
3
tended to be dominated by Sl or S2 depending on the re1ati.ve magnitude
of m to n.
For n = 5, with S5 being uniformly worse than all other
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
68
C'stiElates, S2 and Sl exchanged the position of the scCOnd-\'7on:t esUr":l"
tor as m ix-.creased; and for n = 10, 15 they also exchang8d their l"L'h!-tive posit:ion - \olhich \-'as the \.".orst for n == 15.
The R.N.S.E. of Sl . . :Jld
52 tended to increase dY'w,tical1y in response to inc1."easps in
Ip I.
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
J
J
I.
I
I
-CI-li'..PTER V'
SUMM..A.RY AND SUGGESTIONS FOR FURTHER RESEARCH
5.1.
SUllimaq::.
Stochastic mone1s of human population change arc valuable
theoretical constructs and useful tools of practical applications.
However, available stochastic formulations are of limited use because
of over-simplified assumptions, complicated mathematics and difficulty
of applications.
In the present study, simple macro-analytic stochastic formulations of the problem of human population change over time are considered on two levels of detail, namely, total population size and age
structure of the population.
In the development of these formulations,
no conditions were imposed on the physical mechanism producing the
process of population change and the resulting methodologies are simpler
than what is presently available.
Demographic variables are assumed to have the simple stochastic
structure of a constant depending on time plus a stationary random error
component;
~.~.
the stochastic mechanism producing random variations
as a componant of the total variation in the population is assumed to
remain stationary over time.
In Chapter II, a basic probability space is defined whose elements
are identified with relative conglomerations of socio-cultural conditions that are responsible for the random component in the change of
70
the population.
A discrete error-type probability law that allows for more
uncertainty - in the sense of less probability mass concentration around
the mean - relative to the normal and symmetric binomial
is discussed.
distributio~s
Such a distribution would be of value for types of data
that are not commensurate with high degrees of accuracy as is the case
with demographic data for developing countries for exmnple.
Using characteristics of the above-mentioned probability space,
probabilistic characterizations of the distribution of the population
at a certain point in time are arrived at under both discrete and continuous probability structures.
The theory discussed is illustrated by
data for the population of Egypt.
The
proble~
of the probability distribution of the age distribu-
tion function is considered in Chapter III.
A generalized formulation
I
.-,
I
I
I
I
I
I
_I
of survivorship from birth incorporating net nigration is used to present the age distribution function formulations of Lotka and the model
of component projections.
The probability distribution of the number of individuals in two
different age groups at a certain point in time is described by its
lower moments,
~.~.,
the means,variances and covariance, in general and
in the special case of an underlying multivariate normal distribution.
A numerical example is given using Egyptian data.
Because demographic data needed for the application of stochastic
models of population change are usually in the form of time series,
classical methods of estimation are not strictly applicable for the
samples available are not - in general - composed of independent
observations.
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
1_
I
I
I
I
I
I
I.
I
I
71
An investigation of some aspects of this estimation problem is
carried out in Chapter IV for the case of a certain function of joint
moments of a multivariate distribution.
It is found thnt estimators
usually considered are biased and that it is possible to define unbiased
estimators for m-dependent processes by basing estimators on subsamplcs
of independent observations that can be formed from the original sample.
Because the variances of estimators considered are too general and too
complex to be of much analytical use, a special case was investigated
using the Mante Carlo method of statistical experimentation to study
the mean square error of five estimators for the variance of the number
of individuals in a certain age group at a certain point in time,
assuming an underlying normal distribution, in small samples.
The main conclusions to be drawn out of the study are that maximum use of sample information outweighs all other considerations in
comparing the performance of estimators in terms of M.S.E., and that
an estimator based on normal theory does not compare favorably with
other estimators that do not assume an underlying normal distribution,
presumably because it depends on a large number of parameters in a nonlinear fashion.
5.2. Suggestions for Further Research
Two main directions for further useful research that are indicated
by this study are:
(1) investigation of probabilistic characterizations for the
distribution of the age distribution function that describe the distribution more fully than its moments,
(2) further examination of the problem of parameter estimation
I
72
from time serJ.es data in general, and the estimation of functions of
joint moments of mult.iv..n:iate distributions from serially correlated
observations in particular.
.-I
I
I
I
I
I
I
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
,_
I
I
I
I
I
J
I.
I
I
APPENDICES
A. DATA AND INTER}lliDIATE CALCULATIONS FOR SECTION (3.4)
TABLE A.l
BIRTH
P~\TE
PER 1000 POPULATION CORRECTED FOR UNDER-REGISTRATION
EGYPT (1940-1959)
==- --
Year
1940
1941
1942
1943
1944
Birth rate
46.2
45.1
41.0
42.2
44.3
Year
1946
1947
1948
1949
1950
1951
Birth rate
45.5
49.1
46.3
45.3
47.1
48.0
Year
1952
1953
1954
1955
1956
1957
Birth rate
48.2
45.9
45.7
45.9
42.3
40.0
Year
1958
1959
Birth rate
42.8
44.8
Source:
1945
E1-Badry. "Trends in the Components of Population GrO\vth in
the Arab Countries of the Middle East." [11J
I
.-I
TABLE A.2
ESTIHATED HID-YEAR POPULATION IN THOUSANDS, EGYPT
(19 l I0-1959) 'I:
Year
Population
Year
Population
Year
Population
Y.ear
Population
1%0
1941
1942
1943
18236
18434
18635·
18838
190h3
19250
1946
1947
1948
1949
1950
1951
19460
19734
20201
20678
21167
21668
1952
1953
1954
1955
1956
1957
22180
22704
23241
23791
24353
24929
1958
1959
25518
26121
1945
*Using
census figures corrected for under-enumeration of children less
than ten years of age and assuming exponential population growth bet\veen
censuses.
Source:
Compiled from "Population Trends in the U.A.R."
[6J
I
I
I
I
I
I
_I
I
I
I
I
I
I
.-I
I
I
I.
I
I
I
I
I
I
I
I_
I
I
I
I
I
I
I.
I
I
75
TABLE A.3
REPORTED DEATH RATE PER 1000 }L~LES IN THE AGE GROUP (0-4)
EGYPT (1940-1959)
1940
1941
1942
1943
117.4
112.1
116.5
100.1
99.5
106.9
Year
1946
1947
1949
1950
1951
Death rate
93.1
83.6
108.3
88.3
81.6
86.2
Year
1952
1953
1954
1955
1956
1957
Death rate
78.7
79.4
68.5
70.2
59.0
66.6
Year
1958
1959
Death rate
61.6.
57.0
Year
Death rate
Source:
"Population Trends in the U.A.R. lI
[6]
1945
76
TABLE A.4
\
AVERAGE RATIO OF SINGLE-YEAR nEATH JtATE TO THE DEATH RATE
OF THE AGE GROUP (0-4), EGYPT,
1937, 1.9/17 AND 1960
I~IFE TABT.ES
Year of life
Ratio
1
2
3
4
5
2.100
1.300
0.725
0.325
0.200
----_._-----_._---------_._-----------Source:
Compiled from IIPopu1ation Trends in the U.A.R." [6J
I
.-I
I
I
I
I
I
I
_I
I
I
I
I
I
I
.-I
I
I
I
77
,I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
.I
TABl.E A.5
RATIO OF CRUDE DEATH RATES CORRECTED FOR UNDER-REGISTP-ATION
TO REPORTED RATES, EGYPT
(194(J-1959)
-=====::.:.:--=
Year
1%0
1942
1943
1944
1945
Ratio
1.221
1.276
1.267
1.258
1.235
Year
1946
1947
1948
1949
1950
1951
Ratio
1.204
1.243
1.225
1.209
1.220
1.212
Year
1952
1953
1954
1955
1956
1957
Ratio
1.198
1.200
1.197
1.256
1.141
1.174
Year
1958
1959
1.175
;1..222
Ratio
Source:
1941
Compiled from E1-Badry's paper. [11]
I
B. CO}fPUTER l'ROGRAN
The FORTHAN IV computei' program used in the Honte Cnr10 study
describc'J in
(l~.
2) is reproduced be1oH:
DI~mNSJON
X(50,50),T(50,50),S(50,50),SD(50),R(7),M(8),A(8),B(25)
1,Yl(50),Y2(50),Y(25),E1(25,25),E2(25,25),ES(5),ESM(5),ESV(5),
2EStlliS(5),ESR}lSE(5),T1(2500),E11(625),E21(625)
C BUILDING lAG AND CORRELATION ARRAYS
1'1(1)=0
M(2)=l
1'1(3)=3
DO 2
3=1~,
6
2 M(J)=(J-3)*S
--I
I
I
I
I
I
I
_I
DO 3 J=1,6
3 A(J)=15./(M(J)+1.)
R(1)=-.75
DO 4 1=2,7
4 R(I)=R(I-l)+.25
DO 1 N=lO,30,lO
N2=N/2
WRITE(3,9)N2
9 FORMAT(lHl,13HSAMPLE SIZE =,5X,I5)
DO 1 J=1,6
WRITE(3,19)H(J)
19
FOR}~T(/////1HO,4HLAG=,14X,I5)
I
I
I
I
I
I
I
-.
I
I
I·
,e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
e
I
I
79
C BUILDING COPJlliLATION bECAY FACTORS ARRAY
DO 5 K:1,25
5 B(K)=O.O
MJ=M(J)
IF(MJ.EQ.O)GO TO 91
DO 6 K=l,MJ
B(K)=~~(-A(J)*K)
IF(ABS(B(K».LT •• IE-I0)B(K)=O.O
6 CONTINUE
91 MJ2=M(J)+2
C BUILDING
Y~TRICES
REQUIRED FOR COMPUTING ESTIMATORS
IF(N2.LT.MJ2)GO TO 15
DO 17 K=I,N2
DO 17 L=1,N2
IF(IABS(K-L)-MJ)117,117,18
18 El(K,L)=l.
E2(K,L)=O.O
GO TO J.7
117 E1(K,L)=O.O
E2(K,L)=1.
17 CONTINUE
121 DO 49 K=1,N2
49 E2(K,K)=O.O
GO TO 16
15 DO 20 K=1,N2
DO 20 L=1,N2
so
--I
El(K,L)=O.O
IF(K-L)21,200,21
21 E2(K,L)=1GO TO 20
200 E2(K,L)==0.0
20 CONTINUE
16 DO 1 1=1,7
WRITE(3,29)R(I)
29 FOR}~T(/!/lHO,24HCORRELATION
C BUILDING VARIANCE MATRIX
DO 7 K=l,N
7 X(K,K)=l0
DO 8 K=2,N
8 X(K,K-l)=R(I)
DO 59 K=l,N,2
K2=K+2
DO 59 L=K2,N,2
LB=(L-K)!2
X(L,K)=B(LB)
59 X(L+l,K)=X(L,K)*R(I)
DO 10 K=2,N,2
K2=K+2
DO 10 L=K2,N,2
LB=(L-k)!2
X(L,K)=B(LB)
10 X(L-l,K)=X(L,K)*R(I)
I
COEFFICIENT~,F10.6)
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
-.
I
I
I.
I
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
••
I
81
DO 81 K""l,N
DO 81 1,=l,K
IF(ABS(X(K,L».LT •• 1E-10)X(K,L)=0.0
81 CONTINUE
DO 101 K=l,N
DO 101 L::::1,K
101 X(L,K)=X(K,L)
C GENERATING HULTINORK<\L OBSERVATIONS
CALL FCTRIZ(X,N,T)
W=O.O
IX=I*}LJ2*N+1
DO 26 L=I,5
ESH(L)=O.O
26 ESV(L)=O.O
NS=1000
DO 11 NN=I,NS
DO 12 L=l,N
CALL
NOP~EV(IX,W,Z)
IF(ABS(Z).LT •• IE-10)Z=O.O
12 Yl(L)=Z
CALL PACKR(T,50,50,Tl,N,N)
CALL MULT(Tl,N,N,Yl,N,1,Y2)
DO 13 L=1,N,2
Ll=L+l
L2=L1/2
13 Y(L2)=Y2(L)*Y2(Ll)
82
C CONPUTATIUN OF ESTU1ATES
ESOl=O.O
DO 14 L=1,N2
14 ES01=FSOlfY(L)*Y(L)
ESO~ES01/K2
CALL PACKLS(El,25,25,Ell,N2,N2)
CALL ATRBA(Y,N2,1,Ell,N2,N2,ES(1»
CALL PACKLS(E2,25,25,E21,N2,N2)
CALL ATRBA(Y,N2,l,E21,N2,N2,ES(2»
ES(3)=ES(1)+ES(2)
ES (4)=ESOl+ES (3)
IF(N2.LT.MJ2)GO TO 22
ES(1)=ES(1)/«N2-M(J»*(N2-M(J)-1»
ES(2)=ES(2)!(N2*(2*N2-M(J)-1»
GO TO 23
22 ES(2)=ES(2)/(N2*(N2-1»
23 ES(3)=ES(3)/(N2*(N2-1»
ES(4)=ES(4)/(N)*N2)
DO 24 L=1,4
24 ES(L)=ESO-ES(L)
Yill=O.O
YM2=O.O
Vl=O.O
V2=O.O
V12=O.O
DO 25 L=1,N,2
I
_.
I
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
-.
I
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
.e
I
83
Ll=L+l
YMl::YNl+Y2(L)/N2
YH2==YH2+y2 (Ll) /N2
Vl~VI4Y2(L)*Y2(L)/(N2-1)
V2=V2+Y2(Ll)*Y2(Ll)/(N2-1)
25 VI2=V12+Y2(L)*Y2(Ll)/(N2-1)
Vl=VI-N2*Yl-U*nu/ (NZ-l)
V2=V2-N2*YH2*YH2/ (N2--1)
VI2=VI2-N2*~11*YM2/(N2-1)
ES(5)=VI2*VI2+Vl*YM2*~12+V2*YMl*YMl+2*VI2*YM1*nI2+Vl*V2
DO 11 L=I,5
ESM(L)=ESM(L)+ES(L)/NS
11 ESV(I.)=ESV(L)+ES(L)*ES(L)/NS
P=R(I)*R(I)+L
DO 27 L=I,5
ESV(L)=ESV(L)-ESM(L)*ESM(L)
ESMSE(L)=ESV(L)+(ESM(L)-P) **2
27 ESRMSE(L)=ESMSE(L)/P
1 WRITE(3,39)P,(L,ESM(L),ESV(L),ESMSE(L),ESRMSE(L),L=I,5)
39 FORMAT (lHO, 16HPARAMETER VALUE=,4X,FI2.6,///3X,9HESTlMATOR,12X,
14HMEAN,12X,8HVARIANCE,12X,6HM.S.E.,14X,8HR.M.S.E.//(6X,I5,6x,4
2(FI4.6,5X»)
STOP
END
84
SUBROUTINE FCTRIZ(X,N,T)
DIMEKSION X(50,50),T(SO,50),S(50,SO),SD(50)
DO 14 K=I,N
DO ll. V=K,N
J.4 T(K,L)=O.O
T(I,l)""SQRT(X(I,l»
IF(N-l)ll,12,2
2 DO 3 I:;:2,N
3 T(I,l)=X(I,I)/T(!,l)
Nl=N-l
IF(Nl--1)12,J.,4
4 DO 5 K=2,Nl
Kl=K+l
DO 5 I==Kl,N
5 S(I,K)=T(I,I)*T(K,I)
DO 6 J=2,N
6 SD(J)=T(J ,1)*T(J ,1) .
DO 10 K=2,N1
T(K,K)=SQRT(X(K,K)-SD(K»
Kl=K+l
DO 9 I=Kl,N
9 T(I,K)=(X(I,K)-S(I,K»/T(K,K)
IF(NI-K)l2,1,7
7 DO 8 J=K1,N
8 SD(J)=SD(J)+T(J,K)*T(J,K)
DO 10 L=Kl,Nl
I
••I
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
-.
I
I
I
85
,.
L1=L+1
DO 10 M=L1,N
10 S(M,L)=S(M,L)+T(M,K)*T(L,K)
I
I
I
I
I
I
1 SD(N)=SD(N)+T(N,N1)*T(N,N1)
T(N,N)=SQRT(X(N,N)-SD(N»
RETURN
11 WRITE(3,13)
13 FORMAT(1HO,33H VARIANCE MATRIX DIMENSION ERROR)
12 RETURN
END
SUBROUTINE NORDEV(IX,W,Z)
IF(W)1,1,2
Ie
2 Z=2.*W*V*U
W=O.O
I
I
I
I
I
I
I
RETURN
1 CALL RANDU(IX,IY,V)
IX=IY
CALL RANDU(IX,IY,U)
IX=IY
U=2.*U-1.
W=V*V+U*U
IF(1.-W)1,3,3
3 CALL RANDU(IX,IY,X)
IX=IY
,.
I
W=SQRT(-2.*ALOG(X»/W
.-
I
86
Z= (V*V-U'~U)
,'w
RETURN
LND
The subroutines:
PACKR
packs a rectangular matrix as a column vector
HULT
multiplies two rectangular mio"'.trices
PACKLS: packs a lower symmetric matrix as a column vector
ATRBA : performs the matrix multiplication AlBA
were written by Dr. T. G. Donnely for the Hatrix Subroutines Pac.kage
of the
Dep~rtment
of Biostatistics, University of North Carolina at
Chapel Hill, North Carolina.
••I
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
-.
I
I
I
.e
I
I
I
I
I
I
C. NillmRlCAL RESllLTS OF THE H01'1'fE CARLO STUDY
TABLE
RELATIVE MEAN SQUARE ERROR OF THE FIVE ESTIMl\.TORS
-
_..
n
m
.e
I
Estimator
Corre~ation
-0 :1.L-=-Q.!..~
Parameter Value
5
0
Ie
I
I
I
I
I
.1
I
C.1
1
3
-0.25
.-:.:~.=:::::==::::~:.::.~
Coefficient
;-,
._--
0.0
0.25
O'.~__ ~_.'
1. 5625
1.25
1.0625
1.000
1.0625
1.25
1.5625
1
4.225
2.655
2.100
1.476
1. 795
2.200
3.146
2
7.061
3.390
2.435
1.394
2.085
2.918
4.944
2.655
2.100
1.476
1. 795
2.200
3.1l16
3
.4.225
4
2.743
1.769
1.381
0.998
1.188
1./173
2.101
5
11.820
5~776
4.439
2.802
3.813
5.052
8.572
1
5.925
3.527
2.083
1.656
2.352
4.442
3.492
2
8.635
4.111
2.111
1.517
2.125
4.743
5.160
3
5.879
3.222
1.994
1.606
2.077
4.132
3.571
4
3.779
2.107
1.295
1.058
1.371
2.687
2.346
5
12.951
7.396
4.588
3.222
4.768
6.566 10.652
1
6.063
4.200 . 5.237
2.449
3.274
5.183
6.802
2
4.594
3.169
2.736
1. 780
2.041
4.010
5.650
3
3.634
2.721
2.468
1.833
1. 929
3.627
4.685
4
2.390
1. 782
1.611
1.183
1.268
2.374
3.031
5
11.285
6.859
6.741
3.673
4.791
7.058 13.142
8°u
TABLE C.I--Continued
.
..
=-----==--=--.:::.;.::;.=.;.:::.::.~"":::::;..:.~.:;:::_:::~,.;;:R_:::.:::.-=;=:::;=_...=_:= ..:::.::_:::;.;=_~_=:::::.~:·=:'::::=:=';':'::~;..~_ .:";=:-:.:,::;-:::-:-;=-~~:-=-,,=,._
n
m
-:::..-=:.---==
Estimator _________ -..£9rr_~.latic.?lL(,~()_(;d)2-,-c:;i~1l.L_.
:_____._____
" ..:-0 .1.5__:.9_~5__ -=.Q: 25___~ ._Q___ ._._.iJ~?-2. __.9~5______9_~? 5 __
5
10
15
10
0
1
6.580
4.013
4.369
1.527
2.617
3.524
7.711 6
2
3.353
2.666
3.834
1. 62)
2.188
2.507
'•. 745
3
3.3.53
2.666
3.83/1
1.621
2.188
2.507
4.745
-4
2.:115
1.748
2.497
1.085
1.438
1.652
3.098
5
11.539
7.879
4.508
2.834
5.221
5.682 10.853
1
7.260
4.586
2.308
1.392
2.139
2.809
9.668
2
4.11/.
2.910
1.818
1.373
1.641
1.710
6.095
3
4.114
2.910
1.818
1.373
i.641
1.710
6.095
4
2.709
1.919
1.231
0.934
1.121
1.200
4.017
5
8.367
6.312
3.465
2.517
3.438
4.248 10.931
1
15.252
5.525
2.242
1.655
2.244
I~. 770
9.756
2
5.771
3.136
1. 776
1.537
1. 679
2.612
3.573
3
5.771
3.136
1.776
1.537
1.679
2.612
3.573
4
3.7"61
2.098
1.191
1.051
1.154
1.753
2.350
5
15.560
6.890
2.972
2.387
3.802
5.318 10.238
1
2.459
1.200
0.992
1.067
1.021
1.266
1. 726
2
3.811
1. 784
1.188
1.03l
1.147
1.896
2.989
3
2.459
1.200
0.992
1.067
1.021
1.266
1. 726
4
2.006.
0.'997
0.813
0.863
0.826
1.043
1.431
.5
2.666
1.839
1.302
0.974
1.047
1. 991.
2.696
-------_.-
I
I
-,
I
I
I
I
I
I
eI
I
I
I
I
I
I
I
e
I
I
89
TABLE C.1--Col'. t inucd
n
m Estitlator
_
____-~o::..;._I2-_-=.9_~5
1
3
5
10
-0. 25,,--_0_._0
0.:J.5_ _.Q.~__Q.. 75 __
1
2.137
1.~82
1.030
0.782
1.039
1.203
1.693
2
3.353
2.152
1.173
0.742
1.142
1.652
2.688
3
2.088
1. 498
1.014
0.772
1.027
1.191
1.667
4
1. 707
1.224
0.826
0.638
0.846
0.986
1.362
5
2.988
2.156
1.354
0.938
1.267
1.797
2.389
1
1. 709
1.495
1. 064
0.901
1.034
1.327
2.180
2
2.354
1.933
1.106
0.885
1.105
1.617
2.879
3
1.665
1.464
0.999·
0.905
0.978
1.241
2.067
4
1.373
1.186
0.815
0.737
0.795
1. 024
1. 686
5
2.397
1.931
1.193
0.926
1.260
1.873
2.586
1
2.397
2.120
1.167
0.868
1.291
2.186
2.582
2
2.872
2.197
1.153
0.786
1.150
2.453
2.821
3
2.180
1.907
1.078
0.797
1.110
2.127
2.276
4
1.768
1.547
0.881
0.653
0.904
1. 742
1.860
5
3.118
1.780
1.274
1.025
1.052
1. 852
2.602
1
3.824
2.404
1. 022
0.966
1.330
1. 632
3.691
2
2.280
1.501
0.851
0.956
1.119
1.050
1.889
3
2.280
1.501
0.851
0.956
1.119
1.050
1. 889
4
1.864
1.228
0.710
.0.782
0.923
0.881
1.552
5
2.566
2.179
1.220
0.903
1.170
1. 669
3.335
I
I
90
TABLE C .l··-Collt:imwd
=----==~--=-~_.:.:.::=
n
m
~~=:::=.:_~=':;:..=--~--.:.:::-~.~-:::==.;..-:':.:::::':':'-:~
Estintator
_ .__
~ _ _.___Qp.xL~L~tion C(2.S'J.tt..~i5'nt_·
-0.75
15
15
0
1
3
.=..:~:~"::7_'7;."'.::::':-':'=:-=::-;'-":'=-....;:;;;:.;.:,,:::::.~.~-==-=~:'.':==:=:--=:'";';:::
-0.5
.-O_,.? ~_..Q,_O ____~~..L__
O"-~ ..___-.Q:.} 5 "~"
1
4.265
3.045
1.405
0.865
1.154
2.636
4.451
2
2.282
1. 946
1.031
0.792-
0.906
1 ,1133
2.237
3
2.282
1. 946
1.031
0.792
0.906
l. l133
2.237
q
1.880
1.590
0.864
0.663
0.762
1.178
1.832
5i
3.290
2.182
1.303
0.948
1.175
2.71+8
3.557
1
1.343
0.957
0.600
0.537
0.671
0.790
1.075
2
2.328
1.396
0.716
0.528
0.780
1.169
2. all.
3
1.343
0.957
0.600
0.537
0.671
0.790
1.075
4
1.178
0.839
0.527
O.ll72
0.592
0.702
0.944
5
1.457
0.968
0.620
0.569
0.655
0.980
1.438
1.
1.484
0.907
0.691
0.463
0.776
1.043
1.432
2
2.478
1.312
0.795
0.466
0.843
1.502
2.292
3
1.487
0.903
0.697
0.461
0.775
1. 0117
1.423
4
1.2"94
0.791
0.612
0.412
0.676
0.915
1.250
S
1.549
0.977
0.561
0.504
0.568
1.135
1.4'17
1.
1.655
0.967
0.676
0.453
0.633
1.099
1.465
2
2.412
1.274
0.760
0.445
0.689
1.493
2.3 l13
3
1.625
0.939
0.681
0.449
0.625
1. 09 /+
1.491
4
1.417
0.817
0.597
0.398
0.548
0.956
1.304
5
1.450
0.9l.2
0.573
0.381
0.590
1.157
1.545
-.
I
I
I
I
I
I
eI
I
I
I
I
I
I
I
-.
I
I
I
-,
LIST OF REFERENCES
1. AHchinson, J. and J. Bro\VIl.
l'he_L2.,g1~o..rmaJ-l?JsJ:_I}bution,
CmLbriclgo University Press, 1957, pp. 8-9.
2.
AJ~oian,
L. A.
liThe Probability Function of the Product of T\vo
:Normally Distributed Variables." An.!:l.al!i..-of Mat:..hcma_tiq!.:J:.
St_ati§tic.s, Vol. XVIII, 1947, pp. 265·-277.
3. Baney, N. T. J. Th£ Elem_eE.ts of St:.QEJ..1_"!.~.!ic Proc_~.§.:~8s..\Vi_~.h.
APplications to the Natu,ra1 Sciences, John lViley & Sons, Inc.,
New York, 1964, pp. 110-115.
4. Bartlett, 'H. s.
Stocllastic Processell, Notes of a course given
at the University of North Carolina in the Fall quarter, 1946,
pp. 28-42.
5. Bell, J. R. "Algorithm 334, Normal Random Deviates," Communications
of the IICM, July 1968, p. 498.
6. Central Statistical Committee.
Population Trends in the U. A. R.,
Government Printing Office, Cairo, 1965.
7. Chiang, C. L. Introduction to Stochastic Processes in Biostatistics,.,
John Hiley & Sons, Inc.~ New York, 1968, p. 62.
8. Coale, A. J. and P. Demeny. Regional Model Life Tables and Stable
Populations" Princeton University Press, Princeton, New Jersey,
1966, pp. 8-9.·
9. Craig, C. C.
"On the Frequency Function of xy."
Vol. VII, 1936, pp. 1-15.
Annals of }fathe-
~atical St~tistics,
10. Doab, J. I•• Stochastic Processes, John Wiley & Sons, Inc., New
York, 1967, pp. 60-63.
11. EI-Badry, M. A. "Trends in the Components of Population Growth
in the Arab Countries of the Middle East, A Survey of Present
Information." 'pemog~l!Y., Vol. 2, 1965, pp. 140-186.
12. Feller, W.
An Introduction to Probability Theo_D:. and Its Arp)ications, Vol. I, Second Edition, John Wiley & Sons, Inc., 1965,
pp. 407-411.
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
e·
I
I
I
93
I.
I
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
Ie
-,
I
13. Frcchct, H. "Sur Les Expressions Analytiquc dc 1a Hortalite
V<1riahles pour 1a Vie Entiere. II JOl~~a..L.A(:...1~"l.-0g_~P.t(.E...C!.
St1'ltif"~ti(1_ye <-15J?rif::i., Vol. 88, July--August 1947, pr. 261-285.
14. Harvard University. _:r.~bles..-.£f._Inv~..r:..9..~.J!YJ?_erhoJic
__D!D~_UQ.Q§_,
Harvard Uni.versi.ty Prcss, Cambridge, Hassachusetts, 1949.
15. Hoeffding, \\f. "A Class of Statistics With Asymptotically Normal
Distri11ltions." bnn~ls of Has.hemati~G! Stati~tics, Vol. 19,
1948, pp. 293-335.
16. Johnson, N. L. and F. Leone. §.ytist;£~-.?.!l~~~~I.~~nt_a1DC!i!X__12,
Volume I, John Wiley & Sons, Inc., New York, 19M, pp. 117-118.
17. Kendall, D. G. "Stochastic Processes and Population Growth,
Symposium on Stochastic Processes." JC!.lgll?l o~ t1t....c Royal
Statjst~~a1 Society, Series B, Vol. 11, 1949, pp. 230-264.
18. Kendall, N. and A. Stuart. The Adval'~~cd Theo.EL..~_LStatisti.c.:?,
Vol. I, Hofner Publishing Company, ~ew York, 1963.
19. Leslie, P. H. "On the Use of Matrices in Certain Population
Mathematics." Biometrika, Vol. XXXIII, Part III, 1945, pp. le3212.
20. Lopez, A. PrQ..12leI!l.s in Stable Populati9n Theory, Princeton University Press, Princeton, New Jersey, 1961.
21. Lotka, A. J. "Studies on the Mode of Growth of Material Aggregates."
American Journal of SCience, Vol. XXIV, 1907, pp. 199-218.
22. Lotka, A. J. "Relation Between Birth Rates and Death Rates."
Scie1l£.£, Vol. XXVI, 1907, pp. 21-23.
23. Lotka, A. J. Theorie Analytique des Associations Biologique,
Herman and Cie, Paris, 1939.
24. Pollard, J. H. "On the Use of Direct Matrix Product in Analyzing
Certain Stochastic Population Nodels." Biometrika, Vol. 53,
1966, pp. 397-415.
25. Reed, L. J. and M. Merrell. "A Short Nethod for Constructing an
Abridged Life Table." American Journal of Hygiene, Vol. 30, 1939,
pp. 33-62.
26. Sen, P. K. "On the Properties of U-Statistics When the Observations are Not Independent, Part 1, Estimation of Non-Serial
Parameters in Some Stationary Stochastic Processes." Calcutta
Statistical Association Bulletin, Vol. 12, 1963, pp. 69-92.
I
27. Sheps, M. C. "Applications of Probability Hodels to the Study
of Patterns of Human Reproduction." l'-!.!.g).i..<;';..l1ci:!J.1::h.....fl..!l..tl
P0.p.!:~1_~_t!2.ll-Cha22Kc:.., Hinck1 C. Sheps and Jeane Cl arc Ridley,
Editors, University of Pittsburgh I)ress, 1965, p. 309.
28. Sykes, Z. H. "Some Stochastic Versions of the Matrix Hodel
for Population Dynamics." Journal_.9_L.tJ1.e A!nslL~~~-g_gil~t;.~sUs~l
t>-ssoci..ation, Vol. 64, Number 325, 1969, pp. 111-·130.
29. Taboh, L. "Relation Between Age Structure, Fertility, HortalHy
and ¥~gration, Population Replacement and Renewal." ~~
'~orlcLX9...PE.l.ation Conference_, 1965, Backr;round Paper B,
7/15/E/476.
30. Technical Publications Department, IBM Corporation • T~_M-,-~~J£ll­
tifie Subroutines Packap~; White Plains, New York, 1969.
_.
I
I
I
I
I
I
I
_I
I
I
I
I
I
I
I
-.
I