Download 1 Important Formulas in Ratio Estimation

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
STAT 152, Fall 2003
Summary for Chapter 3
The theme for this chapter is using known information to improve the precision of estimates of means and totals. Ratio and Regression estimation improve
precision when the variability of the residuals (yi − yi ), where yi is the predicted
value from the ratio or regression model, is smaller than the variability of the
(yi − y).
1
Important Formulas in Ratio Estimation
For ratio estimation to apply, two quantities yi and xi must be measured on
each sample unit. If an SRS is taken, natural estimators for ratio B, population
total ty , and population mean y U are:
=
• B
y
x
ty
tx
=
x
• tyr = Bt
U
• y r = Bx
Bias and mean squared error of ratio estimators:
= −Cov(B,
x)/xU
• |Bias(B)|
•
|Bias(B)|
1/2
[V (B)]
≤ CV (x)
− B] ≈ (1 −
• E[B
− B)2 ] ≈
• E[(B
= (1 −
• V [B]
n
1
N ) nx2U
1
x2U
(BSx2 − RSx Sy ) =
1
x2U
[BV (x) − Cov(x, y)]
V (d), where d = y − Bx
s2e
n
N ) nx2U
= (1 −
= N 2 (1 −
• V [
tyr ] = V [tx B]
= (1 −
• V [
y r ] = V [xU B]
n
1
N ) nx2U
2
n se
N) n
2
n se
N)n
1
i∈S
i )2
(yi −Bx
n−1
i
, where ei = yi − Bx
2
Regression Estimation
Ratio estimation works best if the data are well fit by a straight line through
the origin. Sometimes, data appear to be evenly scattered about a straight line
that does not go through the origin-that is, the data look as though the usual
straight line regression model: y = B0 + B1 x
Important formulas in regression estimation:
0 + B
1 xU , estimator of y U .
y reg = B
• 0 and B
• B
1 are the ordinary least squares regression coefficients:
(xi −x)(yi −y)
rs
i∈S
1 = 0 = y − B
1 x.
B
= sxy , B
(x −x)2
i∈S
i
1 , x)
• Bias: E[
yreg − y U ] = −Cov(B
2
(yi −[y U +B1 (xi −xU )])2
n Sd
• Mean Squared Error: MSE(
y reg ) ≈ (1− N
) n , where Sd2 = N
i=1
N −1
n s2e
0 + B
1 xi ).
• Standard Error: SE(
y reg ) = (1 − N
) n , where ei = yi − (B
3
Estimation in Domains
Suppose there are D domains. Let Ud be the index set of the units in the population that are in domain d and let Sd be the index set of the units in the sample
that are in domain d, for d = 1, 2, ..., D. Let Nd be the number of population
units in Ud , and nd be the number of sample units in Sd .
Important formulas in domain estimation:
• y Ud = i∈Ud Nyid .
• y d = i∈Sd nydi , a natural estimator of y Ud
2
n syd
• Standard Error of y d : SE(y d ) ≈ (1 − N
) nd
yi if i ∈ Ud
• tyd = N u, where ui =
0 if i ∈
/ Ud
n s2u
• Standard Error of tyd : SE(tyd ) = N (1 − N
)n
2
4
Comparison
SRS
Ratio
Regression
Population Total
Estimator
Population Mean
Estimator
ei
ty
ty ( tx )
tx
1 (xU − x)]
N [y + B
y
y( tx )
tx
1 (xU − x)]
y+B
yi − y
i
yi − Bx
0 − B
1 xi
yi − B
The population total estimator variance is N 2 (1 −
The population mean estimator variance is (1 −
3
2
n se
N)n.
2
n se
N)n .
Related documents