2. Composite optimal regression estimation for design (c)
Takis Merkouris
A general estimation method for matrix sampling is illustrated for design (c) through the simplest setting involving three samples $s_1$, $s_2$ and $s_3$, with arbitrary designs and sizes $n_1$, $n_2$ and $n_3$, which may be subsamples of an initial sample $s$ of size $n$ from a population labeled $U$, or may be drawn independently from $U$. A $p$-dimensional vector of variables $\mathbf{x}$ and a $q$-dimensional vector of variables $\mathbf{y}$ are surveyed in $s_1$ and $s_2$, respectively, and both vectors are surveyed in $s_3$. These two modes of matrix sampling, depicted in Figure 2.1, will henceforth be referred to as nested and non-nested matrix sampling, respectively, in analogy with nested and non-nested two-phase sampling (Hidiroglou 2001).
Figure 2.1 Nested and non-nested matrix sampling design (c)
We denote by $\mathbf{w}_i$ the vector of design weights for sample $s_i$, $i = 1, 2, 3$, and by $\mathbf{X}_1$, $\mathbf{X}_3$ and $\mathbf{Y}_2$, $\mathbf{Y}_3$ the sample matrices of values of $\mathbf{x}$ and $\mathbf{y}$, the subscripts indicating the sample. We obtain simple Horvitz-Thompson (HT) estimators $\hat{\mathbf{t}}_{x_1}$ and $\hat{\mathbf{t}}_{x_3}$ of the population total $\mathbf{t}_x$ of $\mathbf{x}$ using $s_1$ and $s_3$, respectively, and HT estimators $\hat{\mathbf{t}}_{y_2}$ and $\hat{\mathbf{t}}_{y_3}$ of the total $\mathbf{t}_y$ of $\mathbf{y}$ using $s_2$ and $s_3$.
For more efficient estimation of the totals $\mathbf{t}_x$ and $\mathbf{t}_y$, we seek composite estimators that combine all the available information on $\mathbf{x}$ and $\mathbf{y}$ in the three samples. Such composite estimators that are best linear unbiased estimators (BLUE), i.e., minimum-variance linear unbiased combinations of the four estimators $\hat{\mathbf{t}}_{x_1}$, $\hat{\mathbf{t}}_{x_3}$, $\hat{\mathbf{t}}_{y_2}$ and $\hat{\mathbf{t}}_{y_3}$, are denoted by $\hat{\mathbf{t}}_x^{c}$ and $\hat{\mathbf{t}}_y^{c}$ and are given in matrix form by
$$\left(\hat{\mathbf{t}}_x^{c\,\prime},\, \hat{\mathbf{t}}_y^{c\,\prime}\right)^{\prime} = \mathbf{A}\,\hat{\mathbf{t}}, \qquad (2.1)$$
where $\hat{\mathbf{t}} = \left(\hat{\mathbf{t}}_{x_1}^{\prime},\, \hat{\mathbf{t}}_{x_3}^{\prime},\, \hat{\mathbf{t}}_{y_2}^{\prime},\, \hat{\mathbf{t}}_{y_3}^{\prime}\right)^{\prime}$, the matrix $\mathbf{A}$ satisfies the unbiasedness condition and has entries determined by $\mathbf{V}$, the variance-covariance matrix of $\hat{\mathbf{t}}$.
This estimation method was proposed by Chipperfield and Steel (2009), who provided analytical expressions of the BLUE for scalars $x$ and $y$ in non-nested matrix sampling, assuming simple random sampling and known $\mathbf{V}$. Such an approach to composite estimation has also been explored in a different context of survey sampling; see Wolter (1979), Jones (1980) and Fuller (1990). In general, computation of the BLUE given by (2.1) is not at all practical, as the computation of an estimated matrix $\mathbf{V}$ (and its inverse) in (2.1) would be quite laborious, especially if the number of variables or the sizes of the samples were large; it would be prohibitive if estimates for subpopulations were also required. Of course, the problem would become more difficult with more samples involved.
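To make the structure of the BLUE in (2.1) concrete, the following sketch combines four HT estimates of two scalar totals by generalized least squares; it is an illustration only, not the paper's implementation, and the toy numbers, the variable names and the assumption of a known $\mathbf{V}$ are ours.

```python
import numpy as np

# Hypothetical HT estimates of t_x (from s1 and s3) and t_y (from s2 and s3).
t_hat = np.array([1020.0, 980.0, 505.0, 512.0])   # (t_x1, t_x3, t_y2, t_y3)

# Assumed known covariance matrix V of the four estimators (illustrative values);
# the s3-based estimators of t_x and t_y are correlated because they share sample s3.
V = np.array([[400.0,   0.0,   0.0,   0.0],
              [  0.0, 380.0,   0.0,  90.0],
              [  0.0,   0.0, 150.0,   0.0],
              [  0.0,  90.0,   0.0, 160.0]])

# Unbiasedness structure: each estimator targets either t_x or t_y.
C = np.array([[1.0, 0.0],    # t_x1 estimates t_x
              [1.0, 0.0],    # t_x3 estimates t_x
              [0.0, 1.0],    # t_y2 estimates t_y
              [0.0, 1.0]])   # t_y3 estimates t_y

# Minimum-variance linear unbiased combination (generalized least squares):
# A = (C' V^{-1} C)^{-1} C' V^{-1}, composite estimate = A t_hat.
V_inv = np.linalg.inv(V)
A = np.linalg.solve(C.T @ V_inv @ C, C.T @ V_inv)
t_composite = A @ t_hat                            # (t_x^c, t_y^c)
var_composite = np.linalg.inv(C.T @ V_inv @ C)     # covariance matrix of the composite

print(t_composite)
print(np.diag(var_composite))
```

Even in this toy case the computation hinges on $\mathbf{V}$ and its inverse, which is precisely the burden noted above when the number of variables or the sample sizes grow.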
A more practical formulation of this estimation procedure is as follows. First, we express the composite estimators in (2.1) explicitly as linear combinations of the HT estimators $\hat{\mathbf{t}}_{x_1}$, $\hat{\mathbf{t}}_{x_3}$, $\hat{\mathbf{t}}_{y_2}$ and $\hat{\mathbf{t}}_{y_3}$. The condition of unbiasedness, $E\left(\hat{\mathbf{t}}_x^{c}\right) = \mathbf{t}_x$ and $E\left(\hat{\mathbf{t}}_y^{c}\right) = \mathbf{t}_y$, constrains the coefficient matrices of these linear combinations. Thus, $\hat{\mathbf{t}}_x^{c}$ and $\hat{\mathbf{t}}_y^{c}$ can be expressed in the regression form (2.2); that is, the two composite estimators have necessarily the regression form. Then, writing (2.2) in obvious notation with coefficient matrix $\mathbf{B}$, we can express (2.1) as (2.3), the right-hand side of (2.3) being the matrix form of (2.2). The problem of finding the optimal (variance-minimizing) matrix $\mathbf{A}$ of the BLUE in (2.1) then reduces to that of finding the optimal matrix $\mathbf{B}$ in (2.3). The estimated optimal $\mathbf{B}$ is given by (2.4), and when the three samples are independent it reduces to (2.5). In view of (2.3), with such an optimal $\mathbf{B}$, the estimated BLUE in (2.1) (involving the estimated $\mathbf{V}$) is a special type of optimal multivariate regression estimator. For the form of the ordinary (single-sample) optimal regression estimator and relevant discussion, see Montanari (1987) and Rao (1994).
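For orientation, the ordinary (single-sample) optimal regression estimator referred to here is commonly written, in generic notation given only for context and not as the cited papers' exact notation, as
$$\hat{t}_{y}^{\,\mathrm{opt}} = \hat{t}_{y} + \left(\mathbf{t}_{x} - \hat{\mathbf{t}}_{x}\right)^{\prime}\hat{\boldsymbol{\beta}}_{\mathrm{opt}}, \qquad \hat{\boldsymbol{\beta}}_{\mathrm{opt}} = \widehat{V}\!\left(\hat{\mathbf{t}}_{x}\right)^{-1}\widehat{\mathrm{Cov}}\!\left(\hat{\mathbf{t}}_{x},\, \hat{t}_{y}\right),$$
where $\hat{t}_{y}$ and $\hat{\mathbf{t}}_{x}$ are HT estimators and $\mathbf{t}_{x}$ is the known vector of auxiliary totals; in the composite setting, the role of the auxiliary information is played by the agreement of estimates of the same total across samples.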
Expressing the estimated variance of the HT estimator of a total (see, for example, Särndal, Swensson and Wretman (1992), page 43) as a quadratic form with associated non-negative definite matrix $\boldsymbol{\Omega}$, whose entries involve the first- and second-order inclusion probabilities $\pi_i$ and $\pi_{ij}$, it can be shown after some matrix algebra that the estimated optimal $\mathbf{B}$ takes a form in which $\mathbf{Z}$ is the design matrix corresponding to the regression estimator (2.3), a companion matrix equals $\mathbf{Z}$ with the first two rows set equal to zero, and $\boldsymbol{\Omega}$ is associated with the combined sample $s_1 \cup s_2 \cup s_3$, reducing in non-nested sampling to the block-diagonal matrix $\mathrm{diag}\left(\boldsymbol{\Omega}_1, \boldsymbol{\Omega}_2, \boldsymbol{\Omega}_3\right)$, with $\boldsymbol{\Omega}_i$ associated with the sample $s_i$. For the nested design, the probabilities defining $\boldsymbol{\Omega}$ are products of the probabilities of inclusion in $s$ and the conditional (on $s$) subsampling probabilities. With this estimated $\mathbf{B}$, the estimated BLUE in (2.3), called the composite optimal regression (COR) estimator and denoted by $\hat{\mathbf{t}}^{\mathrm{cor}}$, is written compactly in terms of $\mathbf{w}$, the vector of design weights of the combined sample $s_1 \cup s_2 \cup s_3$. It transpires that the COR estimator is in fact a sum of weighted sample regression residuals, and the optimal $\mathbf{B}$ minimizes the quadratic form in these residuals, which is the estimated approximate (large-sample) variance of $\hat{\mathbf{t}}^{\mathrm{cor}}$.
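As a minimal sketch of the quadratic-form representation used here, the following code writes the cited HT variance estimator as $\check{\mathbf{y}}^{\prime}\boldsymbol{\Omega}\check{\mathbf{y}}$; the function name and the toy SRSWOR numbers are our own illustration, not the paper's notation.

```python
import numpy as np

def ht_variance_quadratic_form(y, pi, pi2):
    """Estimated variance of the HT total written as a quadratic form
    y_check' Omega y_check, with y_check_k = y_k / pi_k and
    Omega[k, l] = (pi2[k, l] - pi[k] * pi[l]) / pi2[k, l]."""
    y_check = y / pi
    omega = (pi2 - np.outer(pi, pi)) / pi2
    return y_check @ omega @ y_check, omega

# Toy example: SRSWOR of size n from a population of size N.
N, n = 10, 4
rng = np.random.default_rng(0)
y = rng.normal(50.0, 10.0, size=n)
pi = np.full(n, n / N)                                # first-order inclusion probabilities
pi2 = np.full((n, n), n * (n - 1) / (N * (N - 1)))    # second-order inclusion probabilities
np.fill_diagonal(pi2, n / N)
v_hat, omega = ht_variance_quadratic_form(y, pi, pi2)
print(v_hat)
```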
Now, upon rewriting this compact expression, it appears that the COR estimator has the form of a calibration estimator, with a vector of calibration totals of dimension $p + q$ and with calibrated weights satisfying the constraints $\mathbf{X}_1^{\prime}\mathbf{w}_1^{c} = \mathbf{X}_3^{\prime}\mathbf{w}_3^{c}$ and $\mathbf{Y}_2^{\prime}\mathbf{w}_2^{c} = \mathbf{Y}_3^{\prime}\mathbf{w}_3^{c}$, i.e., calibrated estimates of the same total from two different samples are equal. Indeed, the vector $\mathbf{w}^{c}$ of calibrated weights is the vector that minimizes the generalized least-squares distance between $\mathbf{w}^{c}$ and $\mathbf{w}$ defined through the matrix $\boldsymbol{\Omega}$ while satisfying these constraints, where the subvector $\mathbf{w}_i^{c}$ corresponds to sample $s_i$. This follows from a general result for the single-sample case, according to which calibration with the generalized least-squares distance measure may involve an arbitrary positive definite matrix instead of the customary diagonal matrix; see Andersson and Thorburn (2005).
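A minimal sketch of this calibration step follows; the function name, the toy data, the diagonal working choice of distance matrix and the encoding of the equal-estimates constraints as differences set to zero are all our assumptions, not the paper's specification.

```python
import numpy as np

def gls_calibrate(w, Z, t, omega):
    """Calibrated weights minimizing the quadratic distance
    (w_c - w)' omega^{-1} (w_c - w) subject to Z' w_c = t."""
    lam = np.linalg.solve(Z.T @ omega @ Z, t - Z.T @ w)
    return w + omega @ Z @ lam

# Toy non-nested example with scalar x and y.
rng = np.random.default_rng(1)
n1, n2, n3 = 5, 6, 4
x1, x3 = rng.normal(10.0, 2.0, n1), rng.normal(10.0, 2.0, n3)
y2, y3 = rng.normal(5.0, 1.0, n2), rng.normal(5.0, 1.0, n3)
w = np.concatenate([np.full(n1, 20.0), np.full(n2, 15.0), np.full(n3, 25.0)])

# Constraints: calibrated estimates of the same total from two samples coincide,
#   x1' w1_c - x3' w3_c = 0   and   y2' w2_c - y3' w3_c = 0.
Z = np.zeros((n1 + n2 + n3, 2))
Z[:n1, 0] = x1
Z[n1:n1 + n2, 1] = y2
Z[n1 + n2:, 0] = -x3
Z[n1 + n2:, 1] = -y3
t = np.zeros(2)

omega = np.diag(w)                                 # simple working choice of distance matrix
w_c = gls_calibrate(w, Z, t, omega)
print(x1 @ w_c[:n1], x3 @ w_c[n1 + n2:])           # the two calibrated estimates of t_x agree
print(y2 @ w_c[n1:n1 + n2], y3 @ w_c[n1 + n2:])    # and so do the two estimates of t_y
```

Replacing the diagonal working choice with the matrix $\boldsymbol{\Omega}$ of the text would give the calibrated weights discussed there.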
We may now write the COR estimator formally as a calibration estimator and, using the subvector of calibrated weights for a single sample, obtain the components of the COR estimator directly in the simple linear form of weighted sample sums, as in common survey practice. Yet, a decomposition of the vector of calibrated weights based on the following general lemma on calibration gives an analytic expression of $\hat{\mathbf{t}}_x^{\mathrm{cor}}$ and $\hat{\mathbf{t}}_y^{\mathrm{cor}}$ of the form (2.2), which provides insight into the structure and the efficiency of the COR estimator. The proof of the lemma is given in the Appendix.
Lemma 1. Let $\mathbf{Z}$ be a design matrix of full rank, written in the partitioned form $\mathbf{Z} = \left(\mathbf{Z}_1, \mathbf{Z}_2\right)$ with corresponding vector of calibration totals $\mathbf{t} = \left(\mathbf{t}_1^{\prime}, \mathbf{t}_2^{\prime}\right)^{\prime}$, and let $\boldsymbol{\Omega}$ be any positive definite matrix of conformable dimension. Then the vector of calibrated weights $\mathbf{w}^{c}$ obtained from the calibration procedure involving the generalized least-squares distance measure and the constraint $\mathbf{Z}^{\prime}\mathbf{w}^{c} = \mathbf{t}$ can be decomposed, as in (2.8), into the sum of two components, one associated with $\left(\mathbf{Z}_1, \mathbf{t}_1\right)$ and one associated with $\left(\mathbf{Z}_2, \mathbf{t}_2\right)$. The first component can be written as in (2.9), in terms of a vector of weights generated by calibration of the design weights involving only $\mathbf{Z}_1$ and $\mathbf{t}_1$; by symmetry, an analogous expression holds for the second component.
By symmetry,
where
Now,
if
is as in (2.7),
with corresponding vector of calibration totals
and if
then it follows
from (2.9) that (2.8) can be written in the form
and thus
in obvious notation for
and
A similar
expression is obtained for
It is seen from
(2.12) that the COR estimator
of
is approximately
(for large samples) unbiased, and derives its efficiency from combining the two
elementary estimators
and
(pooling
information from samples
and
and from
borrowing strength from sample
through the
correlation between
and
In view of (2.10),
the estimator
takes the
alternative forms
where
are optimal
regression (OR) estimators incorporating the regression effect of the last term
in (2.12).
In non-nested matrix sampling, the two OR estimators in (2.13) have estimated approximate variances of standard regression form, and the combining coefficient is the one that minimizes the variance of their combination. From its explicit form it is then clear that the stronger the correlation between $\mathbf{x}$ and $\mathbf{y}$, the larger this coefficient and the more weight is given to the less variable component. In this connection, it can be easily shown that the variance of $\hat{\mathbf{t}}_x^{\mathrm{cor}}$ satisfies inequalities bounding it by the variance of each of the two components in (2.13); these inequalities hold also for any linear combination of the components of each of the estimators involved. The optimal composite regression estimator $\hat{\mathbf{t}}_x^{\mathrm{cor}}$ is therefore more efficient than each of its two components by the quantities appearing in these inequalities, with the efficiency depending on the strength of the correlation between $\mathbf{x}$ and $\mathbf{y}$.
The estimator $\hat{\mathbf{t}}_x^{\mathrm{cor}}$ is also more efficient than the composite estimator which does not incorporate the information on $\mathbf{y}$ (does not borrow strength from sample $s_2$) and whose estimated variance involves only the $\mathbf{x}$-information in $s_1$ and $s_3$. Indeed, writing the variance of $\hat{\mathbf{t}}_x^{\mathrm{cor}}$ in terms of the variance of this simpler composite estimator, it follows that borrowing strength from $s_2$ reduces the variance of the composite estimator of $\mathbf{t}_x$ by a factor which depends on the strength of the correlation between $\mathbf{x}$ and $\mathbf{y}$. It can be easily verified that for two scalar variables $x$ and $y$ and simple random sampling this result reduces to the analogous analytical result on the efficiency of the BLUE given in Chipperfield and Steel (2009, page 231). In this simple case the reduction factor is a function of the correlation $\rho$ between $x$ and $y$. As an illustration, assuming equal sample sizes and a given value of the correlation $\rho$, the efficiency gain is 13.96%.
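As a numeric check of the quoted figure, the following sketch is our reconstruction, not taken from the source: it assumes the scalar, non-nested case with three independent SRSWOR samples of equal size and a correlation of $\rho = 0.7$ between $x$ and $y$, the value of $\rho$ being an assumption chosen because it reproduces the 13.96% gain under these conditions.

```python
import numpy as np

rho = 0.7                              # assumed correlation between x and y (illustrative)
Vx, Vy = 1.0, 1.0                      # common variance scale of the HT estimators
c = rho * np.sqrt(Vx * Vy)             # covariance of the s3-based estimators of t_x and t_y

# Covariance matrix of (t_x1, t_x3, t_y2, t_y3): samples independent, except within s3.
V = np.array([[Vx, 0.0, 0.0, 0.0],
              [0.0, Vx, 0.0, c],
              [0.0, 0.0, Vy, 0.0],
              [0.0, c, 0.0, Vy]])
C = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])  # unbiasedness structure

var_blue = np.linalg.inv(C.T @ np.linalg.inv(V) @ C)   # covariance of the composite (t_x^c, t_y^c)
var_with_y = var_blue[0, 0]            # variance of the composite estimator of t_x using y
var_without_y = Vx / 2.0               # variance when the information on y is ignored

print(f"{100.0 * (1.0 - var_with_y / var_without_y):.2f}%")   # -> 13.96%
```

Under these assumptions the relative variance reduction works out to $\rho^{2}/\left(4 - \rho^{2}\right)$, which equals 13.96% at $\rho = 0.7$; this is offered only as a consistency check, not as the paper's derivation.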
In nested matrix sampling, the two estimators in (2.13) take analogous forms, with AC denoting approximate covariance. In this case, in addition to the correlation between $\mathbf{x}$ and $\mathbf{y}$ in sample $s_3$, the efficiency of $\hat{\mathbf{t}}_x^{\mathrm{cor}}$ depends on the correlations among the elementary estimators arising from the dependence of the subsamples. For univariate $x$ and $y$, and with the simplifying assumption of identical designs for the three subsamples (as in equal splitting of the full sample), we obtain some insight through simple expressions for these variances and covariances. Clearly, the estimator which ignores the information on $y$ is more efficient than the simple average of the single-sample estimators of $t_x$ only when there is negative correlation between those estimators. The efficiency of $\hat{t}_x^{\mathrm{cor}}$ relative to this estimator depends on the sign and size of the estimators' correlations and on the size of the correlation between $x$ and $y$.
Although the calibration procedure, with the vector of calibrated weights in (2.8), substantially facilitates the computation of the composite optimal regression estimator for any total of interest, the matrix $\boldsymbol{\Omega}$ makes the calculations exceedingly demanding, particularly in nested sampling, where the subsamples are dependent and thus $\boldsymbol{\Omega}$ is not block-diagonal. Besides, the second-order inclusion probabilities $\pi_{ij}$ are not known for most sampling designs. An alternative composite regression estimator that is computationally very efficient is developed in the next section.