3. Composite generalized regression estimation for design (c)

Takis Merkouris

A computationally very convenient, but generally suboptimal, variant of ${\hat{ℬ}}^{o}$ in (2.6) is obtained by replacing the matrix $Λ^{0}$ with the diagonal "weighting matrix" $Λ$ having $w_{i k} / q_{i k}$ as its $i k^{th}$ diagonal entry, where ${w_{i k}}$ are the design weights of $S_{i}$ and ${q_{i k}}$ are positive constants. This gives the multivariate composite generalized regression (CGR) estimator of ${({t^{'}}_{x}, {t^{'}}_{y})}^{'}$

$\begin{pmatrix} {\hat{X}}^{CGR} \\ {\hat{Y}}^{CGR} \end{pmatrix} = \hat{ℬ} \begin{pmatrix} {\hat{X}}_{1} \\ {\hat{Y}}_{2} \end{pmatrix} + (I - \hat{ℬ}) \begin{pmatrix} {\hat{X}}_{3} \\ {\hat{Y}}_{3} \end{pmatrix} = \begin{pmatrix} {\hat{X}}_{3} \\ {\hat{Y}}_{3} \end{pmatrix} + \hat{ℬ} \begin{pmatrix} {\hat{X}}_{1} - {\hat{X}}_{3} \\ {\hat{Y}}_{2} - {\hat{Y}}_{3} \end{pmatrix}, \qquad (3.1)$

where $\hat{ℬ} = ({X^{'}}_{3} Λ X) {(X^{'} Λ X)}^{- 1}$ is the associated matrix regression coefficient. For an extensive discussion of the generalized regression estimator in a single sample, see Särndal et al. (1992, Chapter 6). The CGR estimator may be compactly written as ${\hat{X}}^{CGR} = {\hat{X}}_{3} - \hat{ℬ} \hat{X} [= {(X_{3} - X {\hat{ℬ}}^{'})}^{'} w],$ i.e., as a sum of weighted sample regression residuals. The coefficient $\hat{ℬ}$ is optimal in the sense of generalized least squares, i.e., it minimizes the quadratic form ${(X_{3} - X {\hat{ℬ}}^{'})}^{'} Λ (X_{3} - X {\hat{ℬ}}^{'})$ in these residuals. Like the COR estimator, the CGR estimator can also be obtained in calibration form as ${X^{'}}_{3} c,$ where the vector $c = w + Λ X {(X^{'} Λ X)}^{- 1} (0 - X^{'} w)$ minimizes the generalized least-squares distance ${(c - w)}^{'} Λ^{- 1} (c - w)$ and satisfies the constraints ${\hat{X}}_{1}^{CGR} = {\hat{X}}_{3}^{CGR}$ and ${\hat{Y}}_{2}^{CGR} = {\hat{Y}}_{3}^{CGR} .$ This extends to the present context the well-known equivalence of generalized regression estimation and calibration estimation (Deville and Särndal 1992) for a single-sample setting. Now using the subvector of calibrated weights $c_{3},$ for sample $S_{3}$ only, we obtain the composite estimators in (3.1) in the simple linear forms ${\hat{X}}^{CGR} = {X^{'}}_{3} c_{3}$ and ${\hat{Y}}^{CGR} = {Y^{'}}_{3} c_{3} .$ Using Lemma 1 and the diagonal structure of $Λ,$ it can be shown that ${\hat{X}}^{CGR}$ can be written as
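The calibration form above can be illustrated numerically. The following is a minimal sketch, not the author's implementation: the data are simulated, the sample sizes and the choice $q_{i k} = 1$ are hypothetical, and the block layout and signs of the stacked design matrix (`Xd` below) are an assumption chosen so that the condition $X^{'} c = 0$ expresses the two constraints ${\hat{X}}_{1}^{CGR} = {\hat{X}}_{3}^{CGR}$ and ${\hat{Y}}_{2}^{CGR} = {\hat{Y}}_{3}^{CGR}$.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sample sizes, dimensions and population size (not from the source).
n1, n2, n3, p, r = 40, 50, 60, 2, 1
N = 1000.0

# Simulated data: x is observed in S1 and S3, y is observed in S2 and S3.
X1 = rng.normal(5.0, 1.0, size=(n1, p))
Y2 = rng.normal(3.0, 1.0, size=(n2, r))
X3 = rng.normal(5.0, 1.0, size=(n3, p))
Y3 = 0.5 * X3[:, :1] + rng.normal(0.5, 0.5, size=(n3, r))

# Design weights (equal-probability samples assumed) and constants q_ik = 1.
w = np.concatenate([np.full(n1, N / n1), np.full(n2, N / n2), np.full(n3, N / n3)])
q = np.ones_like(w)
lam = w / q  # diagonal of the weighting matrix Lambda

# Assumed block form of the stacked design matrix: Xd' c = 0 is equivalent to
# X1' c1 = X3' c3 and Y2' c2 = Y3' c3.
Xd = np.block([
    [X1, np.zeros((n1, r))],
    [np.zeros((n2, p)), Y2],
    [-X3, -Y3],
])

# Calibrated weights: c = w + Lambda X (X' Lambda X)^{-1} (0 - X' w).
XtLX = Xd.T @ (lam[:, None] * Xd)
c = w + lam * (Xd @ np.linalg.solve(XtLX, -Xd.T @ w))

# Composite estimates from the S3 subvector of calibrated weights.
c1, c2, c3 = np.split(c, [n1, n1 + n2])
x_cgr = X3.T @ c3
y_cgr = Y3.T @ c3

print("X-hat CGR:", x_cgr, " equals X1'c1:", X1.T @ c1)
print("Y-hat CGR:", y_cgr, " equals Y2'c2:", Y2.T @ c2)
```

Under these assumptions the calibrated totals of $x$ from $S_{1}$ and $S_{3}$ coincide, as do those of $y$ from $S_{2}$ and $S_{3},$ which is the content of the two constraints.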

${\hat{X}}^{CGR} = {\hat{B}}_{1 x} {\hat{X}}_{1} + (I - {\hat{B}}_{1 x}) {\hat{X}}_{3}^{GR}, \qquad (3.2)$

where ${\hat{X}}_{3}^{GR} = {\hat{X}}_{3} + {X^{'}}_{3} Λ Ψ {(Ψ^{'} Λ Ψ)}^{- 1} ({\hat{Y}}_{2} - {\hat{Y}}_{3})$ is the generalized regression (GR) counterpart of ${\hat{X}}_{3}^{OR} .$ The matrix regression coefficient ${\hat{B}}_{1 x}$ is written explicitly as ${\hat{B}}_{1 x} = {X^{'}}_{3} L_{Ψ} X {({X^{'}}_{1} Λ_{1} X_{1} + {X^{'}}_{3} L_{Ψ} X)}^{- 1},$ where ${X^{'}}_{3} L_{Ψ} X = {X^{'}}_{3} Λ_{3} X_{3} - {X^{'}}_{3} Λ_{3} Y_{3} {({Y^{'}}_{2} Λ_{2} Y_{2} + {Y^{'}}_{3} Λ_{3} Y_{3})}^{- 1} {Y^{'}}_{3} Λ_{3} X_{3} .$ If $x$ and $y$ were uncorrelated, or if information on $y$ were not used in the estimation of $t_{x},$ then we would have ${\hat{X}}_{3}^{GR} = {\hat{X}}_{3}$ and ${\hat{B}}_{1 x} = {X^{'}}_{3} Λ_{3} X_{3} {({X^{'}}_{1} Λ_{1} X_{1} + {X^{'}}_{3} Λ_{3} X_{3})}^{- 1} .$ But the GR estimator ${\hat{X}}_{3}^{GR}$ is generally more efficient than the HT estimator ${\hat{X}}_{3},$ and since ${X^{'}}_{1} Λ_{1} X_{1} + {X^{'}}_{3} L_{Ψ} X < {X^{'}}_{1} Λ_{1} X_{1} + {X^{'}}_{3} Λ_{3} X_{3}$ (in the partial ordering of non-negative definite matrices), it is clear that more weight is given to ${\hat{X}}_{3}^{GR}$ in (3.2), through $I - {\hat{B}}_{1 x} = {X^{'}}_{1} Λ_{1} X_{1} {({X^{'}}_{1} Λ_{1} X_{1} + {X^{'}}_{3} L_{Ψ} X)}^{- 1},$ than would have been given to the component estimator ${\hat{X}}_{3}$ in the simple composite estimator involving only information on $x .$ This suggests that the CGR estimator in (3.2), incorporating information from sample $S_{2},$ is a more efficient estimator.
The efficiency of ${\hat{X}}^{CGR}$ is also suggested by its alternative expression, obtained using (2.11), ${\hat{X}}^{CGR} = {\tilde{X}}^{CGR} + {X^{'}}_{3} L_{X} Ψ {(Ψ^{'} L_{X} Ψ)}^{- 1} [{\hat{Y}}_{2} - {\hat{Y}}_{3}^{GR}],$ where ${\tilde{X}}^{CGR} = {\hat{X}}_{3} + {X^{'}}_{3} Λ X {(X^{'} Λ X)}^{- 1} ({\hat{X}}_{1} - {\hat{X}}_{3}) = {\tilde{B}}_{1 x} {\hat{X}}_{1} + (I - {\tilde{B}}_{1 x}) {\hat{X}}_{3}$ is the composite regression estimator of $t_{x}$ using information on $x$ from $S_{1}$ and $S_{3} .$
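The decomposition (3.2) can be checked numerically against the calibration form. The sketch below is illustrative only: the data are simulated, the sample sizes and $q_{i k} = 1$ are hypothetical, and the stacked design matrix `Xd` uses an assumed block layout and sign convention under which $X^{'} c = 0$ encodes the two calibration constraints. The names `A`, `M`, `Cxx`, `Cxy` and `L` are shorthand introduced here for the cross-product matrices appearing in ${\hat{B}}_{1 x}.$

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical sizes and simulated data (not from the source).
n1, n2, n3, N = 30, 40, 50, 500.0
X1 = rng.normal(5.0, 1.0, (n1, 2))
Y2 = rng.normal(3.0, 1.0, (n2, 1))
X3 = rng.normal(5.0, 1.0, (n3, 2))
Y3 = 0.5 * X3[:, :1] + rng.normal(0.5, 0.5, (n3, 1))

# Equal-probability design weights; with q_ik = 1 the diagonal of Lambda_i is w_i.
w1, w2, w3 = (np.full(n, N / n) for n in (n1, n2, n3))

# Horvitz-Thompson estimates from each sample.
x1_ht, y2_ht = X1.T @ w1, Y2.T @ w2
x3_ht, y3_ht = X3.T @ w3, Y3.T @ w3

# Cross-product matrices entering B_1x.
A = X1.T @ (w1[:, None] * X1)                                  # X1' L1 X1
M = Y2.T @ (w2[:, None] * Y2) + Y3.T @ (w3[:, None] * Y3)      # Y2' L2 Y2 + Y3' L3 Y3
Cxx = X3.T @ (w3[:, None] * X3)                                # X3' L3 X3
Cxy = X3.T @ (w3[:, None] * Y3)                                # X3' L3 Y3
L = Cxx - Cxy @ np.linalg.solve(M, Cxy.T)                      # X3' L_Psi X3

# GR estimator from S3 and the composite form (3.2).
x3_gr = x3_ht + Cxy @ np.linalg.solve(M, y2_ht - y3_ht)
B1x = L @ np.linalg.inv(A + L)
x_cgr_32 = B1x @ x1_ht + (np.eye(2) - B1x) @ x3_gr

# The same estimate from the calibration weights (assumed stacked design matrix).
Xd = np.block([[X1, np.zeros((n1, 1))], [np.zeros((n2, 2)), Y2], [-X3, -Y3]])
w = np.concatenate([w1, w2, w3])
c = w - w * (Xd @ np.linalg.solve(Xd.T @ (w[:, None] * Xd), Xd.T @ w))
x_cgr_cal = X3.T @ c[n1 + n2:]

# Weight given to the S3 component when y-information is ignored.
B1x_simple = Cxx @ np.linalg.inv(A + Cxx)

print("(3.2):      ", x_cgr_32)
print("calibration:", x_cgr_cal)
print("trace of I - B1x:", np.trace(np.eye(2) - B1x),
      "vs simple composite:", np.trace(np.eye(2) - B1x_simple))
```

In this sketch the two routes give the same estimate, and the trace comparison gives a scalar summary of the larger weight placed on the $S_{3}$ component once information on $y$ is used.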

In general, the computationally simpler CGR estimator $({\hat{X}}^{CGR}, {\hat{Y}}^{CGR}),$ involving the coefficient $\hat{ℬ},$ is less efficient than the optimal composite regression estimator $({\hat{X}}^{COR}, {\hat{Y}}^{COR})$ which involves the estimated optimal coefficient ${\hat{ℬ}}^{o}$ and has the same asymptotic variance as the BLUE in (2.3); the efficiency loss may be larger in nested matrix sampling, for which the matrix $Λ^{0}$ is not block-diagonal. On the other hand, $({\hat{X}}^{COR}, {\hat{Y}}^{COR})$ may be unstable in small samples, when there is a small number of degrees of freedom available for the estimation of ${\hat{ℬ}}^{o},$ which is particularly so in nested matrix sampling; for a discussion of the relative stability of the optimal versus the generalized regression estimator in the single-sample case see Rao (1994) or Montanari (1998). For certain sampling strategies, described in the following theorem, $\hat{ℬ} = {\hat{ℬ}}^{o}$ and the CGR estimator is the COR estimator, and asymptotically is BLUE; the proof is given in the Appendix.

Theorem 1 Consider the following sampling strategies.

Non-nested design

$(a)$ For all three samples $S_{1}, S_{2}$ and $S_{3}$ assume stratified simple random sampling without replacement (STRSRS) with sampling fraction $f_{i h} = n_{i h} / N_{i h}$ in stratum $h$ of sample $i,$ $h = 1, \dots, H_{i},$ with $N_{i h}$ denoting stratum size, and specify the constants $q_{i k}$ in $Λ_{i}$ as $q_{i k} = (n_{i h} - 1) / [N_{i h} (1 - f_{i h})]$ for all units of stratum $h .$ Furthermore, assume that within each sample the units are sorted by stratum, and consider the augmented design matrix $Z = (X, D)$ in (2.7), where $D$ is the block diagonal matrix $\text{diag} \{D_{1}, D_{2}, D_{3}\}$ and $D_{i}$ is the diagonal matrix $\text{diag} \{1_{i 1}, \dots, 1_{i h}, \dots, 1_{i H_{i}}\},$ with diagonal element $1_{i h}$ being a vector of ones for all units of stratum $h$ in sample $S_{i},$ and consider the corresponding augmented vector of calibration totals $t_{Z} = {(0^{'}, 0^{'}, {N^{'}}_{1}, {N^{'}}_{2}, {N^{'}}_{3})}^{'},$ where $N_{i}$ is the vector of strata sizes for sample $S_{i} .$
$(b)$ For all three samples $S_{1}, S_{2}$ and $S_{3}$ assume stratified Poisson sampling and specify the constants $q_{i k}$ in the entries of $Λ_{i}$ as $q_{i k} = π_{i h k} / (1 - π_{i h k})$ for the units of stratum $h,$ where $π_{i h k}$ is the inclusion probability of unit $k$ in stratum $h$ of the $i^{th}$ survey.

Nested design

$(a')$ Assume that an initial stratified simple random sample $S$ is split by stratum into three simple random subsamples $S_{1}, S_{2}$ and $S_{3} .$ Specify the sampling fractions $f_{i h},$ the constants $q_{i k}$ in $Λ_{i},$ the design matrix $Z = (X, D)$ and the vector of calibration totals $t_{Z}$ as in part $(a) .$
$(b')$ Assume that an initial stratified Poisson sample $S$ is randomly split by stratum into three subsamples $S_{1}, S_{2}$ and $S_{3},$ with unequal inclusion probabilities for the units of each subsample. Specify the constants $q_{i k}$ in $Λ_{i}$ as $q_{i k} = π_{i h k} / (1 - π_{i h k})$ for the units of stratum $h,$ where $π_{i h k}$ is the marginal inclusion probability of unit $k$ in stratum $h$ of the $i^{th}$ subsample.

Under each of strategies $(a)$ and $(b),$ the calibration procedure with matrix $Λ$ in the least-squares distance measure gives the CGR estimator in (3.1) with $\hat{ℬ} = {\hat{ℬ}}^{o},$ implying that the CGR estimator is the COR estimator. For $(a')$ and $(b'),$ this holds approximately when the strata sampling fractions are approximately zero.
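The constants $q_{i k}$ specified in the theorem are simple functions of the design quantities. The sketch below computes them for one stratum; the function names `q_strsrs` and `q_poisson` and the numerical values are hypothetical, and the STRSRS formula is read as $(n_{i h} - 1)$ divided by the product $N_{i h} (1 - f_{i h}).$ Under that reading, the diagonal entry $w_{i k} / q_{i k}$ of $Λ_{i}$ equals $N_{i h}^{2} (1 - f_{i h}) / \{n_{i h} (n_{i h} - 1)\}$ for STRSRS and $(1 - π_{i h k}) / π_{i h k}^{2}$ for Poisson sampling, which are the familiar multipliers in the corresponding variance estimators.

```python
import numpy as np


def q_strsrs(n_h, N_h):
    """q_ik for units of a stratum under STRSRS: (n_h - 1) / (N_h * (1 - f_h))."""
    f_h = n_h / N_h
    return (n_h - 1) / (N_h * (1 - f_h))


def q_poisson(pi):
    """q_ik under (stratified) Poisson sampling: pi / (1 - pi)."""
    pi = np.asarray(pi, dtype=float)
    return pi / (1 - pi)


# Hypothetical stratum: n_h = 20 units sampled out of N_h = 400.
n_h, N_h = 20, 400
w_srs = N_h / n_h                       # design weight
lam_srs = w_srs / q_strsrs(n_h, N_h)    # diagonal entry of Lambda
srs_multiplier = N_h**2 * (1 - n_h / N_h) / (n_h * (n_h - 1))

# Hypothetical inclusion probabilities under Poisson sampling.
pi = np.array([0.2, 0.5])
lam_pois = (1 / pi) / q_poisson(pi)
pois_multiplier = (1 - pi) / pi**2

print(lam_srs, srs_multiplier)
print(lam_pois, pois_multiplier)
```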

Corollary 1 The result of Theorem 1 holds also for the unstratified versions of all four designs. For simple random sampling without replacement (SRS), in particular, the matrix $D$ reduces to the diagonal matrix $\text{diag} \{1_{1}, 1_{2}, 1_{3}\}$ having as its $i^{th}$ diagonal element the $n_{i}$-dimensional unit vector $1_{i},$ and the vector of calibration totals is then $t_{Z} = {(0^{'}, 0^{'}, N, N, N)}^{'} .$

Corollary 2 In non-nested sampling, when the sampling design for each of the three samples is one of the designs in $(a)$ and $(b)$ or one of their unstratified versions, but not the same for all samples, the result of Theorem 1 holds provided that the matrix $D$ in $Z$ and the vector $t_{Z}$ are reduced so as to correspond only to the samples for which SRS or STRSRS is used.

The extended calibration scheme in Theorem 1 $(a, a')$ includes calibration to the stratum sizes (or to the population size in the SRS version), through the inclusion of an intercept for each stratum in the design matrix $X .$ No additional information is used beyond what is assumed in the sampling design in $(a)$ and $(a'),$ and the form of the resulting CGR estimator remains the same as in (3.1) because the HT estimates of the population and strata sizes are exact. The effect of this extended calibration (with the specified values of $q_{i k}$) is only to convert the CGR coefficient $\hat{ℬ}$ to the optimal coefficient ${\hat{ℬ}}^{o}$ and, thus, the CGR estimator to the COR estimator. The practical significance of this conversion lies in carrying out optimal composite regression estimation through the much simpler calibration procedure of generalized regression estimation.
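For the SRS version, the extended calibration amounts to appending one column of ones per sample to the design matrix and calibrating to $t_{Z} = {(0^{'}, 0^{'}, N, N, N)}^{'}.$ The following sketch illustrates only the mechanics, with simulated data, hypothetical sample sizes, scalar $x$ and $y,$ and an assumed block layout and sign convention for the stacked matrix `Xd`; it does not attempt to verify the optimality result of Theorem 1. The calibration formula used is the standard one for a general vector of totals, $c = w + Λ Z {(Z^{'} Λ Z)}^{- 1} (t_{Z} - Z^{'} w).$

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical population size and SRS sample sizes (not from the source).
N = 2000.0
sizes = (40, 50, 60)
n1, n2, n3 = sizes
n = sum(sizes)

# Simulated scalar x (in S1, S3) and scalar y (in S2, S3).
X1 = rng.normal(5.0, 1.0, (n1, 1))
Y2 = rng.normal(3.0, 1.0, (n2, 1))
X3 = rng.normal(5.0, 1.0, (n3, 1))
Y3 = 0.5 * X3 + rng.normal(0.5, 0.5, (n3, 1))

# SRS design weights N / n_i and constants q_ik = (n_i - 1) / (N (1 - f_i)).
w = np.concatenate([np.full(m, N / m) for m in sizes])
q = np.concatenate([np.full(m, (m - 1) / (N * (1 - m / N))) for m in sizes])
lam = w / q  # diagonal of Lambda

# Assumed stacked design matrix for the two composite constraints.
Xd = np.block([[X1, np.zeros((n1, 1))], [np.zeros((n2, 1)), Y2], [-X3, -Y3]])

# D = diag{1_1, 1_2, 1_3}: one indicator column per sample.
D = np.zeros((n, 3))
D[:n1, 0] = 1.0
D[n1:n1 + n2, 1] = 1.0
D[n1 + n2:, 2] = 1.0

# Augmented design matrix and calibration totals.
Z = np.hstack([Xd, D])
t_Z = np.array([0.0, 0.0, N, N, N])

# Calibrated weights: c = w + Lambda Z (Z' Lambda Z)^{-1} (t_Z - Z' w).
ZtLZ = Z.T @ (lam[:, None] * Z)
c = w + lam * (Z @ np.linalg.solve(ZtLZ, t_Z - Z.T @ w))

print("HT estimates of N per sample:", D.T @ w)   # exact under SRS
print("calibrated totals Z'c:", Z.T @ c)
print("X-hat, Y-hat from S3:", X3.T @ c[n1 + n2:], Y3.T @ c[n1 + n2:])
```

Because the design weights already reproduce $N$ exactly in each sample, the last three entries of $t_{Z} - Z^{'} w$ are zero, which is why no information beyond the design is used.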

Subsampling as in part $(a'),$ with a priori fixed sample sizes, is a natural procedure in matrix sampling involving splitting a questionnaire. In contrast, in the subsampling scheme of part $(b'),$ $n_{i}$ is the expected sample size of $S_{i},$ the actual size being random. Unequal subsampling probabilities may be determined adaptively for increased efficiency; see Gonzalez and Eltinge (2008).

The results of Theorem 1 could extend to other sampling designs, e.g., stratified two-stage simple random sampling in non-nested matrix sampling. However, the required adjustments in the matrices $Λ_{i}$ would not be easier than using directly the matrices $Λ_{i}^{0}$ in the calibration to obtain the optimal composite regression estimator.

For sampling designs other than those assumed in Theorem 1, the value of $q_{i k}$ in the entries of $Λ_{i}$ should be set to $q_{i k} = {\tilde{n}}_{i} / ({\tilde{n}}_{1} + {\tilde{n}}_{2} + {\tilde{n}}_{3}),$ where ${\tilde{n}}_{i} = n_{i} / d_{i}$ is the effective sample size of $S_{i},$ with $d_{i}$ denoting the design effect, to take into account the differential in effective sample sizes among the three samples. If the same design is used for all samples, then ${\tilde{n}}_{i} = n_{i} .$ The justification for this adjustment is based on the argument given in Merkouris (2010) for a similar problem of composite regression estimation.
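As a small worked illustration of this adjustment, the sketch below computes the sample-level constants from sample sizes and design effects. The function name `q_effective` and the numerical values are hypothetical.

```python
import numpy as np


def q_effective(n, deff):
    """Return q_i = n~_i / (n~_1 + n~_2 + n~_3), with n~_i = n_i / d_i."""
    n_tilde = np.asarray(n, dtype=float) / np.asarray(deff, dtype=float)
    return n_tilde / n_tilde.sum()


# Hypothetical sample sizes and design effects for S1, S2, S3.
n = [400, 600, 1000]
deff = [2.0, 1.5, 1.0]

q = q_effective(n, deff)  # effective sizes are 200, 400 and 1000
print(q)
```

Each unit $k$ of sample $S_{i}$ receives the same value $q_{i k} = q_{i},$ and the three values sum to one.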

Date modified: 2015-11-27

Survey Methodology