Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please contact us to request a format other than those available.
Survey steps >EstimationScope and purpose Measures of precision are usually computed to evaluate the quality
of a population parameter estimate and to obtain valid inferences. Although
the quality of the computed estimates is in large part dependent on the
preceding survey steps, the choice of an estimation method also plays
an important role. In particular, auxiliary data can be used judiciously
to improve the precision of these estimates. The estimation method and the sampling design determine the properties of the sampling error. Criteria to evaluate the magnitude of the sampling error include the sampling bias and the sampling variance. Estimation methods that result in both the smallest bias and the smallest sampling variance should be chosen. Design consistency is another desirable property of an estimate. The basic design-consistent Horvitz-Thompson estimator is the most natural estimator to use if there is no auxiliary information available at the estimation stage. It weights data with the inverses of the inclusion probabilities of the sampled units. Such a weight is called a sampling weight. The sampling weight can be interpreted as the number of times that each sampled unit should be replicated to represent the full population. The properties of the Horvitz-Thompson estimator can be improved when auxiliary information is available. Calibration is a procedure that can be used to incorporate auxiliary data. This procedure adjusts the sampling weights by multipliers known as calibration factors that make the estimates agree with known totals. The resulting weights are called calibration weights or final estimation weights. These calibration weights will generally result in estimates that are design consistent, and that have a smaller sampling variance than the Horvitz-Thompson estimator. If there is nonresponse, the observed sample is smaller in size than the original sample selected. To compensate for nonresponse, imputation (see section on Imputation) or reweighting can be performed. Reweighting consists of adjusting the sampling weights by nonresponse adjustment factors before applying the calibration technique. The basic principle in computing the nonresponse adjustment factors is to use the inverse of the response probabilities. However, response probabilities are unknown and must be estimated, as opposed to inclusion probabilities, which are known. The key to reducing nonresponse bias and nonresponse variance is to obtain a useful nonresponse model by taking advantage of the auxiliary information available, as much as possible. Guidelines
References Binder, D.A. and Hidiroglou, M.A. (1988). Sampling in time. Handbook of Statistics, P.K. Krishnaish and C.R. Rao (eds.), 187 211. Brick, J., Morganstein D. and Valliant R. (2000). Analysis of complex sample data using replication. See also http://www.westat.com/wesvar/techpapers. Brogan, D. (1998). Software for sample survey data, misuse of standard
packages. In Encyclopedia of Biostatistics, Volume 5
(P. Armitage and T. Colton, Eds.). Wiley, New York, 4167-4174. See also Chambers, R.L., Kokic P., Smith P. and Crudas M. (2000). Winsorization for identifying and treating outliers in business surveys. Proceedings of the Second International Conference on Establishment Surveys, June 17-21, 2000, Buffalo, New York, 717-726. Cochran, W.G. (1977). Sampling Techniques. Wiley, New York. Deville, J.-C. and Särndal, C.E. (1992). Calibration estimators in survey sampling. Journal of the American Statistical Association, 87, 376-382. Duchesne, P. (1999). Robust calibration estimators. Survey Methodology, 25, 43-56. Eltinge, J.L. and Yansaneh, I.S. (1997). Diagnostics for formation of nonresponse adjustment cells, with an application to income nonresponse in the U.S. Consumer Expenditure Survey. Survey Methodology, 23, 33-40. Estevao, V., Hidiroglou, M.A. and Särndal, C.E. (1995). Methodological principles for a generalized estimation system at Statistics Canada. Journal of Official Statistics, 11, 181-204. Franklin, Sarah F., Thomas, S. and Brodeur, M. (2000). Robust multivariate outlier detection using Mahalanobis’ distance and modified Stahel-Donoho estimators. Proceedings of the Second International Conference on Establishment Surveys, June 17-21, 2000, Buffalo, New York, 697-706. Fuller, W.A. (1991). Simple estimators of the mean of skewed populations. Statistica Sinica, 1, 137-158. Fuller, W.A. (2002). Regression estimation for survey samples. Survey Methodology, 28, 5-23. Fuller, W.A. and Rao, J.N.K. (2001). A regression composite estimator with application to the Canadian Labour Force Survey. Survey Methodology, 27, 45-52. Gambino, J., Kennedy, B. and Singh, M.P. (2001). Regression composite estimation for the Canadian Labour Force Survey: evaluation and implementation. Survey Methodology, 27, 65-74. Ghosh, M. and Rao, J.N.K. (1994). Small area estimation: an appraisal. Statistical Science, 9, 55-93. Gwet, J.-P. and Lee, H., (2000). An evaluation of outlier-resistant procedures in establishment surveys. Proceedings of the Second International Conference on Establishment Surveys, June 17-21, 2000, Buffalo, New York, 707-716. Hidiroglou, M.A. (2001). Double sampling. Survey Methodology, 27, 143-154. Hidiroglou, M.A. and Laniel, N. (2001). Sampling and estimation issues for annual and sub-annual Canadian business surveys. International Statistical Review, 69, 487-504. Hidiroglou, M.A. and Särndal, C.E. (1998). Use of auxiliary information for two-phase sampling. Survey Methodology, 24, 11-20. Hidiroglou, M.A. and Srinath, K.P. (1981). Some estimators of population total containing large units. Journal of the American Statistical Association, 47, 663-685. Holt, D. and Smith, T.M.F. (1979). Post stratification. Journal of the Royal Statistical Society, A 142, 33 46. Huang, E. and Fuller, W.A. (1978). Nonnegative regression estimation for sample survey data. Proceedings of the Social Statistics Section, American Statistical Association, 300 303. Hulliger, B. (1995). Outlier robust Horvitz-Thompson estimators. Survey Methodology, 21, 79-87. Kass, G.V. (1980). An exploratory technique for investigating large quantities of categorical data, Applied Statistics, 29, 119-127. Lohr, S. (1999). Sampling: Design and Analysis. Duxbury Press. Lavallée, P. (2001). La méthode généralisée du partage des poids et le calage sur marges. In Enquêtes, modèles et applications, J.-J. Droesbeke and L. Lebart (eds), Paris: Dunod, 396-403. Lavallée, P. (2002). Le sondage indirect. Éditions de l’Université de Bruxelles. Lavallée, P. and Caron, P. (2001). Estimation using the generalized weight share method: the case of record linkage. Survey Methodology, 27, 155-170. Lee, H. (1995). Outliers in business surveys. In Business Survey Methods, B.G. Cox et al. (eds). Wiley, New York, 503-526. Lemaître, G. and Dufour, J. (1987). An integrated method for weighting persons and families. Survey Methodology, 13, 199-207. Patak, Z. (1990). Robust principal component analysis via projection pursuit. Master’s thesis, University of British Columbia, Canada. Platek, R., Rao, J.N.K., Särndal, C.E. and Singh, M.P. (eds.) (1987). Small Area Statistics. Wiley, New York. Rao, J.N.K. (1996). On the estimation with imputed survey data. Journal of the American Statistical Association, 91, 499 506. Rao, J.N.K. (1999). Some recent advances in model-based small area estimation. Survey Methodology, 25, 175-186. Särndal, C.E., Swensson, B. and Wretman, J.H. (1992). Model Assisted Survey Sampling. Springer-Verlag, New York. Schnell, D., Kennedy, W.J., Sullivan, G, Park, H.J. and Fuller, W.A. (1988). Personal computer variance software for complex surveys. Survey Methodology, 14, 59-69. See also http://www.statlab.iastate.edu/survey/software/pccarp.html. Shah, B.V., Barnwell, B.G. and Bieler, G.S. (1997). SUDAAN User’s Manual Release 7.5. Research Triangle Institute, North Carolina. See also http://www.rti.org/sudaan/home.cfm. Singh, A.C., Kennedy, B. and Wu, S. (2001). Regression composite estimation for the Canadian Labour Force Survey with a rotating design. Survey Methodology, 23, 33-44. Singh, A.C. and Mohl, C. (1996). Understanding calibration estimators in survey sampling. Survey Methodology, 22, 107-115. Singh, M.P., Gambino, J. and Mantel, H. (1994). Issues and strategies for small area data. Survey Methodology, 20, 3-14. Singh, M.P., Hidiroglou, M.A., Gambino, J. and Kovacevic, M. (2001). Estimation methods and related systems at Statistics Canada. International Statistical Review, 69, 461-486. STATA Corporation (1997). STATA User’s Guide. STATA Press, College Station, TX. See http://www.fas.harvard.edu/~stats/survey-soft/iass.html#stata. Stukel, D., Hidiroglou, M.A., and Särndal, C.-E. (1996). Variance estimation for calibration estimators: a comparison of Jackknifing versus Taylor series linearization. Survey Methodology, 22, 117-125. Thompson, M.E. (1997). Theory of Sample Surveys. Chapman and Hall. Valliant, R., Dorfman, A.H., and Royall, R.M. (2000). Finite Population Sampling and Inference. Wiley, New York. Yung, W. and Rao, J.N.K. (1996). Jackknife linearization variance estimators under multistage sampling. Survey Methodology, 22, 23-32. |
|