Optimum allocation for a dual-frame telephone survey 6. Summary

We investigated two designs for a dual-frame telephone survey: a take-all protocol in which every respondent in the cell-phone sample is interviewed and a screening protocol in which respondents in the cell-phone sample are screened for phone status and only CPO respondents are interviewed. For each design, we derived the optimum allocation of the overall survey resources to the two sampling frames.

We studied the allocation problem given the two traditional meanings of the word “optimum”: (1) to minimize variance subject to a constraint on data collection cost, and (2) to minimize data collection cost subject to a constraint on variance. Given fixed variance, we find that the screening approach tends to achieve lower total cost than the take-all approach when the per-unit cost of screening is low relative the unit cost of the survey interview. The take-all approach can achieve the lower total cost when the per-unit cost of screening is relatively high. Similarly, given fixed total cost, the screening protocol tends to be the more efficient approach when the per-unit cost of screening is relatively low, and the take-all protocol can be the more efficient approach as the per-unit cost of screening rises. Both the landline and cell-phone samples have the capacity to produce estimators for the dual-user domain, while only the cell-phone sample can produce estimators for the CPO domain. Thus, when screening is relatively inexpensive on a per-unit basis, then it should be used to produce the largest possible sample from the CPO domain. But when screening is relatively expensive, then it is better to avoid the screening step and invest the survey resources in a larger interview sample. These results were obtained under an assumption of simple random sampling, and they may not carry over exactly to other sampling designs.

The take-all design results in two estimators for the dual-user domain, which are combined using factors of p MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbaaaa@36FC@ and 1 p MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaaIXaGaeyOeI0IaamiCaaaa@39C9@ for the estimators from the landline and cell-phone samples, respectively. We studied the optimum choice of p MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbaaaa@3821@ and gave expressions for reasonable compromise values of p . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbGaaiOlaaaa@38D3@ When variance (or cost) is considered as a function of p , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbGaaiilaaaa@38D1@ we found that it is fairly flat in a neighborhood of the optimum. The optimum allocation itself is a function of p MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbaaaa@3821@ and we found that the allocation is relatively insensitive to choices of p MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbaaaa@3821@ within a broad neighborhood of the optimum p . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGWbGaaiOlaaaa@38D3@

We initiated this work before 2010 at a time when the CPO population in the U.S. was only a fifth to a quarter of the total population of households. At that time it made sense to contemplate a protocol in which the larger landline sample is interviewed in its entirety and the smaller cell-phone sample is screened for CPO status. At this writing, however, the CPO population comprises more than a third of the total population of households and it is still growing. It has become reasonable to consider a new screening protocol in which the landline sample is screened for telephone status and only LLO respondents are interviewed. The foregoing allocations and findings apply to this new protocol by symmetry.

We illustrated the optimum allocations and the two interviewing protocols using the 2011 National Immunization Survey. The survey is designed to minimize cost under a fixed variance constraint. The NIS results are limited to the population of children age 19 35 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9Lq=Je9 vqaqFeFr0xbbG8FaYPYRWFb9fi0FXxbbf9=e0dfrpm0dXdHqVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaGaaGymaiaaiM dacaaMc8UaeyOeI0IaaGPaVlaaiodacaaI1aaaaa@3E09@  months. Similar results may or may not obtain for a general population survey or for a survey with a different structure of per-unit costs.

Acknowledgements

The authors’ kindly acknowledge suggestions for improved readability offered by the Associate Editor and referees. Disclaimer: The findings and conclusions in this paper are those of the author(s), and do not necessarily represent the official position of the Centers for Disease Control and Prevention.

References

Biemer, P.P. (1984). Methodology for optimal dual frame sample design. Bureau of the Census SRD Research Report CENSUS/SRD/RR-84/07 available at www.census.gov/srd/papers/pdf/rr84-07.pdf.

Blumberg, S.J., and Luke, J.V. (2010). Wireless substitution: Early release of estimates from the National Health Interview Survey, January MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9IqqrpepC0xbbL8F4rqqr=hbbG8pue9Fbe9q8 qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9Fve9 Fve8meqabeqadiqaceGabeWabeWabeqaeeaakeaaieaajugybabaaa aaaaaapeGaa83eGaaa@3853@ June 2010. National Center for Health Statistics. December 2010. Available at http://www.cdc.gov/nchs/nhis/releases.htm.

Blumberg, S.J., and Luke, J.V. (2013). Wireless substitution: Early release of estimates from the National Health Interview Survey, July-December 2012. National Center for Health Statistics. June 2013. Available at http://www.cdc.gov/nchs/nhis/releases.htm.

CDC (2012). National Immunization Survey: A User’s Guide for the 2011 Public Use Data File. Available at http://www.cdc.gov/nchs/nis/data_files.htm.

Cochran, W.G. (1977). Sampling Techniques, 3rd Edition, New York: John Wiley & Sons, Inc.

Fuller, W.A., and Burmeister, L.F. (1972). Estimators for samples from two overlapping frames. Proceedings of the Social Statistics Section, American Statistical Association, 245-249.

Hartley, H.O. (1962). Multiple-frame surveys. Proceedings of the Social Statistics Section, American Statistical Association, 203-206.

Hartley, H.O. (1974). Multiple frame methodology and selected applications. Sankhyā, Series C, 36, 99-118.

Lepkowski, J.M., and Groves, R.M. (1986). A mean squared error model for multiple frame, mixed mode survey design. Journal of the American Statistical Association, 81, 930-937.

Lohr, S.L., and Rao, J.N.K. (2000). Inference from dual frame surveys. Journal of the American Statistical Association, 95, 271-280.

Lohr, S.L., and Rao, J.N.K. (2006). Estimation in multiple-frame surveys. Journal of the American Statistical Association, 101, 1019-1030.

Skinner, C.J., and Rao, J.N.K. (1996). Estimation in dual frame surveys with complex designs. Journal of the American Statistical Association, 91, 349-356.

Smith, P.J., Hoaglin, D.C., Battaglia, M.P., Khare, M. and Barker, L.E. (2005). Statistical Methodology of the National Immunization Survey: 1994-2002. National Center for Health Statistics, Vital and Health Statistics, 2(138).

Wolter, K.M., Smith, P. and Blumberg, S.J. (2010). Statistical foundations of cell-phone surveys. Survey Methodology, 36, 2, 203-215.

Date modified: