A few remarks on a small example by Jean-Claude Deville regarding non-ignorable non-response
Section 6. DiscussionA few remarks on a small example by Jean-Claude Deville regarding non-ignorable non-response
Section 6. Discussion
Deville’s example is especially welcome since,
for both models, the three estimation methods provide exactly the same
estimators. Obviously, if the model is more complicated, using the maximum
likelihood method becomes cumbersome, if not impossible. The calibration and
generalized calibration method works in all cases as long as the number of
calibration variables whose totals are known is sufficient and the matrix
is invertible. In this example,
the determinant of this matrix appears in the denominator of the estimators.
Therefore, a small determinant makes the estimates especially risky. Lesage and
Haziza (2015) recommend verifying that the correlations between variables
and
are
great enough to avoid potentially amplifying the bias.
If the variables are quantitative, the
solutions will depend on the calibration function used
The use
of the calibration function
is
recommended, since it has the advantage of providing weights greater than 1.
The inverse of the weights can now be interpreted as a response probability
estimated using a logistic model.
The main difficulty is obviously choosing
between the two proposed models. In Deville’s example, it may seem more
“logical” to see the non-response depend rather on drug use than on gender.
However, we are not well equipped to make a choice between the two models. The
values of the two likelihood functions for the estimated parameters are equal.
Is it possible to choose the model based on more than a strong conviction? As
suggested in Haziza and Lesage (2016), we recommend always calculating both
weightings and comparing the weights and estimates obtained with each of them.
One option may be to calculate an indicator of
the dispersion of the response probabilities, such as the variance. For
example, if the variance is great, it means that the model has made it possible
to calculate response probabilities with greater contrast between individuals
and that the model has therefore taken better account of the non-response.
Validation through a search for contrasting weights is the basis for
identifying response homogeneity groups (RHGs) for all segmentation methods,
for example with the chi-square automatic interaction detector (CHAID)
algorithm developed by Kass (1980). For example, with CHAID, in each step the
RHGs are split based on categories that result in response probabilities with
the greatest contrast. By using the same principle in choosing the model, we
can select the model that provides the weights with the greatest contrast. For
example, if the variance is small, it means that the non-response model could
not highlight the differences in non-response probabilities between
individuals. Incidentally, the variance in response probabilities is the square
of the R-indicator defined by Schouten, Cobben and Bethlehem (2009), used here
to choose a non-response model.
In both cases, the average response probability
equals 0.5. Specifically,
and
For the MAR
model, the variance is
For the NMAR
model, the variance is
The greater variance of the NMAR
model is an argument in its favour. In fact, the response probabilities show
much greater contrast.
Acknowledgements
The author thanks
Audrey-Anne Vallée for her meticulous proofreading of an earlier version of
this text and an anonymous referee for their especially pertinent comments.
References
Chang, T., and
Kott, P.S. (2008). Using calibration weighting to adjust for nonresponse under
a plausible model. Biometrika, 95, 555-571.
Deville, J.-C.
(2000). Generalized calibration and application to weighting for non-response. In Compstat - Proceedings in Computational Statistics: 14th Symposium held in Utrecht,Netherlands,
pages 65-76, New York: Springer.
Deville, J.-C. (2002). La correction de la
nonréponse par calage généralisé. In the Actes des Journées de Méthodologie
Statistique, Paris. Insee-Méthodes.
Kott, P.S., and
Chang, T. (2010). Using calibration weighting to adjust for nonignorable unit
nonresponse. Journal of the American Statistical Association, 105(491),
1265-1275.
Lesage, E., and
Haziza, D. (2015). On the problem of bias and variance amplification of the
instrumental calibration estimator in the presence of unit nonresponse. Under
revision for Journal of Survey Statistics and Methodology.
Schouten, B.,
Cobben, F. and Bethlehem, J. (2009). Indicators
for the representativeness of survey response. Survey Methodology,
35, 1, 101-113. Paper available at http://www.statcan.gc.ca/pub/12-001-x/2009001/article/10887-eng.pdf.
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.