2. One-step calibration weighting

Phillip S. Kott and Dan Liao

2.1 Calibration weighting and unit nonresponse

In the absence of nonresponse (or frame errors), calibration weighting is a sampling-weight-adjustment method that creates a set of weights $\{w_{k}; k \in S\},$ asymptotically close to the original design weights, $d_{k} = 1 / π_{k},$ that satisfy a set of calibration equations (one for each component of $z_{k}$):

$\sum_{S} w_{k} z_{k} = \sum_{U} z_{k},$

where $S$ denotes the sample, $π_{k}$ the sample-selection probability of unit $k,$ $U$ the population of size $N,$ $z_{k}$ a vector with $P$ components each having a known population total, and $\sum_{A}$ means $\sum_{k \in A}.$

Kott (2009) describes a conservative set of mild conditions under which $t_{y} = \sum_{S} w_{k} y_{k}$ is a nearly unbiased estimator for the population total $T_{y} = \sum_{U} y_{k}$ (i.e., the relative bias of $t_{y}$ is asymptotically zero). Most importantly, each $π_{k} N / n$ is assumed to be bounded from below by a positive value as $N$ and the (expected) sample size, $n,$ grow arbitrarily large (we add the parenthetical “expected” in case the sample size is random).

In addition, the first four central population moments of each component of $z_{k}$ are assumed to be bounded from above, while $N^{- 1} \sum_{U} z_{k} z_{k}^{T}$ converges to a positive definite matrix.

Calibration weighting will tend to reduce mean squared error relative to the expansion estimator, $t_{y}^{E} = \sum_{S} d_{k} y_{k},$ when $y_{k}$ is correlated with some components of $z_{k}.$ One should keep in mind, however, that most surveys have many $y_{k}$’s.

A simple way to compute calibration weights is linear calibration, which uses the following formula:

$\begin{array}{lcl} w_{k} & = & d_{k} \left[1 + {\left(\sum_{U} z_{j} - \sum_{S} d_{j} z_{j}\right)}^{T} {\left(\sum_{S} d_{j} z_{j} z_{j}^{T}\right)}^{- 1} z_{k}\right] \\ & = & d_{k} [1 + g^{T} z_{k}] . \end{array}$
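As a minimal numerical sketch of the linear formula above (not taken from the paper), the following Python/NumPy code computes $g$ and the weights $w_{k}$ for a simulated equal-probability sample and checks that the calibration equations hold. The function name `linear_calibration_weights`, the simulated population, and the equal design weights are illustrative assumptions.

```python
import numpy as np

def linear_calibration_weights(d, Z, totals):
    """Linear calibration weights w_k = d_k * (1 + g' z_k).

    d      : (n,) design weights d_k = 1 / pi_k
    Z      : (n, P) calibration variables z_k for the sampled units
    totals : (P,) known population totals of z_k
    """
    d = np.asarray(d, dtype=float)
    Z = np.asarray(Z, dtype=float)
    # Gap between the known totals and their expansion estimates.
    gap = np.asarray(totals, dtype=float) - Z.T @ d
    # M = sum_S d_j z_j z_j' (symmetric, assumed invertible).
    M = Z.T @ (d[:, None] * Z)
    g = np.linalg.solve(M, gap)
    return d * (1.0 + Z @ g)

# Illustrative data: an intercept plus one continuous auxiliary variable.
rng = np.random.default_rng(0)
N, n = 1000, 100
Z_pop = np.column_stack([np.ones(N), rng.uniform(1.0, 5.0, N)])
sample = rng.choice(N, size=n, replace=False)
d = np.full(n, N / n)  # equal design weights (assumption of this sketch)

w = linear_calibration_weights(d, Z_pop[sample], Z_pop.sum(axis=0))

# The weighted sample totals should reproduce the population totals.
print(np.allclose(Z_pop[sample].T @ w, Z_pop.sum(axis=0)))
```

Nothing in this formula prevents an individual $w_{k}$ from being negative when $g^{T} z_{k} < - 1.$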

Fuller et al. (1994) and later Lundström and Särndal (1999) argued that this linear calibration can also be used to handle unit nonresponse. The sample $S$ is replaced by the respondent sample $R,$ while

$g^{T} = [(1 - θ) {(\sum_{U} z_{j} - \sum_{R} d_{j} z_{j})}^{T} + θ {(\sum_{S} d_{j} z_{j} - \sum_{R} d_{j} z_{j})}^{T}] {(\sum_{R} d_{j} z_{j} z_{j}^{T})}^{- 1},$

depending on whether the respondent sample is calibrated to the population $(θ = 0)$ or calibrated to the original sample $(θ = 1) .$ Either way, the estimate is nearly unbiased under the quasi-sample-design that treats response as a second phase of random sampling so long as each unit’s probability of response has the form:

$p_{k} = 1 / (1 + γ^{T} z_{k}), \qquad (2.1)$

and $g$ is a consistent estimator for the unknown parameter vector $γ$ in equation (2.1).
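The two calibration targets can be compared in a short simulation. The sketch below is an illustration under assumed data, not the authors' code: it generates response with probabilities of the form in equation (2.1), then applies linear calibration to the respondents for $θ = 0$ and $θ = 1.$ The helper name `linear_nonresponse_weights`, the value of `gamma`, and the simulated population are hypothetical.

```python
import numpy as np

def linear_nonresponse_weights(d_R, Z_R, target):
    """Linearly calibrate respondent weights so that sum_R w_k z_k = target."""
    d_R = np.asarray(d_R, dtype=float)
    Z_R = np.asarray(Z_R, dtype=float)
    M = Z_R.T @ (d_R[:, None] * Z_R)  # sum_R d_j z_j z_j'
    g = np.linalg.solve(M, np.asarray(target, dtype=float) - Z_R.T @ d_R)
    return d_R * (1.0 + Z_R @ g), g

rng = np.random.default_rng(1)
N, n = 5000, 500
Z_pop = np.column_stack([np.ones(N), rng.uniform(1.0, 5.0, N)])
s = rng.choice(N, size=n, replace=False)
Z_S, d_S = Z_pop[s], np.full(n, N / n)

# Response probabilities of the form p_k = 1 / (1 + gamma' z_k);
# this gamma (an assumption) keeps every p_k strictly between 0 and 1.
gamma = np.array([0.2, 0.1])
p = 1.0 / (1.0 + Z_S @ gamma)
responded = rng.random(n) < p

results = {}
for theta in (0, 1):
    # theta = 0: calibrate to the population; theta = 1: to the full sample.
    target = (1 - theta) * Z_pop.sum(axis=0) + theta * (Z_S.T @ d_S)
    w_R, g = linear_nonresponse_weights(d_S[responded], Z_S[responded], target)
    results[theta] = (w_R, g, target)
    print(theta, g)  # g estimates gamma; the two values need not coincide
```

In both cases the weighted respondent totals of $z_{k}$ reproduce the chosen target exactly; only the target differs.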

The problem with the response function in equation (2.1) is that the implicit estimator for $p_{k},$ ${\hat{p}}_{k} = 1 / (1 + g^{T} z_{k}),$ can be negative. A nonlinear form of calibration weighting avoiding this possibility was suggested by Kott and Liao (2012) based on the generalized exponential form of Folsom and Singh (2000). It uses Newton’s method (iterative Taylor-series approximations) to find a $g$ such that the calibration equation (from here on, we refer to the vector of component calibration equations as the calibration equation):

$\sum_{R} w_{k} z_{k} = \sum_{R} d_{k} α (g^{T} z_{k}) z_{k} = (1 - θ) \sum_{U} z_{k} + θ \sum_{S} d_{k} z_{k} \qquad (2.2)$

holds, where $θ = 0$ or $1,$

$α (g^{T} z_{k}) = \frac{ℓ + \exp (g^{T} z_{k})}{1 + \exp (g^{T} z_{k}) / u}, \qquad (2.3)$

$ℓ,$ the lower bound of $α (\cdot),$ is nonnegative (so that calibration weights are likewise nonnegative), and the upper bound of $α (\cdot), u > ℓ,$ can be either finite or infinite.
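A minimal sketch of such a Newton iteration is given below, assuming the form in equation (2.3) and a starting value of $g = 0$; it is an illustration rather than the authors' implementation. The function names, the stopping rule, and the simulated data are assumptions. The example calibrates to the full sample $(θ = 1)$ with $ℓ = 1$ and $u = \infty.$

```python
import numpy as np

def alpha(phi, lower=1.0, upper=np.inf):
    """Weight-adjustment function of equation (2.3)."""
    e = np.exp(phi)
    return (lower + e) / (1.0 + e / upper)

def alpha_prime(phi, lower=1.0, upper=np.inf):
    """Derivative of alpha with respect to its argument."""
    e = np.exp(phi)
    return e * (1.0 - lower / upper) / (1.0 + e / upper) ** 2

def solve_calibration(d_R, Z_R, target, lower=1.0, upper=np.inf,
                      tol=1e-10, max_iter=50):
    """Newton's method for sum_R d_k alpha(g' z_k) z_k = target."""
    g = np.zeros(Z_R.shape[1])  # starting value (an assumption)
    for _ in range(max_iter):
        phi = Z_R @ g
        resid = Z_R.T @ (d_R * alpha(phi, lower, upper)) - target
        # Jacobian of the left-hand side: sum_R d_k alpha'(g' z_k) z_k z_k'
        jac = Z_R.T @ ((d_R * alpha_prime(phi, lower, upper))[:, None] * Z_R)
        step = np.linalg.solve(jac, resid)
        g = g - step
        if np.max(np.abs(step)) < tol:
            return g
    raise RuntimeError("Newton iteration did not converge")

# Illustrative sample with logistic response (lower = 1, upper = infinity).
rng = np.random.default_rng(2)
n = 2000
Z_S = np.column_stack([np.ones(n), rng.uniform(0.0, 1.0, n)])
d_S = np.full(n, 5.0)
gamma = np.array([-1.0, 0.5])             # hypothetical true parameter
p = 1.0 / alpha(Z_S @ gamma)              # response probabilities
responded = rng.random(n) < p
d_R, Z_R = d_S[responded], Z_S[responded]

target = Z_S.T @ d_S                      # theta = 1: full-sample estimates
g = solve_calibration(d_R, Z_R, target)
w_R = d_R * alpha(Z_R @ g)
print(g)
```

Because $ℓ = 1$ here, every adjustment factor is at least 1, so no calibrated weight falls below its design weight and the implied response probability never exceeds 1.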

Although there are other reasonable forms the weight-adjustment function $α (g^{T} z_{k})$ can take, we will restrict our attention to functions in the form in equation (2.3). This is a generalization of both raking where $ℓ = 0, u = \infty,$ and the implicit estimation of a logistic response model, where $ℓ = 1, u = \infty .$ In Deming and Stephan’s original (1940) iterative-proportional-fitting algorithm for raking, the components of $z_{k}$ were restricted to indicator functions. We use “raking” more broadly here to mean calibration weighting with a weight-adjustment function of the form $α (g^{T} z_{k}) = \exp (g^{T} z_{k}) .$
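As a quick check of these two special cases, one can substitute the stated bounds into equation (2.3), writing $ϕ = g^{T} z_{k}$ as shorthand for this illustration only:

$ℓ = 0, u = \infty : \quad α (ϕ) = \exp (ϕ),$

$ℓ = 1, u = \infty : \quad α (ϕ) = 1 + \exp (ϕ), \quad 1 / α (ϕ) = \frac{1}{1 + \exp (ϕ)} .$

The first is the raking adjustment, and the second implies a response probability that is a logistic function of $- ϕ .$ More generally, differentiating equation (2.3) gives

$α^{'} (ϕ) = \frac{(1 - ℓ / u) \exp (ϕ)}{{[1 + \exp (ϕ) / u]}^{2}},$

which is positive whenever $u > ℓ,$ so $α (ϕ)$ increases from $ℓ$ toward $u$ as $ϕ$ goes from $- \infty$ to $\infty .$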

When $ℓ < 1,$ equation (2.3) becomes the generalized-raking adjustment introduced in Deville and Särndal (1992) and discussed further in Deville, Särndal and Sautory (1993). Generalized raking not only lets the components of $z_{k}$ be continuous but also allows the range of $α (g^{T} z_{k})$ to be constrained between a positive $ℓ$ and a (possibly) finite $u .$

Deville and Särndal (1992) required $α (0) = α^{'} (0) = 1.$ Since the authors were not treating samples with nonresponse (or incorrect frames), $g^{T} z_{k}$ needed to converge to 0 and $α (g^{T} z_{k})$ to 1 as the (expected) sample size grew arbitrarily large. When adjusting design weights for nonresponse, however, setting $ℓ \geq 1$ is a more sensible strategy, so that the implicit estimated probability of response does not exceed 1.

Although the original definition of calibration weighting in Deville and Särndal (1992) involved minimizing the differences between the $w_{k}$ and $d_{k}$ in $R$ as measured by some loss function, later formulations (e.g., Estevao and Särndal 2000) removed the loss function from the definition. Forcing $w_{k}$ and $d_{k}$ to be close makes little sense when calibration weighting is used to adjust for unit nonresponse since if a sampled $k$ has a relatively small probability of response, then the difference between $w_{k}$ and $d_{k}$ should be relatively large.

Rather than assuming a response model with a particular functional form, an alternative justification for using calibration weighting as a means of removing unit-nonresponse bias assumes a prediction model in which the survey variable $y_{k}$ is itself a random variable such that $E (y_{k} | z_{k}) = z_{k}^{T} β$ for some unknown $β$ whether or not $k$ is sampled or whether it responds when sampled. Kott (2006) and others have observed that the calibration-weighted estimator for $T_{y} = \sum_{U} y_{k}$ will be nearly unbiased under the prediction model when calibration is done to the population (when $θ = 0$ in equation (2.2)) and under the combination of the prediction model and the original sample-selection mechanism when calibration is done to the original sample (when $θ = 1$).

The property that a calibration-weighted estimator is nearly unbiased in some sense when either an assumed response model or an assumed prediction model holds has been called “double protection against nonresponse bias” by Kim and Park (2006). It is known as “double robustness” in the biostatistics literature (Bang and Robins 2005) and attributed to Robins, Rotnitzky and Zhao (1994), which dealt with item rather than unit nonresponse.

The distribution of $y_{k} | z_{k}$ under the prediction model is often assumed to be the same for sampled and nonsampled population members. That is to say, the sampling mechanism is assumed to be ignorable. In addition, the distribution of $y_{k} | z_{k}$ is often assumed to be the same whether or not a population member responds when sampled; that is, the response mechanism is also assumed to be ignorable (Little and Rubin 2002). Here, we make weaker analogous assumptions under the prediction model, namely, that $E (y_{k} | z_{k})$ does not depend on whether $k$ is sampled or, when sampled, responds. Let us say that the sampling and response mechanisms are assumed to be “first-moment ignorable”.

2.2 Instrumental variables

Deville (2000) observed that instrumental-variable calibration can be used to adjust for potential nonresponse bias by assuming a response model that depended on $x_{k},$

$p_{k} = {[α (γ^{T} x_{k})]}^{- 1} = \frac{1 + \exp (γ^{T} x_{k}) / u}{ℓ + \exp (γ^{T} x_{k})}, \qquad (2.4)$

but fitting calibration equations with $z_{k} :$

$\sum_{R} w_{k} z_{k} = \sum_{R} d_{k} α (g^{T} x_{k}) z_{k} = (1 - θ) \sum_{U} z_{k} + θ \sum_{S} d_{k} z_{k}, \qquad (2.5)$

where the $g$ satisfying equation (2.5) with $θ = 0$ or $1$ is a consistent estimator of the unknown parameter vector $γ$ in equation (2.4). Some mild conditions are needed for this. Sufficient are the following: $N^{- 1} \sum_{R} d_{k} α (γ^{T} x_{k}) z_{k}$ is a consistent and bounded estimator for $N^{- 1} [(1 - θ) \sum_{U} z_{k} + θ \sum_{S} d_{k} z_{k}],$ $α (ϕ)$ is everywhere twice differentiable, and $N^{- 1} \sum_{R} d_{k} α^{'} (ϕ) z_{k} x_{k}^{T}$ is always invertible and bounded as the sample grows arbitrarily large.

Let $R_{k} = 1$ when $k \in R,$ and $0$ otherwise. It is not hard to show that

$\begin{array}{lcl} g - γ & = & - {\left(\sum_{S} d_{k} R_{k} α^{'} (c_{k}) z_{k} x_{k}^{T}\right)}^{- 1} \left\{\sum_{S} d_{k} R_{k} α (γ^{T} x_{k}) z_{k} - \left[(1 - θ) \sum_{U} z_{k} + θ \sum_{S} d_{k} z_{k}\right]\right\} \\ & = & - {\left(N^{- 1} \sum_{S} d_{k} R_{k} α^{'} (c_{k}) z_{k} x_{k}^{T}\right)}^{- 1} \left\{N^{- 1} \sum_{S} d_{k} R_{k} α (γ^{T} x_{k}) z_{k} - N^{- 1} \left[(1 - θ) \sum_{U} z_{k} + θ \sum_{S} d_{k} z_{k}\right]\right\} \end{array}$

for some $c_{k}$ between $g^{T} x_{k}$ and $γ^{T} x_{k},$ as Kott and Liao (2012) demonstrated when $x_{k} = z_{k} .$
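Equation (2.5) can be solved numerically with a Newton iteration whose Jacobian is $\sum_{R} d_{k} α^{'} (g^{T} x_{k}) z_{k} x_{k}^{T},$ the matrix that appears (evaluated at $c_{k}$) in the expansion above. The following sketch assumes $ℓ = 1,$ $u = \infty,$ $θ = 1,$ vectors $x_{k}$ and $z_{k}$ of equal dimension, and $x_{k}$ observed for every respondent. The function names and the simulated data, in which the second component of $x_{k}$ is a noisy version of the second component of $z_{k},$ are illustrative assumptions and not the authors' code.

```python
import numpy as np

def alpha(phi):
    """Equation (2.3) with lower bound 1 and infinite upper bound."""
    return 1.0 + np.exp(phi)

def alpha_prime(phi):
    """Derivative of alpha for lower bound 1 and infinite upper bound."""
    return np.exp(phi)

def solve_iv_calibration(d_R, X_R, Z_R, target, tol=1e-10, max_iter=50):
    """Newton's method for sum_R d_k alpha(g' x_k) z_k = target."""
    g = np.zeros(X_R.shape[1])  # starting value (an assumption)
    for _ in range(max_iter):
        phi = X_R @ g
        resid = Z_R.T @ (d_R * alpha(phi)) - target
        # Jacobian: sum_R d_k alpha'(g' x_k) z_k x_k' (not symmetric in general)
        jac = Z_R.T @ ((d_R * alpha_prime(phi))[:, None] * X_R)
        step = np.linalg.solve(jac, resid)
        g = g - step
        if np.max(np.abs(step)) < tol:
            return g
    raise RuntimeError("Newton iteration did not converge")

rng = np.random.default_rng(3)
n = 4000
z1 = rng.uniform(0.0, 1.0, n)
x1 = z1 + 0.1 * rng.standard_normal(n)    # model variable correlated with z1
Z_S = np.column_stack([np.ones(n), z1])   # calibration variables
X_S = np.column_stack([np.ones(n), x1])   # model variables
d_S = np.full(n, 5.0)

gamma = np.array([-1.0, 0.5])             # hypothetical true parameter
responded = rng.random(n) < 1.0 / alpha(X_S @ gamma)
d_R, X_R, Z_R = d_S[responded], X_S[responded], Z_S[responded]

target = Z_S.T @ d_S                      # theta = 1: full-sample estimates
g = solve_iv_calibration(d_R, X_R, Z_R, target)
w_R = d_R * alpha(X_R @ g)
print(g)
```

The weights depend on $x_{k}$ through $α (g^{T} x_{k}),$ yet it is the weighted totals of $z_{k}$ that are forced to match the target.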

Deville also noted that it is possible for components of the $x_{k}$ to be survey variables with values known only for respondents. Chang and Kott (2008) extended the notion of calibration weighting to allow the dimension of the $z_{k}$-vector to be greater than that of the $x_{k}$-vector. We will not treat either possibility in the following sections.

Kim and Shao (2013), in treating nonignorable nonresponse, call the components of $z_{k}$ that are not wholly functions of the components of $x_{k}$ “instrumental variables”. To limit future confusion, we will henceforth use the term “model variables” to refer to the components of $x_{k} .$

Date modified: 2015-11-27
