Browse by

4. Estimation of the parameters of interest

Andrés Gutiérrez, Leonardo Trujillo and Pedro Luis do Nascimento Silva

Let $N_{i j}$ be the total number of respondents for the population of interest having a classification $i$ at time $t - 1$ and $j$ at time $t .$ Let $R_{i}$ be the total number of individuals in the population not responding at time $t$ but responding at time $t - 1$ with classification $i .$ Let $C_{j}$ denote the total number of individuals in the population not responding at time $t - 1$ but responding at time $t$ with classification $j$ and finally let $M$ be the total number of individuals at the population not responding at any of the two periods of observation. It follows that the total size of the population, $N$ , must satisfy:

$N = \sum_{i} \sum_{j} N_{i j} + \sum_{j} C_{j} + \sum_{i} R_{i} + M .$

Defining the following characteristics of interest, it is possible to define the parameters of interest:

$\begin{matrix} y_{1 i k} & = {\begin{array}{l} 1, & if the k - th individual responds at t - 1 with classification i; \\ 0, & otherwise . \end{array} \\ y_{2 j k} & = {\begin{array}{l} 1, & if the k - th individual responds at t with classification j; \\ 0, & otherwise . \end{array} \end{matrix}$

Then, the product of these quantities, defined as $y_{1 i k} y_{2 j k}$ , corresponds to a new characteristic of interest taking the value one if the individual has responded at both times and is classified in the cell $i j$ , or zero otherwise. Also,

$N_{i j} = \sum_{k \in U} y_{1 i k} y_{2 j k} .$

Define the following dichotomic characteristics:

$\begin{array}{l} z_{1 k} = {\begin{array}{l} 1, & if the k - th individual responds at t - 1; \\ 0, & otherwise . \end{array} \\ z_{2 k} = {\begin{array}{l} 1, & if the k - th individual responds at t; \\ 0, & otherwise . \end{array} \end{array}$

It follows that

$\begin{array}{l} R_{i} & = & \sum_{k \in U} y_{1 i k} (1 - z_{2 k}) \\ C_{j} & = & \sum_{k \in U} y_{2 j k} (1 - z_{1 k}) \\ M & = & \sum_{k \in U} (1 - z_{1 k}) (1 - z_{2 k}) . \end{array}$

Let $w_{k}$ denote the weight for the $k$ -th individual corresponding to a specific sampling strategy (sampling design and estimator) in both waves. Then the following expressions represent the estimators of the parameters of interest:

$\begin{matrix} {\hat{N}}_{i j} & = \sum_{k \in S} w_{k} y_{1 i k} y_{2 j k} \\ {\hat{R}}_{i} & = \sum_{k \in S} w_{k} y_{1 i k} (1 - z_{2 k}) \\ {\hat{C}}_{j} & = \sum_{k \in S} w_{k} y_{2 j k} (1 - z_{1 k}) \\ \hat{M} & = \sum_{k \in S} w_{k} (1 - z_{1 k}) (1 - z_{2 k}) \end{matrix}$

for $N_{i j}$ , $R_{i}$ , $C_{j}$ and $M,$ respectively. Note that an unbiased estimation for the population size is given by

$\hat{N} = \sum_{i} \sum_{j} {\hat{N}}_{i j} + \sum_{j} {\hat{C}}_{j} + \sum_{i} {\hat{R}}_{i} + \hat{M} = \sum_{s} w_{k} v_{k}$

where

$v_{k} = \sum_{i} y_{1 i k} \sum_{j} y_{2 j k} + \sum_{j} y_{2 j k} (1 - z_{1 k}) + \sum_{i} y_{1 i k} (1 - z_{2 k}) + (1 - z_{1 k}) (1 - z_{2 k}) .$

Taking into account the functional form of all the parameters of interest, and noticing that the likelihood function of the model is proportional to (3.1), we arrive at the following result.

Result 4.1 The log-likelihood for the observed data at the population can be rewritten as

$l_{U} = \sum_{k \in U} f_{k} (ψ, ρ_{R R}, ρ_{M M}, η, p, y_{1}, y_{2}, z_{1}, z_{2}) (4.1)$

where

$\begin{array}{l} f_{k} (ψ, ρ_{R R}, ρ_{M M}, η, p, y_{1}, y_{2}, z_{1}, z_{2}) \\ = \sum_{i} \sum_{j} y_{1 i k} y_{2 j k} \ln (ψ ρ_{R R} η_{i} p_{i j}) \\ + \sum_{i} y_{1 i k} (1 - z_{2 k}) \ln (\sum_{j} ψ (1 - ρ_{R R}) η_{i} p_{i j}) \\ + \sum_{j} y_{2 j k} (1 - z_{1 k}) \ln (\sum_{i} (1 - ψ) (1 - ρ_{M M}) η_{i} p_{i j}) \\ + (1 - z_{1 k}) (1 - z_{2 k}) \ln (\sum_{i} \sum_{j} (1 - ψ) ρ_{M M} η_{i} p_{i j}) \end{array}$

where $y_{1}$ is a vector containing the characteristics $y_{1 i k}$ , $y_{2}$ is a vector containing the characteristics $y_{2 j k},$ $z_{1}$ is a vector containing the characteristics $z_{1 k}$ , and $z_{2}$ is a vector containing the characteristics $z_{2 k}$ (for every $k = 1, \dots, N$ and $i, j = 1, \dots, G$ ).

Now, in order to obtain estimators of the parameters, it is necessary to maximize this last function. Using standard techniques of maximum likelihood, the corresponding likelihood equations are given by

$\sum_{k \in U} u_{k} (θ) = 0$

where the vector $u_{k}$ , commonly known as scores, is defined by

$u_{k} (θ) = \frac{\partial f_{k} (θ)}{\partial θ} .$

Also, as it is not usual to survey the whole population, a probability sample is selected and the expression $\sum_{k \in U} u_{k} (θ)$ is considered as a population parameter. In this way, considering $w_{k} = 1 / π_{k}$ as the corresponding sampling weights, an unbiased estimator for this sum of scores is defined as $\sum_{k \in S} w_{k} u_{k} (θ) .$ The next expression is known as the pseudo-likelihood equation and it is an effective way to find estimators for the model parameters taking into account the sampling weights:

$\sum_{k \in S} w_{k} u_{k} (θ) = 0 .$

It is assumed that for the model in this paper, the initial probability of an individual responding at time $t - 1$ is the same for all the possible classifications in the survey. Also, the transition probabilities between respondents and nonrespondents do not depend on the classification of the individual in the survey, $ρ_{M M}$ and $ρ_{R R}$ . Considering these assumptions, the following results will let the estimation of the Markov model probabilities take into account the sampling weights.

Result 4.2 Under the assumptions of the model, the resulting maximum pseudo-likelihood estimators for $ψ,$ $ρ_{R R}$ and $ρ_{M M}$ are given by

$\begin{array}{l} {\hat{ψ}}_{m p v} & = \frac{\sum_{i} \sum_{j} {\hat{N}}_{i j} + \sum_{i} {\hat{R}}_{i}}{\sum_{i} \sum_{j} {\hat{N}}_{i j} + \sum_{i} {\hat{R}}_{i} + \sum_{j} {\hat{C}}_{j} + \hat{M}} \\ {\hat{ρ}}_{R R, m p v} & = \frac{\sum_{i} \sum_{j} {\hat{N}}_{i j}}{\sum_{i} \sum_{j} {\hat{N}}_{i j} + \sum_{i} {\hat{R}}_{i}} \\ {\hat{ρ}}_{M M, m p v} & = \frac{\hat{M}}{\sum_{j} {\hat{C}}_{j} + \hat{M}} \end{array}$

respectively.

Result 4.3 Under the assumptions of the model, the resulting maximum pseudo-likelihood estimators for $η_{i}$ and $p_{i j}$ are obtained through iteration until convergence of the next expressions

$\begin{array}{l} {\hat{η}}_{i, m p v}^{(v + 1)} & = & \frac{\sum_{j} {\hat{N}}_{i j} + {\hat{R}}_{i} + \sum_{j} ({\hat{C}}_{j} {\hat{η}}_{i}^{(v)} {\hat{p}}_{i j}^{(v)} / \sum_{i} {\hat{η}}_{i}^{(v)} {\hat{p}}_{i j}^{(v)})}{\sum_{i} \sum_{j} {\hat{N}}_{i j} + \sum_{i} {\hat{R}}_{i} + \sum_{j} {\hat{C}}_{j}} \\ {\hat{p}}_{i j, m p v}^{(v + 1)} & = & \frac{{\hat{N}}_{i j} + ({\hat{C}}_{j} {\hat{η}}_{i}^{(v)} {\hat{p}}_{i j}^{(v)} / \sum_{i} {\hat{η}}_{i}^{(v)} {\hat{p}}_{i j}^{(v)})}{\sum_{j} {\hat{N}}_{i j} + \sum_{j} ({\hat{C}}_{j} {\hat{η}}_{i}^{(v)} {\hat{p}}_{i j}^{(v)} / \sum_{i} {\hat{η}}_{i}^{(v)} {\hat{p}}_{i j}^{(v)})} \end{array}$

respectively. The superindex $(v)$ denotes the value of the estimation for the parameters of interest at the $v - t h$ iteration.

The results before provide an exhaustive frame for the implementation of the two-stage Markovian model in order to take into account the sampling weights in longitudinal surveys. Another question of interest is how to choose the initial values ${{\hat{η}}_{i}^{(0)}}$ and ${{\hat{p}}_{i j}^{(0)}}$ . In general, any set of values is valid if they follow the initial restrictions. These are

$\begin{matrix} \sum_{i} {\hat{η}}_{i}^{(0)} & = 1 \\ \sum_{j} {\hat{p}}_{i j}^{(0)} & = 1. \end{matrix}$

However, following the guidelines at Chen and Fienberg (1974) and considering the hypothetical case where all of the individuals responded in both periods, then $M = 0, R_{i} = 0$ (for every $i = 1, \dots, G$ ) and $C_{j} = 0$ (for every $j = 1, \dots, G$ ) and their sampling estimations are also null. Given this, and considering the expressions of the resulting estimators, a sensible choice is given by

$\begin{matrix} {\hat{η}}_{i}^{(0)} & = \frac{\sum_{j} {\hat{N}}_{i j}}{\sum_{i} \sum_{j} {\hat{N}}_{i j}} \\ {\hat{p}}_{i j}^{(0)} & = \frac{{\hat{N}}_{i j}}{\sum_{j} {\hat{N}}_{i j}} . \end{matrix}$

Lastly, this iterative approach is commonly implemented for estimation problems by maximum likelihood in contingency tables. However, some approaches for the fit of log-linear models in contingency tables for complex survey designs can be found at Clogg and Eliason (1987), Rao and Thomas (1988), Skinner and Vallet (2010), among others. The next result provides an approach to gross flow estimation considering the sampling weights at both periods of interest.

Result 4.4 Under the assumptions of the model, a sampling estimator of $μ_{i j}$ is

${\hat{μ}}_{i j, m p v} = \hat{N} {\hat{η}}_{i, m p v} {\hat{p}}_{i j, m p v} .$

Previous | Next

Date modified:: 2017-09-20

Language selection

Search and menus

Search

Publications

Survey Methodology

Browse by

4. Estimation of the parameters of interest