3 A dynamic adaptive survey design: Re-assigning interviewers in a follow-up survey
Barry Schouten, Melania Calinescu and Annemieke Luiten
Previous | Next
In this section, we provide an example of a dynamic
adaptive design: the re-assignment of interviewers based on observations of the
propensity to cooperate. The example is based on hypothetical response
propensities and cost functions. Interviewers are assigned to sample cases that
have refused once, based on an assessment of the propensity to respond made
during a first phase of the survey. The assessment is made for respondents and
refusers, but it is not available for sample units who were not contacted
during the first phase. It provides a judgement on the propensity that the
sample unit participates in the survey when contacted again. The assessment is
made on a three point scale: easy,
medium, difficult. Easy means that there is a high probability that if
contacted again the sample unit would respond.
After a first phase of data collection, the intermediate
survey results are evaluated and sample units are divided into respondents,
refusers and noncontacts. Refusers receive a different treatment. Interviewers
are rated based on their historic performance and grouped in good and less good interviewers. Refusers
are re-assigned to one of the two groups of interviewers. Since there is no
assessment available for non-contacts, the treatment for this group is not
altered.
We use the R-indicator given by (2.7) as the quality
objective function. We split the sample using into two groups, labelled as young and old. The goal in the second phase is to assign refusers to the two
interviewer groups such that the R-indicator with respect to age is maximized.
Let be the sample size of the survey. The
population proportions of the two subpopulations, young and old, are
denoted by and We let be the conditional probability that a sample
unit from age subpopulation is of type where Furthermore, let be the probability that a sample unit of type from age subpopulation is a refusal. If a person is not a refuser,
then is the probability that the person either was
a respondent after the first phase or becomes a respondent when he/she was a
noncontact after the first phase.
The total number of interviewers is and represents the number of interviewers with skill
and . The set forms the set of strategies, i.e., we want to assign each refuser to
either a good or a less good interviewer. We assume that each interviewer can
handle at most refusal cases in the second phase of the
survey. The probability that a refusal of type from subpopulation will respond if contacted by an interviewer of
skill is denoted by and it is again assumed to be known from
previous surveys.
Let be the set of decision variables, where represents the probability that a sample unit
of type will be assigned to an interviewer of skill given that he/she belongs to subpopulation In other words, we allow for a random
assignment of sample units to the two interviewer groups.
In this example, we express costs in terms of the
overall interviewer occupation rates. Since interviewers can handle at most cases, there are two constraints
In other words, the total number of refusers that can be
assigned to interviewers of skill is restrained to the maximum possible workload
for that skill group.
The response propensity for a unit from subpopulation can now be derived as
and form the input to the R-indicator.
Now, consider the following input data for the example:
a sample size of 2,000,
a total of 80 interviewers, 80,
a maximal workload of 30 cases per interviewer, 30,
an age distribution equal to 0.5,
conditional distributions of refusal type and and 25% of the interviewers are classified as
good,
Tables 3.1 and 3.2 give the hypothetical response
probabilities for the two subgroups when refusal conversion
is applied, as well as the cooperation probabilities and refusal probabilities
We optimize the R-indicator with respect to the two age
groups. For two strata, it can be shown that the R-indicator is maximal when
the absolute distance between the two strata response propensities is minimal.
The optimal value of the R-indicator turns out to be 0.827. Table 3.3 shows the
optimal values of the decision variables; all but one of the decision variables
are either 0 or 1, i.e., the re-assignments are mostly non-probabilistic. The
exception is the subpopulation of young persons with medium response propensity
assessment.
Table 3.1
Response probabilities when refusal conversion is applied to young and old refusers given the assessment of propensity to respond.
Table summary
This table displays the results of response probabilities when refusal conversion is applied to young and old refusers given the assessment of propensity to respond. good interviewer and less good interviewer, calculated using easy, medium and difficult units of measure (appearing as column headers).
|
Good interviewer |
Less good interviewer |
Easy |
Medium |
Difficult |
Easy |
Medium |
Difficult |
Young refuser |
|
|
0.8 |
0.6 |
0.4 |
0.7 |
0.5 |
0.3 |
Old refuser |
|
|
0.9 |
0.7 |
0.5 |
0.8 |
0.6 |
0.4 |
Table 3.2
Refusal and cooperation probabilities in the first phase of data collection
Table summary
This table displays the results of refusal and cooperation probabilities in the first phase of data collection young and old, calculated using easy, medium and difficult units of measure (appearing as column headers).
|
Young |
Old |
Easy |
Medium |
Difficult |
Easy |
Medium |
Difficult |
|
0.5 |
0.6 |
0.7 |
0.2 |
0.3 |
0.4 |
|
0.85 |
0.8 |
0.76 |
0.95 |
0.93 |
0.91 |
Table 3.3
Optimal assignment of cases to interviewers
Table summary
This table displays the results of optimal assignment of cases to interviewers young and old, calculated using easy, medium and difficult units of measure (appearing as column headers).
|
Young |
Old |
Easy |
Medium |
Difficult |
Easy |
Medium |
Difficult |
Good |
1 |
0.83 |
1 |
0 |
0 |
0 |
Less good |
0 |
0.17 |
0 |
1 |
1 |
1 |
It is useful to compare
the optimal allocation to a random allocation of interviewers in order to see
how much is gained. If we would randomly assign the refusals to the
interviewers, then the value of the R-indicator equals 0.749. The optimal
assignment, thus, leads to a considerable increase in the R-indicator. The
response rates are, respectively, 72.0% and 70.1% for the optimal and the
random assignment.
If we increase the number of interviewers, while fixing
the maximal number of cases per interviewer as well as the other parameters,
then for any interviewer number higher than the R-indicator does not improve. Both
interviewer groups are sufficiently big to handle the entire sample and the
cost constraint is no real constraint anymore. The R-indicator for is equal to 0.830 and the response rate is
72.1%. If we would maximize the response rate rather than the R-indicator, then
the allocation of interviewers will converge towards assigning only good interviewers to all cases.
Previous | Next