4. Simulations
Yan Lu
Previous | Next
In this section, a
small simulation has been conducted to study the proposed chi-squared tests
under a simple hypothesis to investigate the performance of
chi-squared tests proposed in Section 3. We compare the percentages of samples
for which the test statistics exceed the critical value to the nominal level R (www.r-project.org) is used to
perform simulation study and other analysis.
We generated the
data following Skinner and Rao (1996), with and A cluster sample from frame was generated with psus and observations in each psu, and a
simple random sample of observations was generated for
frame We generated the clustered binary
responses for the sample from frame by generating correlated
multivariate normal random vectors and then using the probit function to
convert the continuous responses to binary responses. After the sample was
generated, we calculated the PML estimators of and (see Section 2.2). These
estimated proportions were used to compute the chi-squared test statistics. We
then compared the percentages of samples for which the test statistics exceed
the critical value to the nominal level under different settings.
The simulation
study was performed with factors: (1) (2) (3) clustering parameter (4) sample sizes: 10, 30 or 50; 3, 5, or 10, 100, 300 or 500. (5) Simulation
runs: 1,000 times for each setting and 100 times when estimating the variance
covariance matrix using bootstrapping. All runs
used probability parameters and Table 4.1 reported the
percentages of samples for which the test statistics exceed the critical value.
Table 4.1
Comparison of the actual significance levels (%) among different tests.
is the uncorrected test;
is the first order corrected
and
is the second order corrected
Table summary
This table displays the results of Comparison of the actual significance levels (%) among different tests.
is the uncorrected test;
is the first order corrected
and
is the second order corrected
. The information is grouped by
(appearing as row headers),
and Wald (appearing as column headers).
|
|
|
|
Wald |
|
|
10 |
3 |
100 |
12.1 |
17.3 |
5.6 |
4.9 |
30 |
3 |
300 |
13.6 |
8.4 |
4.8 |
4.8 |
50 |
3 |
500 |
15.5 |
10.0 |
6.4 |
3.6 |
10 |
5 |
100 |
25.7 |
13.5 |
7.5 |
4.9 |
30 |
5 |
300 |
29.2 |
9.3 |
7.9 |
5.3 |
50 |
5 |
500 |
31.5 |
8.5 |
8.1 |
4.9 |
10 |
10 |
100 |
46.1 |
21.2 |
6.6 |
5.4 |
30 |
10 |
300 |
50.2 |
11.5 |
7.5 |
5.6 |
50 |
10 |
500 |
58.7 |
8.0 |
9.6 |
5.1 |
Table 4.1 indicates that naively using uncorrected test for complex survey data is
dangerous. With increased psu size and number of psu’s, the actual significance
level even reaches 62.2%. Extended Wald test doesn’t perform well since the
estimate of the variance may be unstable. Extended first order corrected test
is acceptable with actual significance level around 7%. Extended second order
corrected tests almost reach the nominal level 5%, for which is the one we
recommend to use in a dual frame survey categorical data analysis.
Previous | Next