2. Motivation

Andrés Gutiérrez, Leonardo Trujillo and Pedro Luis do Nascimento Silva

Previous | Next

2.1 Sampling designs and estimators

Consider a finite population as a set of N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtaaaa@36BA@ units, where N < MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtaiabgY da8iabg6HiLcaa@392F@ , forming the universe of study. N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtaaaa@36BA@ is known as the population size. Each element belonging to the population can be identified with an index k . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaiaac6 caaaa@3789@ Let U MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyvaaaa@36C1@ be the index set given by U = { 1,..., k ,..., N } . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyvaiabg2 da9maacmaabaGaaGymaiaaiYcacaaIUaGaaGOlaiaai6cacaaISaGa am4AaiaaiYcacaaIUaGaaGOlaiaai6cacaaISaGaamOtaaGaay5Eai aaw2haaiaai6caaaa@4456@ The selection of a sample s = { k 1 , k 2 , , k n ( s ) } MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Caiabg2 da9maacmaabaGaam4AamaaBaaaleaacaaIXaaabeaakiaaiYcacaWG RbWaaSbaaSqaaiaaikdaaeqaaOGaaGilaiablAciljaaiYcacaWGRb WaaSbaaSqaaiaad6gacaGGOaGaam4CaiaacMcaaeqaaaGccaGL7bGa ayzFaaaaaa@4586@ is done according to a sampling design defined as the multivariate probability distribution over a support Q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyuaaaa@36BD@ in a way that p ( s ) > 0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiCaiaacI cacaWGZbGaaiykaiabg6da+iaaicdaaaa@3AEF@ for every s Q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4CaiabgI Giolaadgfaaaa@3939@ and

s Q p ( s ) = 1. MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabuaeqale aacaWGZbGaeyicI4Saamyuaaqab0GaeyyeIuoakiaadchacaGGPaGa aiikaiaadohacaGGPaGaeyypa0JaaGymaiaac6caaaa@41CC@

Under a sampling design p ( ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiCaiaacI cacqGHflY1caGGPaaaaa@3A7F@ , an inclusion probability is assigned to every element in the population in order to denote the probability that the element belongs to the sample. For the k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@36D7@ -th element in the population this probability is denoted as π k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgaaeqaaaaa@38C0@ and it is known as the first order inclusion probability given by

π k = P r ( k S ) = P r ( I k = 1 ) = s k p ( s ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgaaeqaaOGaeyypa0Jaamiuaiaadkhadaqadaqaaiaa dUgacqGHiiIZcaWGtbaacaGLOaGaayzkaaGaeyypa0Jaamiuaiaadk hadaqadaqaaiaadMeadaWgaaWcbaGaam4AaaqabaGccqGH9aqpcaaI XaaacaGLOaGaayzkaaGaeyypa0ZaaabuaeqaleaacaWGZbGaeyydIC Iaam4Aaaqab0GaeyyeIuoakiaadchacaGGOaGaam4CaiaacMcaaaa@51C4@

where I k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamysamaaBa aaleaacaWGRbaabeaaaaa@37D1@ is a random variable denoting the membership of the element k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@36D7@ to the sample, and the subindex s k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Caiabg2 GiNiaadUgaaaa@38B1@ refers to the sum over all the possible samples containing the k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@36D7@ -th element. Analogously, π k l MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgacaWGSbaabeaaaaa@39B1@ is known as the second order inclusion probability and it denotes the probability that the elements k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@36D7@ and l MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiBaaaa@36D8@ belong to the sample and it is given by

π k l = P r ( k S ; l S ) = P r ( I k = 1 ; I l = 1 ) = s k,l p ( s ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgacaWGSbaabeaakiabg2da9iaadcfacaWGYbWaaeWa aeaacaWGRbGaeyicI4Saam4uaiaabUdacaWGSbGaeyicI4Saam4uaa GaayjkaiaawMcaaiabg2da9iaadcfacaWGYbWaaeWaaeaacaWGjbWa aSbaaSqaaiaadUgaaeqaaOGaeyypa0JaaGymaiaabUdacaWGjbWaaS baaSqaaiaadYgaaeqaaOGaeyypa0JaaGymaaGaayjkaiaawMcaaiab g2da9maaqafabeWcbaGaam4Caiabg2GiNiaabUgacaqGSaGaaeiBaa qab0GaeyyeIuoakiaadchacaGGOaGaam4CaiaacMcacaGGUaaaaa@5D82@

The aim of the sample survey is to study a characteristic of interest y MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@36E4@ associated with every unit in the population and to estimate a function of interest T , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamivaiaacY caaaa@3770@ called a parameter.

T = f ( y 1 , , y k , , y N ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamivaiabg2 da9iaadAgacaGGOaGaamyEamaaBaaaleaacaaIXaaabeaakiaaiYca cqWIMaYscaaISaGaamyEamaaBaaaleaacaWGRbaabeaakiaaiYcacq WIMaYscaaISaGaamyEamaaBaaaleaacaWGobaabeaakiaacMcacaGG Uaaaaa@45F1@

This inferential approach is known as design-based inference. Under this approach, the estimates of the parameters and their properties depend directly on the discrete probability measure related to the chosen sampling design and do not take into account the properties of the finite population. Also, the values y k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEamaaBa aaleaacaWGRbaabeaaaaa@3801@ are taken as the observation for the individual k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@36D7@ for the characteristic of interest y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@36E5@ . Also, y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@36E5@ is considered as a fixed quantity rather than a random variable.

Then, the Horvitz-Thompson (HT) estimator can be defined as:

t ^ y , π = k s y k π k = k s d k y k MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmiDayaaja WaaSbaaSqaaiaadMhacaaISaGaeqiWdahabeaakiabg2da9maaqafa beWcbaGaam4AaiabgIGiolaadohaaeqaniabggHiLdGcdaWcaaqaai aadMhadaWgaaWcbaGaam4AaaqabaaakeaacqaHapaCdaWgaaWcbaGa am4AaaqabaaaaOGaeyypa0ZaaabuaeqaleaacaWGRbGaeyicI4Saam 4Caaqab0GaeyyeIuoakiaadsgadaWgaaWcbaGaam4AaaqabaGccaWG 5bWaaSbaaSqaaiaadUgaaeqaaaaa@5116@

where d k = 1 / π k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamizamaaBa aaleaacaWGRbaabeaakiabg2da9maalyaabaGaaGymaaqaaiabec8a WnaaBaaaleaacaWGRbaabeaaaaaaaa@3CA6@ is the reciprocal of the first-order inclusion probability and it is known as the expansion factor or basic design weight. The HT estimator is unbiased for the total population t y = U y k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiDamaaBa aaleaacaWG5baabeaakiabg2da9maaqababeWcbaGaamyvaaqab0Ga eyyeIuoakiaadMhadaWgaaWcbaGaam4Aaaqabaaaaa@3DFC@ , (assuming all the first order inclusion probabilities are greater than zero) and its variance is given by

V a r ( t ^ y , π ) = k U l U Δ k l y k π k y l π l . ( 2.1 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvaiaadg gacaWGYbWaaeWaaeaaceWG0bGbaKaadaWgaaWcbaGaamyEaiaaiYca cqaHapaCaeqaaaGccaGLOaGaayzkaaGaeyypa0Zaaabuaeqaleaaca WGRbGaeyicI4Saamyvaaqab0GaeyyeIuoakmaaqafabeWcbaGaamiB aiabgIGiolaadwfaaeqaniabggHiLdGccqqHuoardaWgaaWcbaGaam 4AaiaadYgaaeqaaOWaaSaaaeaacaWG5bWaaSbaaSqaaiaadUgaaeqa aaGcbaGaeqiWda3aaSbaaSqaaiaadUgaaeqaaaaakmaalaaabaGaam yEamaaBaaaleaacaWGSbaabeaaaOqaaiabec8aWnaaBaaaleaacaWG SbaabeaaaaGccaaIUaGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7ca aMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6cacaaIXaGaaiykaaaa @692D@

where Δ k l = C o v ( I k , I l ) = π k l π k π l MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeuiLdq0aaS baaSqaaiaadUgacaWGSbaabeaakiabg2da9iaadoeacaWGVbGaamOD amaabmaabaGaamysamaaBaaaleaacaWGRbaabeaakiaaiYcacaWGjb WaaSbaaSqaaiaadYgaaeqaaaGccaGLOaGaayzkaaGaeyypa0JaeqiW da3aaSbaaSqaaiaadUgacaWGSbaabeaakiabgkHiTiabec8aWnaaBa aaleaacaWGRbaabeaakiabec8aWnaaBaaaleaacaWGSbaabeaaaaa@4ECD@ . If all the second-order inclusion probabilities are greater than zero, an unbiased estimator of (2.1) is given by

V a r ^ ( t ^ y , π ) = k s l s Δ k l π k l y k π k y l π l . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaecaaeaaca WGwbGaamyyaiaadkhaaiaawkWaamaabmaabaGabmiDayaajaWaaSba aSqaaiaadMhacaaISaGaeqiWdahabeaaaOGaayjkaiaawMcaaiabg2 da9maaqafabeWcbaGaam4AaiabgIGiolaadohaaeqaniabggHiLdGc daaeqbqabSqaaiaadYgacqGHiiIZcaWGZbaabeqdcqGHris5aOWaaS aaaeaacqqHuoardaWgaaWcbaGaam4AaiaadYgaaeqaaaGcbaGaeqiW da3aaSbaaSqaaiaadUgacaWGSbaabeaaaaGcdaWcaaqaaiaadMhada WgaaWcbaGaam4AaaqabaaakeaacqaHapaCdaWgaaWcbaGaam4Aaaqa baaaaOWaaSaaaeaacaWG5bWaaSbaaSqaaiaadYgaaeqaaaGcbaGaeq iWda3aaSbaaSqaaiaadYgaaeqaaaaakiaac6caaaa@5E17@

Gambino and Silva (2009) suggest that in a household survey, the main interest is to focus on characteristics for particular household members that could be related to health variables, educational variables, income/expenses, employment status, etc. In general, the sampling designs used for this kind of survey are complex and use techniques such as stratification, clustering or unequal probabilities of selection. Some of the results from repeated surveys consider the estimation of level at a particular point of time, estimation of changes between two survey rounds and the estimation of the average level parameters over repeated rounds of a survey. Different rotation schemes and the frequency of the survey can affect considerably the precision of the estimators.

2.2 Pseudo-likelihood

Some authors such as Fuller (2009), Chambers and Skinner (2003, p. 179), and Pessoa and Silva (1998, chapter 5) consider the problem where the maximum likelihood estimation is appropriate for simple random samples, as is the case in Stasny (1987), but not for samples resulting from a complex survey design. Under this scheme, it is assumed that the density population function is f ( y , θ ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOzamaabm aabaGaamyEaiaaiYcacqaH4oqCaiaawIcacaGLPaaaaaa@3BC5@ where the parameter of interest is θ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdehaaa@379D@ . If there is access to the information for the whole population, through a census, the maximum likelihood estimator of θ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdehaaa@379D@ can be obtained by maximizing

L ( θ ) = k U log f ( y k , θ ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamitamaabm aabaGaeqiUdehacaGLOaGaayzkaaGaeyypa0ZaaabuaeqaleaacaWG RbGaeyicI4Saamyvaaqab0GaeyyeIuoakiGacYgacaGGVbGaai4zai aadAgadaqadaqaaiaadMhadaWgaaWcbaGaam4AaaqabaGccaaISaGa eqiUdehacaGLOaGaayzkaaaaaa@4A4C@

with respect to θ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdehaaa@379D@ . We will denote θ N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaad6eaaeqaaaaa@389C@ as the value maximizing the last expression. The likelihood equations for the population are given by

k U u k ( θ ) = 0. MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabuaeqale aacaWGRbGaeyicI4Saamyvaaqab0GaeyyeIuoakiaadwhadaWgaaWc baGaam4AaaqabaGcdaqadaqaaiabeI7aXbGaayjkaiaawMcaaiabg2 da9iaaicdacaGGUaaaaa@4333@

The u k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyDamaaBa aaleaacaWGRbaabeaaaaa@37FD@ are known as scores and they are defined as

u k ( θ ) = log f ( y k , θ ) θ . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyDamaaBa aaleaacaWGRbaabeaakmaabmaabaGaeqiUdehacaGLOaGaayzkaaGa eyypa0ZaaSaaaeaacqGHciITciGGSbGaai4BaiaacEgacaWGMbWaae WaaeaacaWG5bWaaSbaaSqaaiaadUgaaeqaaOGaaGilaiabeI7aXbGa ayjkaiaawMcaaaqaaiabgkGi2kabeI7aXbaacaGGUaaaaa@4B63@

The pseudo-likelihood approach considers that θ N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaad6eaaeqaaaaa@389C@ is the parameter of interest according to the information collected in a complex sample. If k U u k ( θ ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabeaeqale aacaWGRbGaeyicI4Saamyvaaqab0GaeyyeIuoakiaadwhadaWgaaWc baGaam4AaaqabaGcdaqadaqaaiabeI7aXbGaayjkaiaawMcaaaaa@4082@ is considered as the parameter of interest, it is possible to estimate it using a weighted linear estimator

k s d k u k ( θ ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabuaeqale aacaWGRbGaeyicI4Saam4Caaqab0GaeyyeIuoakiaadsgadaWgaaWc baGaam4AaaqabaGccaWG1bWaaSbaaSqaaiaadUgaaeqaaOWaaeWaae aacqaH4oqCaiaawIcacaGLPaaaaaa@42EE@

where d k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamizamaaBa aaleaacaWGRbaabeaaaaa@37EC@ is a sampling design weight such as the inverse of the inclusion probability of the individual k . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaiaac6 caaaa@3789@ Then, it is possible to obtain an estimator for θ N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaad6eaaeqaaaaa@389C@ solving the resulting equation system.

Definition 2.1 A maximum pseudo-likelihood estimator θ ^ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGafqiUdeNbaK aadaWgaaWcbaGaam4Caaqabaaaaa@38D1@ for θ N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaad6eaaeqaaaaa@389C@ corresponds to the solution of the pseudo-likelihood equations given by

k s d k u k ( θ ) = 0. MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabuaeqale aacaWGRbGaeyicI4Saam4Caaqab0GaeyyeIuoakiaadsgadaWgaaWc baGaam4AaaqabaGccaWG1bWaaSbaaSqaaiaadUgaaeqaaOWaaeWaae aacqaH4oqCaiaawIcacaGLPaaacqGH9aqpcaaIWaGaaiOlaaaa@4560@

Using the Taylor linearization method, the asymptotic variance of a maximum pseudo-likelihood estimator based on the sampling design is given by

V p ( θ ^ s ) [ J ( θ N ) ] 1 V p [ k s d k u k ( θ N ) ] [ J ( θ N ) ] 1 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvamaaBa aaleaacaWGWbaabeaakmaabmaabaGafqiUdeNbaKaadaWgaaWcbaGa am4CaaqabaaakiaawIcacaGLPaaacqGHijYUdaWadaqaaiaadQeada qadaqaaiabeI7aXnaaBaaaleaacaWGobaabeaaaOGaayjkaiaawMca aaGaay5waiaaw2faamaaCaaaleqabaGaeyOeI0IaaGymaaaakiaadA fadaWgaaWcbaGaamiCaaqabaGcdaWadaqaamaaqafabeWcbaGaam4A aiabgIGiolaadohaaeqaniabggHiLdGccaWGKbWaaSbaaSqaaiaadU gaaeqaaOGaamyDamaaBaaaleaacaWGRbaabeaakmaabmaabaGaeqiU de3aaSbaaSqaaiaad6eaaeqaaaGccaGLOaGaayzkaaaacaGLBbGaay zxaaWaamWaaeaacaWGkbWaaeWaaeaacqaH4oqCdaWgaaWcbaGaamOt aaqabaaakiaawIcacaGLPaaaaiaawUfacaGLDbaadaahaaWcbeqaai abgkHiTiaaigdaaaaaaa@61E9@

where V p [ k s d k u k ( θ N ) ] MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvamaaBa aaleaacaWGWbaabeaakmaadmaabaWaaabeaeqaleaacaWGRbGaeyic I4Saam4Caaqab0GaeyyeIuoakiaadsgadaWgaaWcbaGaam4Aaaqaba GccaWG1bWaaSbaaSqaaiaadUgaaeqaaOWaaeWaaeaacqaH4oqCdaWg aaWcbaGaamOtaaqabaaakiaawIcacaGLPaaaaiaawUfacaGLDbaaaa a@47B0@ is the variance of the estimator for the population total of the scores based on the sampling design and

J ( θ N ) = k U u k ( θ ) θ | θ = θ N . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOsamaabm aabaGaeqiUde3aaSbaaSqaaiaad6eaaeqaaaGccaGLOaGaayzkaaGa eyypa0ZaaqGaaeaadaWcaaqaaiabgkGi2oaaqababaGaamyDamaaBa aaleaacaWGRbaabeaakmaabmaabaGaeqiUdehacaGLOaGaayzkaaaa leaacaWGRbGaeyicI4Saamyvaaqab0GaeyyeIuoaaOqaaiabgkGi2k abeI7aXbaaaiaawIa7amaaBaaaleaacqaH4oqCcqGH9aqpcqaH4oqC daWgaaqaaiaad6eaaeqaaaqabaGccaGGUaaaaa@5313@

An estimator for V p ( θ ^ s ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvamaaBa aaleaacaWGWbaabeaakmaabmaabaGafqiUdeNbaKaadaWgaaWcbaGa am4CaaqabaaakiaawIcacaGLPaaaaaa@3C6A@ is given by

V ^ p ( θ ^ s ) = [ J ^ ( θ ^ s ) ] 1 V ^ p [ k s d k u k ( θ ^ s ) ] [ J ^ ( θ ^ s ) ] 1 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmOvayaaja WaaSbaaSqaaiaadchaaeqaaOWaaeWaaeaacuaH4oqCgaqcamaaBaaa leaacaWGZbaabeaaaOGaayjkaiaawMcaaiabg2da9maadmaabaGabm OsayaajaWaaeWaaeaacuaH4oqCgaqcamaaBaaaleaacaWGZbaabeaa aOGaayjkaiaawMcaaaGaay5waiaaw2faamaaCaaaleqabaGaeyOeI0 IaaGymaaaakiqadAfagaqcamaaBaaaleaacaWGWbaabeaakmaadmaa baWaaabuaeqaleaacaWGRbGaeyicI4Saam4Caaqab0GaeyyeIuoaki aadsgadaWgaaWcbaGaam4AaaqabaGccaWG1bWaaSbaaSqaaiaadUga aeqaaOWaaeWaaeaacuaH4oqCgaqcamaaBaaaleaacaWGZbaabeaaaO GaayjkaiaawMcaaaGaay5waiaaw2faamaadmaabaGabmOsayaajaWa aeWaaeaacuaH4oqCgaqcamaaBaaaleaacaWGZbaabeaaaOGaayjkai aawMcaaaGaay5waiaaw2faamaaCaaaleqabaGaeyOeI0IaaGymaaaa aaa@621D@

where V ^ p [ k s d k u k ( θ ^ s ) ] MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmOvayaaja WaaSbaaSqaaiaadchaaeqaaOWaamWaaeaadaaeqaqabSqaaiaadUga cqGHiiIZcaWGZbaabeqdcqGHris5aOGaamizamaaBaaaleaacaWGRb aabeaakiaadwhadaWgaaWcbaGaam4AaaqabaGcdaqadaqaaiqbeI7a XzaajaWaaSbaaSqaaiaadohaaeqaaaGccaGLOaGaayzkaaaacaGLBb Gaayzxaaaaaa@47F5@ is a consistent estimator for the variance of the estimator of the population total of the scores and

J ^ ( θ ^ s ) = k s d k u k ( θ ) θ | θ = θ ^ s . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmOsayaaja WaaeWaaeaacuaH4oqCgaqcamaaBaaaleaacaWGZbaabeaaaOGaayjk aiaawMcaaiabg2da9maaeiaabaWaaSaaaeaacqGHciITdaaeqaqaai aadsgadaWgaaWcbaGaam4AaaqabaGccaWG1bWaaSbaaSqaaiaadUga aeqaaOWaaeWaaeaacqaH4oqCaiaawIcacaGLPaaaaSqaaiaadUgacq GHiiIZcaWGZbaabeqdcqGHris5aaGcbaGaeyOaIyRaeqiUdehaaaGa ayjcSdWaaSbaaSqaaiabeI7aXjabg2da9iqbeI7aXzaajaWaaSbaae aacaWGZbaabeaaaeqaaOGaaiOlaaaa@55BA@

Then, following Binder (1983), the asymptotic distribution of θ ^ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGafqiUdeNbaK aadaWgaaWcbaGaam4Caaqabaaaaa@38D1@ is normal since

V ^ p ( θ ^ s ) 1 / 2 ( θ ^ s θ N ) N ( 0,1 ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmOvayaaja WaaSbaaSqaaiaadchaaeqaaOWaaeWaaeaacuaH4oqCgaqcamaaBaaa leaacaWGZbaabeaaaOGaayjkaiaawMcaamaaCaaaleqabaGaeyOeI0 IaaGymaiaac+cacaaIYaaaaOWaaeWaaeaacuaH4oqCgaqcamaaBaaa leaacaWGZbaabeaakiabgkHiTiabeI7aXnaaBaaaleaacaWGobaabe aaaOGaayjkaiaawMcaaebbfv3ySLgzGueE0jxyaGqbaiab=XJi6iaa d6eadaqadaqaaiaaicdacaaISaGaaGymaaGaayjkaiaawMcaaiaac6 caaaa@52E2@

These definitions offer a solid background for the correct inference when using large samples as is the case in labour force surveys.

2.3 Nonresponse

Särndal and Lundström (2005) state that nonresponse has been a topic of increasing interest in national statistical offices during the last decades. Also, in the sampling survey literature, the attention to this topic has increased considerably. Nonresponse is a common non desirable issue in the development of a survey that can affect considerably the quality of the estimates.

Lohr (1999) discusses several types of nonresponse mechanisms:

  • The nonresponse mechanism is ignorable when the probability of an individual responding to the survey does not depend on the characteristic of interest. Note that the word "ignorable" makes reference to a model explaining the mechanism.
  • On the other hand, the nonresponse mechanism is nonignorable when the probability of an individual responding to the survey depends on the characteristic of interest. For example, in a labour survey, the possibility of response may depend on the labour force classification of the individuals in a household.

Lumley (2009, chapter 9) analyses individual nonresponse with partial data for a respondent considering a design-based approach adjusting the sampling weights. Fuller (2009, chapter 5) considers some imputation techniques for the nonresponse treatment through probabilistic models and sampling weights. Särndal (2011) considers a model-based approach through balanced sets in order to achieve higher representativeness of the estimates. In the same way, Särndal and Lundström (2010) propose a set of indicators in order to judge the effectiveness of auxiliary information in order to control the bias generated by nonresponse. Särndal and Lundström (2005) give a large number of references about nonresponse. These references examine two main complementary aspects in a survey: prevention of the problem of nonresponse (before it happens) and estimation techniques in order to take into account nonresponse in the inference process. This second aspect is known as adjustment for nonresponse.

Previous | Next

Date modified: