Data quality, concepts and methodology: Methodology and data quality
Introduction
This section provides an overview of the survey's underlying methodology and of key aspects of data quality, to help readers understand the strengths and limitations of the data. This information may be particularly relevant when comparing HES data with data from other surveys or sources, and when drawing conclusions from time series.
Reference period
Respondents to the Households and the Environment Survey (HES) were asked about behaviours and activities undertaken by the household during the following reference periods. Examples of the questions or modules that used each reference period are listed beneath it:
Reference period: At the time of the interview
- Dwelling's water source
- Type of heating equipment
- Energy conservation
- Septic system
- Water conservation (water meters, low-flow showerheads, reduced volume toilets, rain barrel or cistern)
- Recycling programs
- Radon awareness
- Ethanol blended gasoline and bio-diesel availability
Reference period: During the previous summer
- Lawn and garden watering
Reference period: During the "last twelve months"
- Drinking water choice
- Water treatment
- Water testing
- Water conservation (indoor water use)
- Fertilizer and pesticide use
- Recycling behaviour
- Hazardous waste disposal
- Composting
- Cleaning and chemical products
- Recreation vehicles and gas-powered equipment
- Motor vehicles
- Public transit
- Ethanol blended gasoline and bio-diesel use
- Air quality
- Purchasing decisions
- Total household income
Reference period: Warmer months and colder months
- Mode of transport to work
Reference period: Winter season and summer season
- Thermostat use
- Indoor temperature
Reference period: "In the last five years"
- Major appliance purchases
Target population
The target population consisted of households in Canada excluding households located in Yukon, Northwest Territories and Nunavut, households located on Indian reserves or Crown lands, and households consisting entirely of full-time members of the Canadian Armed Forces. Institutions and households of certain remote regions were also excluded.
Variables measured
Broadly, the 2007 HES measured variables that explored the following themes:
- Water quality concerns of households
- Consumption and conservation of water
- Consumption and conservation of energy
- Home heating and cooling
- Use of household lawn and garden equipment
- Use of gasoline-powered recreation equipment
- Pesticide and fertilizer use on lawns and gardens
- Recycling, composting and waste disposal practices
- Impacts of air and water quality on households
- Transportation decisions
- Purchasing decisions
Instrument design
The questionnaire was designed by Statistics Canada in consultation with stakeholders involved in the Canadian Environmental Sustainability Indicators project and in consideration of the data needs of both the project and the larger research and policy communities. The questionnaire was tested by Statistics Canada's Questionnaire Design Resource Centre (QDRC), which conducted focus group sessions and a number of one-on-one interviews, in both English and French, in five cities across Canada in January and February 2007.
The questionnaire was designed to follow standard practices and wording, when applicable, in a computer-assisted interviewing environment. This included the automatic control of question wording and flows that depended upon answers to earlier questions and the use of online edits to check for logical inconsistencies and gross capture errors.
The computer application for data collection was subjected to extensive testing before its use in the survey.
Sampling
The HES sample was selected from households that responded to the Canadian Community Health Survey (CCHS) between January and June 2007. Details of the CCHS sample design can be obtained upon request. In Quebec and Ontario, a subsample of the CCHS respondents was selected, large enough to allow for reliable estimates, i.e., estimates with a coefficient of variation (CV) of 16.5% or better for proportions as small as 10%, in census metropolitan areas (CMAs) and in the non-CMA portion of each province. In the other provinces, all the CCHS responding dwellings were selected in order to allow for the most reliable estimates possible. The initial HES 2007 sample consisted of 29,980 dwellings.
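To give a sense of what the 16.5% CV target implies, the sketch below computes the number of responding households needed for a proportion of 10%. It assumes simple random sampling with no finite population correction, plus an optional design effect; the function name `required_sample_size` and the `deff` parameter are illustrative and do not come from the HES documentation. The actual HES allocation depended on the CCHS design.

```python
import math

def required_sample_size(p, target_cv, deff=1.0):
    """Respondents needed so that an estimated proportion p has CV <= target_cv.

    Assumption: simple random sampling, where
        CV(p_hat) = sqrt((1 - p) / (n * p)),
    inflated by a design effect `deff` (1.0 means no inflation).
    """
    n_srs = (1.0 - p) / (p * target_cv ** 2)
    return math.ceil(deff * n_srs)

# Proportion of 10% with a CV of 16.5%, under the simple random sampling assumption
n = required_sample_size(p=0.10, target_cv=0.165)
print(n)  # 331
```

Under these assumptions, roughly 331 responding households per domain (for example, one CMA) would be needed; a complex design with a design effect above 1 would need proportionally more.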
Data collection
Data collection took place from October 2007 to February 2008. Participation in the survey was voluntary and data were collected directly from a representative of the selected household by telephone interview. Depending on this person's availability and operational constraints, the HES interview was completed immediately or arrangements were made to call back in order to complete the interview. An automated call scheduler managed follow-up calls in order to try to make contact with the respondent at different times of day throughout the collection period.
Interviews for the HES were conducted from Statistics Canada's regional offices using a computer-assisted telephone interviewing (CATI) application. Of the 29,980 dwellings in the initial sample, 21,690 were responding units, for a final HES response rate of 72.3%.
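The response rate quoted above follows directly from the two counts. A minimal check, assuming the rate is simply responding units divided by the initial sample (variable names are illustrative):

```python
initial_sample = 29_980    # dwellings in the initial HES sample
responding_units = 21_690  # responding units

response_rate = responding_units / initial_sample
print(f"{response_rate:.1%}")  # 72.3%
```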
Error detection
The HES questionnaire incorporated many features to maximize the quality of the data collected. Multiple edits in the computer-assisted interview questionnaire checked the entered data for unusual values and for logical inconsistencies between sections of the questionnaire. When an edit failed, the interviewer was prompted to correct the information with the help of the respondent. The interviewer could also enter a response of "Don't Know" or "Refused" if the respondent did not answer the question.
Once the data were received at Statistics Canada's head office, an extensive series of processing steps was undertaken to examine each record received. A top-down flow edit was used to clean up any question paths that may have been mistakenly followed during the interview.
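The idea of a top-down flow edit can be sketched as follows. This is only an illustration: the question names, the skip rules and the `VALID_SKIP` code are hypothetical and are not taken from the HES questionnaire or from Statistics Canada's processing system.

```python
VALID_SKIP = "valid skip"

# Hypothetical flow rules, in questionnaire order. Each rule says whether a
# question should have been asked, given the (already cleaned) earlier answers.
FLOW_RULES = [
    ("has_lawn", lambda r: True),
    ("watered_lawn", lambda r: r.get("has_lawn") == "yes"),
    ("used_sprinkler", lambda r: r.get("watered_lawn") == "yes"),
]

def flow_edit(record):
    """Walk the questions top-down and blank out answers on paths that
    should not have been followed."""
    cleaned = dict(record)
    for question, should_ask in FLOW_RULES:
        if not should_ask(cleaned):
            cleaned[question] = VALID_SKIP
    return cleaned

# A record where follow-up questions were answered even though the
# household reported having no lawn.
raw = {"has_lawn": "no", "watered_lawn": "yes", "used_sprinkler": "yes"}
cleaned = flow_edit(raw)
```

Because each rule is evaluated against the cleaned record, a correction made early in the questionnaire carries through to every question that depends on it.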
Estimation
Estimates representing in-scope households were produced by assigning weights to each sampled household. The weight of a sampled household indicated the number of households in the population that the unit represented. The initial weight was provided by the CCHS and incorporated the probability of selecting the unit in their sample, as well as other adjustments such as the treatment of non-response to the CCHS.
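As an illustration of how such weights are used, the sketch below estimates a proportion of households from a weighted sample. The weights and flags are made-up values, and the function name is illustrative.

```python
def weighted_proportion(weights, has_characteristic):
    """Estimated share of households with a characteristic.

    Each weight is the number of households in the population that the
    sampled household represents.
    """
    total = sum(weights)
    with_char = sum(w for w, flag in zip(weights, has_characteristic) if flag)
    return with_char / total

# Toy data: four sampled households representing 1,000 households in total
weights = [250, 400, 150, 200]
composts = [True, False, True, False]

estimated_households = sum(weights)
share = weighted_proportion(weights, composts)
```

Here the two composting households represent 400 of the 1,000 households, so the estimated proportion is 0.4.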
In order to produce the HES weights, a first adjustment was made to the initial weight to reflect the fact that only a subsample of the CCHS was used. A second adjustment was made to account for non-response to the HES. A third and final adjustment, a post-stratification to the Census projections, produced the final weight. The quality of the estimates was assessed using estimates of their CV. Given the complexity of the HES design, CVs cannot be calculated using a simple formula; bootstrap replicate weights were therefore used to obtain the CVs of the estimates.
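The three weight adjustments and the bootstrap CV can be sketched as below. This is a simplified illustration under stated assumptions, not the HES production method: it uses a single adjustment class and a single post-stratum, a common textbook form for each adjustment, and a bootstrap variance taken as the mean squared deviation of the replicate estimates from the full-sample estimate. All function names and numbers are hypothetical.

```python
import math

def subsample_adjust(cchs_weight, sampling_fraction):
    """Step 1 (assumed form): divide the CCHS weight by the HES subsampling fraction."""
    return cchs_weight / sampling_fraction

def nonresponse_factor(weights_selected, responded):
    """Step 2 (assumed form): weight of all selected units over weight of respondents."""
    total = sum(weights_selected)
    resp = sum(w for w, r in zip(weights_selected, responded) if r)
    return total / resp

def poststratify(weights, control_total):
    """Step 3 (assumed form): scale respondent weights to sum to a control total."""
    factor = control_total / sum(weights)
    return [w * factor for w in weights]

def bootstrap_cv(estimate, replicate_estimates):
    """CV from bootstrap replicates: square root of the mean squared deviation
    of the replicate estimates from the full-sample estimate, divided by it."""
    b = len(replicate_estimates)
    variance = sum((rep - estimate) ** 2 for rep in replicate_estimates) / b
    return math.sqrt(variance) / estimate

# Toy example: four selected households, half of the CCHS respondents subsampled
cchs_weights = [100, 100, 200, 200]
responded = [True, False, True, True]

w1 = [subsample_adjust(w, sampling_fraction=0.5) for w in cchs_weights]
f2 = nonresponse_factor(w1, responded)
w2 = [w * f2 for w, r in zip(w1, responded) if r]
final_weights = poststratify(w2, control_total=1500)

# Toy replicate estimates of a proportion whose full-sample estimate is 0.40
cv = bootstrap_cv(0.40, [0.38, 0.42, 0.41, 0.39])
```

In practice each bootstrap replicate weight would go through the same adjustments, and each replicate estimate would be computed with its own set of replicate weights.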
Quality evaluation
All published data were compared to identical or similar HES data from previous surveys to ensure consistency. Explanations were found for any significant changes. Subject-matter experts also confronted the data with other sources, and identified and researched any values that were not consistent with others in the same domain.
Disclosure control
Statistics Canada is prohibited by law from releasing any data that would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.
Coverage
The coverage error of the CCHS, of which the HES is a subsample, is estimated at less than 2%.
Response rates and sampling error
The response rate for this survey was 72.3%. Provincial response rates ranged from 68% to 75%.
The results estimated from the HES are based on a sample of households in Canada. Results obtained by asking the same questions of all Canadian households would differ from these estimates to some extent. The extent of this sampling error is quantified by the CV, interpreted using the following guidelines:
- 16.5% and below: acceptable estimate;
- 16.6% to 33.3%: marginal estimate requiring cautionary note to users; and
- above 33.3%: unacceptable estimate.
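These guidelines can be expressed as a small helper. How values that fall exactly on or between the published cut-offs are handled is an assumption here, and the function name is illustrative.

```python
def quality_category(cv_percent):
    """Classify an estimate by its coefficient of variation (in percent).

    Assumption: cut-offs are treated as inclusive upper bounds, so 16.5 is
    "acceptable" and 33.3 is "marginal".
    """
    if cv_percent <= 16.5:
        return "acceptable"
    if cv_percent <= 33.3:
        return "marginal"
    return "unacceptable"

categories = [quality_category(cv) for cv in (4.0, 16.5, 20.0, 40.0)]
```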
Estimates that do not meet an acceptable level of quality are either flagged for caution or suppressed. CV tables are prepared by Statistics Canada and made available to help users understand the quality of individual estimates. For example, CVs are provided for the estimated proportion of households that had a compact fluorescent light bulb in 2007, for Canada and for each province.
Data comparability over time
For the 2007 cycle of the survey, some questions were improved: they were reworded or reordered to reflect what was learned during the 2006 collection cycle. While these quality improvements were necessary, they have affected the comparability of some of the 2007 data with those of 2006. Care should therefore be exercised when making direct year-to-year comparisons for certain topics.
Data obtained from the 2007 survey are directly comparable with data from the 2006 survey for the following variables:
- Main source of water
- Access to and use of recycling programs
- Household composting
- Presence of a thermostat and a programmable thermostat
- Presence of energy-saving light bulbs
- Presence of low-flow shower heads
- Presence of a low-flow toilet or a toilet tank with the water volume modified
- Presence of water purifiers or filters
- Presence of a yard
The following topics describe some of the more significant changes and offer some guidance when making such comparisons for these topics. Further information on making comparisons for topics not listed here can be obtained upon request.
Topic: Thermostats and dwelling temperature
Topic: Drinking water
Topic: Drinking water treatment
Topic: Pesticide use
Topic: Fertilizer and pesticide use