2006
Information identified as archived on the Web is for reference, research or recordkeeping purposes. It has not been altered or updated after the date of archiving. Web pages that are archived on the Web are not subject to the Government of Canada Web Standards. As per the Communications Policy of the Government of Canada, you can request alternate formats on the "Contact Us" page.
The following information is to ensure a clear understanding of the basic concepts that are being measured, the underlying survey methodology (how the concepts are measured), and key aspects of data quality. This information will provide you with a better understanding of the strengths and limitations of the data, and of how they can be effectively used and analysed. The information may be of particular importance when making comparisons with data from other surveys or sources of information, and in drawing conclusions regarding changes over time.
The target population consists of all statistical establishments (sometimes referred to as firms or units) classified as Newspaper Publishers according to the North American Industry Classification System (NAICS) during the reference year 2006. This industry comprises establishments, known as newspaper publishers, primarily engaged in carrying out operations necessary for producing and distributing newspapers, including gathering news, writing news columns, feature stories and editorials, and selling and preparing advertisements. These establishments may publish newspapers in print or electronic form. Data users who wish to learn more about NAICS, its underlying principles, and many of the other statistical concepts discussed in this brief summary, are referred to http://www.statcan.gc.ca/ under “Definitions, Data sources and Methods”, Standard Classifications, Industry for NAICS 2002.
Even though the basic objective of the survey is to produce estimates for the whole industry—all incorporated and unincorporated businesses—not all businesses are surveyed. Rather, a sample is surveyed and the portion eligible for sampling is defined as all statistical establishments with revenue above a certain threshold. This low revenue threshold varied between provinces. The excluded portion represents a substantial proportion of the industry in terms of number of establishments (46%), but its contribution to the overall industry revenue is only about 2.5%. These excluded establishments are accounted for in the final estimates through the use of administrative data. However, only basic information is obtained from administrative sources, i.e., total revenue, expenses, depreciation and salaries, wages and benefits. Detailed characteristics such as revenue by type of service, and detailed expense items are collected only for surveyed establishments.
The frame is the list of establishments from which the portion eligible for sampling is determined and the sample is taken. The frame provides basic information about each firm, including: address, industry classification, and information from administrative data sources (as discussed above). The frame is maintained by Statistics Canada’s Business Register, and is updated using administrative data.
Prior to the selection of a random sample, establishments are classified into homogeneous groups (i.e., groups with the same NAICS codes, same geography (province/territory), and same business type (incorporated/unincorporated) attributes). Quality requirements are targeted, and then each group is divided into sub-groups called strata: take-all, must-take, and take-some.
The take-all stratum includes the largest firms in terms of performance (based on revenue) in an industry. Every firm is sampled, which means each firm represents itself and is given a weight of one. The must-take stratum is also comprised of self-representing units, but these are selected on the basis of complex structure characteristics (multi-establishment, multi-legal, multi-NAICS, or multi-province enterprises). Units in the take-some strata are subjected to simple random sampling.
Finally, the sample size is increased, mostly to compensate for firms that no longer belong in the industry, i.e., they have gone out of business, changed their primary business activity, are inactive, or are duplicates on the frame. After removing such firms, the sample size for this survey was 135 collection entities.
Data are collected through a mail-out/mail-back process, while providing respondents with the option of telephone or other electronic filing methods. The statistical establishment is used as the sampling unit, but selected establishments belonging to the same company, the same industry, and the same province are aggregated to create a collection entity. This reduces respondent burden and simplifies collection. Therefore, companies with production in more than one province are mailed one questionnaire per province and instructed to report for all Canadian operations.
Several checks are performed on the collected data to verify internal consistency and identify extreme values. Where information is missing, imputation is performed using either a "nearest neighbour" procedure (donor imputation), using historical data where available or finally, using administrative data as a proxy for reported data.
As part of the production of final numbers (referred to as estimation), data for companies operating in more than one province or territory are allocated to the provincial level. Administrative data are used to estimate for the portion of the industry that was excluded from survey activity (i.e., small firms whose revenues fell below cut-off thresholds). Sampled data are then weighted to produce estimates representative of the target population.
Prior to publication, combined survey results are analysed for comparability; in general, this includes a detailed review of: individual responses (especially for the largest companies), general economic conditions, historic trends, and comparisons with administrative data (e.g., income tax, goods and services tax, payroll deductions records, industry and trade association sources).
While considerable effort is made to ensure high standards throughout all stages of collection and processing, the resulting estimates are inevitably subject to a certain degree of error. These errors can be broken down into two major types: non-sampling and sampling.
Non-sampling error is not related to sampling and may occur for many reasons. For example, non-response is an important source of non-sampling error. Population coverage, differences in the interpretation of questions, incorrect information from respondents, and mistakes in recording, coding and processing data are other examples of non-sampling errors.
The response rate for this survey was 98.7% in reference year 2006, after taking into account the factors discussed in this statement.
Sampling error occurs because population estimates are derived from a sample of the population rather than the entire population. Sampling error depends on factors such as sample size, sampling design, and the method of estimation. An important property of probability sampling is that sampling error can be computed from the sample itself by using a statistical measure called the coefficient of variation (CV). The assumption is that over repeated surveys, the relative difference between a sample estimate and the estimate that would have been obtained from an enumeration of all units in the universe would be less than twice the CV, 95 times out of 100. The range of acceptable data values yielded by a sample is called a confidence interval. Confidence intervals can be constructed around the estimate using the CV. First, we calculate the standard error by multiplying the sample estimate by the CV. The sample estimate plus or minus twice the standard error is then referred to as a 95% confidence interval.
For the Newspaper Publishers Survey, CVs were calculated for each estimate. Generally, the more commonly reported variables obtained excellent CVs (5% or less), while the less commonly reported variables were associated with higher but still very good CVs (under 10%). The CVs are available upon request.