The Canadian Consumer Price Index Reference Paper
Chapter 5 – Sample Strategy and Price Collection

Warning View the most recent version.

Archived Content

Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

Text begins

5.1 The number and variety of transactions that consumers engage in is immense. It would be neither practical nor affordable to collect prices for all transactions of products sold in all outletsNote 1 to compile the Consumer Price Index (CPI). Therefore a sample strategy is necessary.

5.2 The CPI has always had a policy of adopting the most appropriate measurement methodologies for each of its elementary indices. This has led to sometimes very divergent sampling practices in different parts of the CPI. This chapter will cover the range of sampling practices currently used in the CPI, first focusing on the general sampling approach which covers more than 50% of the CPI by basket weight.Note 2 The chapter will then discuss some of the more specific sampling approaches in other parts of the basket, including full universe price coverage, cut-off sampling and price modelling.

5.3 Not all elementary price indices are estimated with observed prices. For some elementary aggregates, particularly the “catch-all” classes with heterogeneous product mixes that typically represent more marginal consumer expenditures, price collection is not practical or necessary. In these cases, imputations are made whereby the price movement of the elementary aggregate is estimated via proxy, using the price movement of a donor class.

General Sampling Approach

5.4 The general sampling approach for the CPI can be seen as a three-stage survey design. The first stage covers geography, the second is for outlets and the third stage is for the products. Even though the product component appears at the last stage, types of products have a major influence during the second stage when outlets are selected. Figure 5.1 depicts this general approach for selecting sampling units, starting from geography, then using the product type information for outlet selection and then assigning Representative Products (RPs) to be observed within each outlet. More details on each of these stages are provided in the following paragraphs.

Figure 5.1 – Stages of Sampling Strategy in the Consumer Price Index

Figure 5.1 – Stages of sampling strategy in the Consumer Price Index

Description for Chart 1

This figure demonstrate the stages of sampling strategy in the Consumer Price Index.

The first stage is Geography

  • Standard Geographical Classification
  • Business Register

The second stage is our Outlet which includes our Product components

  • Business Register
  • Quarterly Retail Commodity Survey
  • Service Industry Surveys

The last stage, Representative Product

  • NAICS/Quarterly Retail Commodity Survey/Service Industry Surveys
  • Market Intelligence.

5.5. The geographical sampling unit is primarily the Census Sub-Division (CSD) as defined by Statistics Canada’s Standard Geographical Classification (SGC). CSDs are similar to municipalities and are chosen for the CPI sample based on information such as population counts and economic activity.Note 3 The CSDs are selected as the sampling unit mainly because they are stable over time and because every location in the Business Register (BR),Note 4 the frame for outlet selection, is mapped to CSDs.

5.6 To facilitate outlet sampling and price collection management, CPI collection areas have been defined. For small to medium size CSDs, collection areas correspond to the CSD. In cases where the CSD is too large to represent one collection area it is broken down into smaller areas which are amalgamations of Census Tracts (CTs), which can be seen as equivalent to neighbourhoods. The number of collection areas within a CSD depends on the number and variety of retail and services locations as well as the size, in terms of square kilometres, of the CSD.Note 5

5.7 The CPI survey frame is used to select the outlets where price collection is to take place. The frame was built using existing Statistics Canada sources, mainly the BR, the quarterly Retail Commodity Survey (RCS)Note 6 and services industry surveys such as the monthly Food Services and Drinking Places Survey. The RCS and services industry surveys are used to link the CPI product classification to the industry classification in the BR, which is the North American Industrial Classification System (NAICS).Note 7

5.8 A major feature of the CPI survey frame is that it gives the revenue, according to the BR, of each outlet by RCS commodity class or service industry using the data reported in various Statistics Canada surveys. This characteristic of the frame helps in dealing with big retailers such as the department stores, which typically sell a range of products. It facilitates the selection of outlets which are among the most popular in each product class or service industry.

5.9 The outlet sampling process for the CPI is done in two phases. The first phase, pre-contact sampling, is designed to validate the information from the BR, such as activity status, industrial classification and contact information. Depending on the industrial classification, additional questions may be asked to determine whether specific products are sold. This phase is done quarterly, by telephone. The second phase, field sampling, consists of selecting a subset of outlets from the pre-contact output (after all
out-of-scope and refusal units are removed) to be sent for field collection by specialized Statistics Canada employees, known as price interviewers. Remaining in-scope outlets that were not selected for the field sample are kept as a replacement list.

5.10 After outlets are selected, a set of RPs is assigned to be observed in each outlet. Subject matter experts use external databases and market research to help define RPs in a way that strikes a balance between specificity and flexibility. Price interviewers then select specific products that meet the RP definitions. This approach ensures the interviewers have a clear understanding about what kind of product to select (keeping intact the matched-model approach of the CPI) while at the same time providing leeway to choose products that are locally popular (upholding the representativeness of the estimated elementary indices).

5.11 The RP list may be different from one outlet to another according to the information obtained from the CPI survey frame and the pre-contact phase. For example, one pharmacy may be designated to have health care goods as well as some food items (such as milk and bread) collected, if those products correspond to its main streams of revenue. However, this may not be the case for all pharmacies in the CPI sample.

5.12 The number of RPs assigned to each elementary aggregate depends on the weight and the complexity of measuring price change for the given product class. In a complex elementary aggregate, one which may have a lot of heterogeneous products included, several RPs will likely be assigned. In a more simple elementary aggregate, just one or two RPs might be enough to measure price change adequately.

5.13 An RP assigned to an outlet is called a Target Product Offer (TPO). The TPO acts as the sample intention. The sampling allocation scheme allots a number of TPOs to each elementary aggregate, taking into account the basket weight, price variability and cost of collection for each elementary aggregate. The objective is to allocate the available sample optimally in order to estimate elementary indices of the best possible quality.Note 8

5.14 Every month the price interviewers observe Product Offers (POs) for TPOs. The monthly process begins with a detailed sample request from headquarters to the interviewers. Most of this request is the same as in the previous month, since the CPI follows the matched-model approach. The sample request is loaded into Computer-Assisted Personal Interview (CAPI) devices. Carrying these devices, the interviewers visit each of the outlets in their particular workload. In each outlet, the interviewer finds the required POs and enters their prices and characteristics into the devices.

5.15 When a PO is advertised as being “on special” interviewers record this in the CAPI device. If the item cannot be found because the shelf is empty, the interviewer consults with a store manager and determines whether it is temporarily out of stock or is no longer carried by the outlet. Either way, this information is recorded in the device and transmitted back to Statistics Canada headquarters.

5.16 If the PO being sought is determined to be no longer carried, a substitute PO is selected and this information is recorded in the CAPI device. For especially complex POs such as high-tech goods and items of clothing, the interviewers also fill out forms providing additional details about the characteristics of the substitute PO. This additional information assists the analysts at headquarters who assess the extent of quality change and estimate an appropriate adjustment.Note 9

5.17 When an interviewer must select a substitute PO, as just described, he or she is guided by the RP description, a set of detailed product specifications loaded in the CAPI devices. The interviewer is asked to select a substitute PO that fits the description.

5.18 For many RPs it is not necessary to observe POs every month, either because their prices tend to change less frequently or because they are only available in specific times of the year. For one example, tuition fees typically change only once a year, in a predictable month, so they are collected only in that month. For another, Internet access service fees are collected every second month because their prices change infrequently. In fact, every RP description specifies the month(s) of collection.

5.19 If a price change is known to take place outside an RP’s default pricing schedule Statistics Canada will conduct a special pricing.Note 10 This ensures that elementary indices represent, as much as possible, price change in a timely manner.Note 11

5.20 As for the timing of collection within the month, most POs are observed in the first two weeks, with the first week being defined as the one containing at least three business days from the calendar month. Gasoline prices are an exception due to their typical intra-month volatility and thus are collected in four weeks.

5.21 When the POs have been collected they are sent to Statistics Canada headquarters by encrypted digital transmission, where they are compared against the original sample request to determine the sample’s completeness and conformity to requirements. Thereafter, the POs undergo further review and processing. Any unusual price movements are carefully checked to ascertain their validity. Corrections are made if necessary. Where POs have been substituted, the degree of quality difference is assessed and an appropriate quality adjustment is made if necessary. Steps are also taken to ensure the POs include applicable taxes. Finally, with all checks and adjustments completed, the elementary price indices are calculated.

Specific Sampling Approaches

5.22 Exceptions are made to the general sampling approach. These exceptions are intended to capture price change for elementary aggregates where information on the universe of consumer transactions is available or where the attributes of the products within the elementary aggregate are complex and require different techniques.

5.23 Elementary aggregates where full information on the universe of consumer transactions is available include those goods or services that have only one market and/or seller. Examples include passport fees and drivers’ licenses. For these there is no sample drawn because all prices are collected and used in the CPI.

5.24 For some elementary aggregates in which there is one seller in a particular geographical stratum,Note 12 a sample of outlets and products is not required. However, in these cases the first stage of the general sampling approach, in which collection areas are selected, is still necessary. Usually, the collection of these POs is done by employees at Statistics Canada headquarters rather than by interviewers in the field.

5.25 The prices used to calculate the rent price index come from the Labour Force Survey, which uses a probabilistic sample.Note 13

5.26 Some elementary aggregates in the CPI follow cut-off sampling.Note 14 When there is information available on the outlet and/or product universe the goal is to maximize the coverage of both. Most times information on market composition comes from third party administrative databases, often available via the Internet. These databases are used to rank outlets and/or products so the sample covers a majority of the market. Once TPOs are assigned via cut-off sampling, POs are collected at Statistics Canada headquarters.

5.27 There are some elementary aggregates where the target population purchases bundles of services rather than individual productsNote 15 or where prices are based on a set of specific conditions.Note 16 The CPI uses the profiles method to capture price change for these products.

5.28 Representative bundles or consumer profiles are selected using available market information. The intention is to cover a majority of the services and outlets that are available in a particular market. In these cases POs for the defined consumer profiles are observed at Statistics Canada headquarters from large industry databases.

5.29 The profiles method is another application of the matched-model framework. By observing POs of identical consumer profiles every month, this method ensures that the quantity and quality of bundled services are constant over time and that the CPI reflects pure price change. As with the entire CPI sample, it is important that consumer profiles are reviewed and updated regularly to ensure that representative consumption bundles are being priced. When profiles must be changed because the component products are determined to be new or of different quality, then quality adjustment techniques should be applied.

5.30 Some elementary aggregates are characterized by products whose prices are determined not by their physical characteristics, but by their intellectual content and novelty. For these elementary aggregates the CPI uses the bestsellers method to estimate price change.Note 17 In these cases POs are observed in the field by price interviewers as well as on the Internet.

5.31 There are two elementary aggregates which use modelling to estimate price change: the Mortgage interest cost and the Homeowners’ replacement cost indices, two components of the owner-occupied accommodation price index. Unlike the treatment of other durable goods in the CPI, owner-occupied housing follows a user-cost approach, which aims to measure the implicit price of the flow of services coming from a fixed stock of owned dwellings. This price index requires special measurement methods. Using collected data from the Survey of Household Spending,Note 18 along with other data inputs on house prices and interest rates, price change is estimated.Note 19

Date modified: