Statistics Canada aims to ensure that the information it produces provides a consistent and coherent picture of the Canadian economy, society and environment, and that its various datasets can be analyzed together and in combination with information from other sources.
To this end, the Agency pursues three strategic goals:
- The use of consistent collection and processing methods for the production of statistical data across surveys.
This Policy deals with the second of these strategic goals. It provides a framework for reviewing, documenting, authorizing, and monitoring the use of standard names and definitions for populations, statistical units, concepts, variables and classifications used in Statistics Canada's programs. Standards for specific subject-matter areas will be issued from time to time under this Policy as required.
Statistics Canada aims to use consistent names and definitions for populations, statistical units, concepts, variables, and classifications used in its statistical programs. To this end:
- The Agency will build up a database of names and definitions used in its programs and make this database accessible to users and other players in the statistical system.
This policy applies to disseminated data however collected, derived or assembled, and irrespective of the medium of dissemination or the source of funding. This policy may also be applied to data at the stage of collection and processing at Statistics Canada.
Guidelines for the development and documentation of standards
These guidelines describe the requirements and give guidance for the development and documentation of standard names and definitions of populations, statistical units, concepts, variables and classifications. Section B defines the terminology; guidelines follow in Section C.
For purposes of these guidelines the following terms are used.
Population: The set of statistical units to which a dataset refers.
Concept: A general or abstract idea that expresses the social and/or economic phenomenon to be measured.
Statistical unit: The unit of observation or measurement for which data are collected or derived. The following list provides examples of standard statistical units that have been defined.
Variable: A variable consists of two components, a statistical unit and a property. A property is a characteristic or attribute of the statistical unit.
Classification: A classification is a systematic grouping of the values that a variable can take comprising mutually exclusive classes, covering the full set of values, and often providing a hierarchical structure for aggregating data. More than one classification can be used to represent data for a given variable.
Example: The following is an example of the variable: Age of Person.
Concept: Based on the subjects used by Statistics Canada to organize its statistical products and metadata, the variable Age of Person is listed under the concept of Population and Demography.
Statistical unit and property: The statistical unit and property that define this variable are Person and Age respectively. Person refers to an individual – this is the unit of analysis for most social statistics programmes. Age refers to the age of a person (or subject) of interest at last birthday (or relative to a specified, well-defined reference date).
Classification: Different classifications can be used to represent data for this variable. These classifications include: Age Categories, Five-year Age Groups; and Age Categories, Life Cycle Groupings.
The standard names and definitions of populations, statistical units, concepts, variables and classifications will be stored in the Integrated Metadatabase (IMDB). In the case of variables, the name stored in the IMDB will include a representation type, in addition to the statistical unit and property. In the age example given here, the full name of the variable in the IMDB would be Category of Age of Person. The representation type Category indicates that it is a categorical variable, which will be represented by a classification of age groups.
Each standard should have the following characteristics:
- describe the concept that the standard addresses when appropriate;
- identify the statistical unit(s) to which it applies;
- provide a name and definition of each variable included in the standard;
- provide the classification(s) to be used in the compilation and dissemination of data on each variable.
The most detailed level of a classification will always be included in a standard. Recommended and optional aggregation structures may also be present.
Concepts shall be described in relation to a framework when possible.
Every variable shall be given a name, in both official languages, which, once given, cannot be used to denote any other variable. Variables shall be defined with explanatory notes in terms of a property and the statistical unit to which it applies. Additionally, in the IMDB, the representation type will be defined.
Every classification shall be given a name, in both official languages, which, once given, cannot be used to denote any other classification. Classifications shall be defined, with exclusions listed and explanatory notes given, where required.
Every class shall be given a name, in both official languages, which, once given, cannot be used to denote any other grouping for the referenced variable within a given "family" of classifications (i.e. a given classification and all its variants). Classes shall be defined, with exclusions listed and explanatory notes given, where required.
The most frequently used populations shall be given a name, in both official languages, which, once given, cannot be used to denote any other population. These populations shall be defined with explanatory notes.
Every statistical unit shall be given a name, in both official languages, which, once given, cannot be used to denote any other statistical unit. Statistical units shall be defined with explanatory notes.
A standard shall be accompanied by a statement of conformity to relevant internationally recognized standards, or a description of the deviations from such a standard and, where possible, a concordance with the referenced standard.
Where a standard replaces an earlier one, a concordance between the old and the new shall be given.
A standard shall include a statement regarding the degree to which its application is compulsory. The different degrees are, in descending order of compulsion:
- program-specific standard: a standard adopted by a statistical program, and which is registered with Standards Division, to ensure consistency in a series over time periods.
- Date modified: