"A variable is a characteristic of a statistical unit being observed that may assume more than one of a set of values to which a numerical measure or a category from a classification can be assigned."
In the above definition, the key components are:
These components constitute the standard components used in this information package to name and structure variables. A statistical organization publishing data has to adopt a standard way to name and structure the variables to which the data relate. From the point of view of the users, they must be able to recognize the same structure underlying the name of variables whichever sub-division of the organization is producing the data and whatever the subject area being studied. From the point of view of the management of the information about the data (referred to as metadata) published by the organization, it is necessary to adopt a standard naming convention and structure for the variables in order to efficiently store the metadata in a central database, allow efficient extraction, and permit efficient search by users.
The naming convention and structure referred to above are adapted from the International Standards Organisation (ISO) standard; Information technology - Metadata registries (MDR) or ISO 11179. This standard is being adopted by an increasing number of National Statistical Organizations.
When it is decided that a statistical program will produce data to illuminate a certain subject area, the responsible analysts have to determine:
Imagine a time series table, applying to Canada, where the headings of the columns consist of the reference periods and the headings of the rows contain the name of the general characteristics being measured for the statistical unit being observed, e.g. "Total income of all Households". The documentation of variables you are consulting defines the characteristic being measured and the statistical unit being observed. The cells along the rows contain the numerical values using the unit of measure indicated in the documentation of the variables.
In most cases, the data in the table will be broken down by geographic areas within Canada, e.g. provinces and territories, or Canadian Metropolitan Areas, etc. The variable documentation informs the users of this geographic breakdown. In most cases, the value of the general characteristic being measured will be broken down by sub-categories of the characteristic and/or of the statistical unit as well or in other words by classes within classifications, e.g. classes of income sources, or classes of industries. The variable documentation always informs the users on the different classes of the specific classification(s) used to detail the data in the table. The names of these classes and groups of classes appear as the headings of rows in the table.