Complete a survey on your experience using this Departmental Results Report.
Catalogue no. 11-628-X
ISSN 2368-1160
Complete a survey on your experience using this Departmental Results Report.
Catalogue no. 11-628-X
ISSN 2368-1160
Statistics Canada ("the agency") is a member of the Innovation, Science and Economic Development portfolio.
Statistics Canada's role is to ensure that Canadians have access to a trusted source of statistics on Canada that meets their highest priority needs.
The agency's mandate derives primarily from the Statistics Act. The Act requires that the agency collects, compiles, analyzes and publishes statistical information on the economic, social, and general conditions of the country and its people. It also requires that Statistics Canada conduct the census of population and the census of agriculture every fifth year, and protects the confidentiality of the information with which it is entrusted.
Statistics Canada also has a mandate to co-ordinate and lead the national statistical system. The agency is considered a leader, among statistical agencies around the world, in co–ordinating statistical activities to reduce duplication and reporting burden.
More information on Statistics Canada's mandate, roles, responsibilities and programs can be found in the 2019–2020 Main Estimates and in the Statistics Canada 2019–2020 Departmental Plan.
The Quarterly Financial Report:
Statistics Canada has the authority to collect and spend revenue from other federal government departments and agencies, as well as from external clients, for statistical services and products.
This quarterly report has been prepared by management using an expenditure basis of accounting. The accompanying Statement of Authorities includes the agency's spending authorities granted by Parliament and those used by the agency consistent with the Main Estimates for the 2019–2020 fiscal year. This quarterly report has been prepared using a special purpose financial reporting framework designed to meet financial information needs with respect to the use of spending authorities.
The authority of Parliament is required before moneys can be spent by the Government. Approvals are given in the form of annually approved limits through appropriation acts or through legislation in the form of statutory spending authority for specific purposes.
The agency uses the full accrual method of accounting to prepare and present its annual departmental financial statements that are part of the departmental results reporting process. However, the spending authorities voted by Parliament remain on an expenditure basis.
This section highlights the significant items that contributed to the net increase in resources available for the year, as well as actual expenditures for the quarter ended June 30.
This bar graph shows Statistics Canada's budgetary authorities and expenditures, in thousands of dollars, as of June 30, 2018 and 2019:
Chart 1 outlines the gross budgetary authorities, which represent the resources available for use for the year as of June 30.
Total authorities available for 2019–2020 have increased by $23.8 million, or 4.0%, from the previous year, from $598.4 million to $622.2 million (Chart 1). This net increase is mostly the result of the following:
The variance is also explained by the reception of authorities at different quarters throughout the year.
In addition to the appropriations allocated to the agency through the Main Estimates, Statistics Canada also has vote net authority within Vote 1, which entitles the agency to spend revenues collected from other federal government departments, agencies, and external clients to provide statistical services. The vote netting authority is stable at $120 million when comparing the first quarter of fiscal years 2018–2019 and 2019–2020.
Year-to-date net expenditures recorded to the end of the first quarter increased by $10.2 million, or 7.9% from the previous year, from $129.3 million to $139.5 million (see Table A: Variation in Departmental Expenditures by Standard Object).
Statistics Canada spent approximately 28% of its authorities by the end of the first quarter, compared with 27% in the same quarter of 2018–2019.
Departmental Expenditures Variation by Standard Object: | Q1 year-to-date variation between fiscal year 2018–2019 and 2019–2020 | |
---|---|---|
$'000 | % | |
Note: Explanations are provided for variances of more than $1 million. | ||
(01) Personnel | 7,819 | 6.0 |
(02) Transportation and communications | 949 | 40.1 |
(03) Information | 258 | 27.7 |
(04) Professional and special services | 598 | 12.1 |
(05) Rentals | 875 | 19.6 |
(06) Repair and maintenance | -31 | -24.2 |
(07) Utilities, materials and supplies | -54 | -28.7 |
(08) Acquisition of land, buildings and works | -7 | -100.0 |
(09) Acquisition of machinery and equipment | -2,025 | -71.8 |
(10) Transfer payments | - | - |
(12) Other subsidies and payments | -905 | -76.9 |
Total gross budgetary expenditures | 7,477 | 5.1 |
Less revenues netted against expenditures: | ||
Revenues | -2,724 | -15.7 |
Total net budgetary expenditures | 10,201 | 7.9 |
Personnel: The increase is mainly due to the ratification of collective agreements and an overall increase in the agency's activities.
Acquisition of machinery and equipment: The decrease is mainly due to a temporary variance related to an invoice for which the cost of the service request has been reduced later in the fiscal year.
Revenues: The decrease is primarily the result of timing differences in the receipt of funds for scheduled key deliverables.
The variance is also explained by timing differences of payments in this period compared to last fiscal year.
Statistics Canada is currently expending significant effort in modernizing its business processes and tools, in order to maintain its relevance and maximize the value it provides to Canadians. As a foundation piece for some of these efforts, the agency is working in collaboration with Shared Services Canada and Treasury Board of Canada Secretariat, Office of the Chief Information Officer, to ensure the agency has access to adequate IT services and support to attain its modernization objectives and successfully transition its infrastructure to the cloud. Activities and related costs are projected based on various assumptions that can change, depending on the nature and degree of work required to accomplish the initiatives. Risks and uncertainties are being mitigated by the agency's strong financial planning management practices and business architecture.
There were no major changes to the departmental operations during this quarter. Minor changes in program activities with financial implications include:
Approved by:
Anil Arora, Chief Statistician
Kathleen Mitchell, Acting Chief Financial Officer
Ottawa, Ontario
Signed on: August 21, 2019
Fiscal year 2019–2020 | Fiscal year 2018–2019 | |||||
---|---|---|---|---|---|---|
Total available for use for the year ending March 31, 2020Tablenote 1 | Used during the quarter ended June 30, 2019 | Year-to-date used at quarter-end | Total available for use for the year ending March 31, 2019Tablenote 1 | Used during the quarter ended June 30, 2018 | Year-to-date used at quarter-end | |
in thousands of dollars | ||||||
|
||||||
Vote 1 — Net operating expenditures | 430,647 | 121,622 | 121,622 | 409,564 | 113,579 | 113,579 |
Statutory authority — Contribution to employee benefit plans | 71,552 | 17,865 | 17,865 | 68,855 | 15,707 | 15,707 |
Total budgetary authorities | 502,199 | 139,487 | 139,487 | 478,419 | 129,286 | 129,286 |
Fiscal year 2019–2020 | Fiscal year 2018–2019 | |||||
---|---|---|---|---|---|---|
Planned expenditures for the year ending March 31, 2020 | Expended during the quarter ended June 30, 2019 | Year-to-date used at quarter-end | Planned expenditures for the year ending March 31, 2019 | Expended during the quarter ended June 30, 2018 | Year-to-date used at quarter-end | |
in thousands of dollars | ||||||
Expenditures: | ||||||
(01) Personnel | 540,787 | 137,419 | 137,419 | 512,332 | 129,600 | 129,600 |
(02) Transportation and communications | 15,413 | 3,318 | 3,318 | 16,557 | 2,369 | 2,369 |
(03) Information | 7,559 | 1,191 | 1,191 | 7,198 | 933 | 933 |
(04) Professional and special services | 33,048 | 5,539 | 5,539 | 29,945 | 4,941 | 4,941 |
(05) Rentals | 10,676 | 5,343 | 5,343 | 12,207 | 4,468 | 4,468 |
(06) Repair and maintenance | 560 | 97 | 97 | 1,241 | 128 | 128 |
(07) Utilities, materials and supplies | 1,845 | 134 | 134 | 2,589 | 188 | 188 |
(08) Acquisition of land, buildings and works | 516 | - | - | 172 | 7 | 7 |
(09) Acquisition of machinery and equipment | 11,635 | 797 | 797 | 10,419 | 2,822 | 2,822 |
(10) Transfer payments | 100 | - | - | 100 | - | - |
(12) Other subsidies and payments | 60 | 272 | 272 | 5,659 | 1,177 | 1,177 |
Total gross budgetary expenditures | 622,199 | 154,110 | 154,110 | 598,419 | 146,633 | 146,633 |
Less revenues netted against expenditures: | ||||||
Revenues | 120,000 | 14,623 | 14,623 | 120,000 | 17,347 | 17,347 |
Total revenues netted against expenditures | 120,000 | 14,623 | 14,623 | 120,000 | 17,347 | 17,347 |
Total net budgetary expenditures | 502,199 | 139,487 | 139,487 | 478,419 | 129,286 | 129,286 |
Geography | Month |
---|---|
201906 | |
Percent | |
Canada | 0.6 |
Newfoundland and Labrador | 1.4 |
Prince Edward Island | 0.8 |
Nova Scotia | 1.5 |
New Brunswick | 1.7 |
Quebec | 1.1 |
Ontario | 1.3 |
Manitoba | 1.1 |
Saskatchewan | 1.6 |
Alberta | 0.8 |
British Columbia | 1.2 |
Yukon Territory | 1.1 |
Northwest Territories | 0.1 |
Nunavut | 1.1 |
The objective of this project was to create a linked dataset that can be used to examine a national cohort (save Quebec) of persons who died (for any age group of interest) in relation to the characteristics and intensity of end-of-life care and to identify patient, disease and healthcare factors associated with variations in care intensity and location of death.
To achieve this objective, death records in the Canadian Vital Statistics Death Database (CVSD) from 2008 to 2017 were linked to patient records in the Discharge Abstract Database (DAD) and the National Ambulatory Care Reporting System (NACRS) from 2004/2005 to 2017/2018 and the Ontario Mental Health Reporting System (OMHRS) from 2006/2007 to 2017/2018. Statistics Canada does not have Quebec hospitalization data as part of its data holdings and thus hospitalizations that occurred in the province of Quebec are not included in the linked datasets. Statistics Canada also does not have death data for decedents in the Yukon Territories for death year 2017.
Canadian Vital Statistics Death Database
The Canadian Vital Statistics Death Database (CVSD) is a census of all deaths occurring in Canada each year. Deaths are reported by the provincial and territorial Vital Statistics Registries to Statistics Canada; the information provided includes demographic and cause of death information. Cause-of-death information is coded using the version of the International Classification of Diseases (ICD) in effect at the time of death. Records eligible for record linkage were deaths that occurred from January 1, 2008 through December 31, 2017, excluding Yukon for 2017.
In addition to the variables from the CVSD, the file includes variables from the Postal Code Conversion File+ (PCCF+) for each linked record. The PCCF+ was generated using the CVSD variable DEA_Q150 (usual residence of the deceased: postal code).
Discharge Abstract Database
The Discharge Abstract Database (DAD) includes administrative, clinical and demographic information on hospital discharges (including in-hospital deaths, sign-outs and transfers) from all provinces and territories, except Quebec. Over time, the DAD has also been used to capture data on day surgery, long-term care, rehabilitation and other types of care. DAD data for fiscal years 2004/2005 to 2017/2018 were included in the linkage.
In the DAD, jurisdiction-specific instructions for collection of data elements evolve over time. Collection of each data element may be mandatory, mandatory if applicable, optional or not applicable. Collection requirements can vary by jurisdiction and by data year. Researchers will find the listings of DAD data elements under the heading “Data Elements” at the DAD Metadata website. The documents on the website include information on mandatory versus optional collection status for each data element by jurisdiction, which is key to understanding coverage of data elements in the DAD.
National Ambulatory Care Reporting System
The National Ambulatory Care Reporting System (NACRS) contains data for hospital-based and community-based ambulatory care including day surgery, outpatient and community-based clinics, and emergency departments. Client visit data are collected at time of service in participating facilities from several jurisdictions. NACRS data for fiscal years 2004/2005 to 2017/2018 were included in the linkage.
For details on the provincial data coverage, please refer to the Data Quality Documentation, available under the “Data Quality” section of the NACRS Metadata website.
In NACRS, jurisdiction-specific instructions for collection of data elements evolve over time. Similar to DAD, collection of each data element may be mandatory, mandatory if applicable, optional or not applicable. Collection requirements can vary by jurisdiction and by data year.
Researchers will find the listings of NACRS data elements under the “Data Elements” section of the NACRS Metadata website. The documents on the website include information on mandatory versus optional collection status for each data element by jurisdiction, which is key to understanding coverage of data elements in NACRS.
Ontario Mental Health Reporting System
The Ontario Mental Health Reporting System (OMHRS) contains data for all individuals receiving adult mental health services in Ontario, in addition to some individuals receiving services in youth inpatient beds and selected facilities in other provinces starting in fiscal year 2006/07. Information regarding mental and physical health, social supports and service use, care planning, outcome measurement, quality improvement, and case-mix funding applications are all part of the OMHRS. For this record linkage, the OMHRS files covering the fiscal years from 2006/07 to 2017/18 were linked to the CVSD. Researchers will find the listings of OMHRS data elements under the heading Data elements at the OMHRS Metadata website.
A single cohort file was produced of all the individual CVSD records between 2008 through 2017, including those that linked and those that did not link to the DAD, NACRS or OMHRS. A random, unique identifier (variable name: STC_ID) was generated for each record on the CVSD. Each record with a valid postal code has additional information added from the PCCF+. Names and other personal identifiers have been removed from the file.
Separate output files were created for each year of the DAD and for each year of the NACRS. A single cumulative file was created for OMHRS. Only records that linked to the CVSD are included in these outcome files. Since the DAD, NACRS and OMHRS are transactional files, the STC_ID from the CVSD cohort file was included on all records to identify those individuals with multiple transactions within a dataset and across datasets. Merging the STC_ID across datasets (i.e. CVSD to DAD, NACRS and OMHRS) will allow the larger picture of health interventions for an individual to be analysed.
Researchers can choose to use the outcome files as event-based (each row of data represents a hospitalization) or person-based (each row of data represents an individual). In order to use a file as a person-based file, the researcher must transform the data to include all hospital information for one person as one record (one row on the data file).
The linked data should not be used to produce official mortality statistics. Official counts and rates of mortality are available on the Statistics Canada website or can be generated by requesting use of the Canadian Vital Statistics Death Database which is accessible through the RDC or by requesting a custom tabulation from Statistics Canada (statcan.hd-ds.statcan@statcan.gc.ca).
The linked data cannot be used to produce statistics related to institutions and any outputs at the institution level will be restricted as per the vetting rules.
Institution information can be used as a method to generate other information (e.g., the postal code of the institution can be used to determine distance to a care facility) but cannot be used as an outcome of interest.
1. Indicate which type of corporation this report covers.
2. Is the reporting entity part of a Canadian consolidation?
3. Does this reporting entity have investments in partnerships or joint ventures?
4. Indicate the accounting standard used to complete this questionnaire.
5. Indicate the currency used to complete this survey.
6. What are the start and end dates of this enterprise's reporting period for the quarter ending:
7. Report your assets
Total Assets
8. Report your liabilities
Total Liabilities
9. Report your equity
Closing balance
Total equity
Total liability and total equity
10. What period does this income statement cover?
11. Report your revenue.
Specify all major items within other revenues
Total revenue
12. Report your expenses.
Specify all major items within other expenses
Total expenses
13. Report your income
14. Report other disclosures
15. Allocate the changes to selected assets and liabilities.
16. Distribute the selected balance sheet items by province
Catalogue no. 37260001
Issue no. 2022001
Version 2.1
Data Exploration and Integration Lab (DEIL)
Centre for Special Business Projects (CSBP)
Release date: November 28, 2022
A first version of the database was realized with funding by Indigenous Services Canada (ISC) and Crown-Indigenous Relations and Northern Affairs Canada (CIRNAC). This updated version, with inclusion of Official Language Minority Schools, was realized with funding from Treasury Board Secretariat (TBS) and consultation from Canadian Heritage (PCH). Valuable feedback and comments were provided by these organizations and they are gratefully acknowledged.
For the purpose of exploring open data for official statistics and to support geospatial research across various domains, the Data Exploration and Integration Lab (DEIL) undertook a project to create an accessible and harmonized database of educational facilities released as open data by various levels of government within Canada.Footnote 1 This document details the process of collecting, compiling, and standardizing the individual datasets of educational facilities that were used to create an update to the second version of the Open Database of Educational Facilities (ODEF), which is made available under the Open Government Licence – Canada.
In its current version (version 2.1), the ODEF contains 18,982 individual records. For this update to the database, information on public Official Language Minority Schools (OLMS) was added to the existing ODEF version 2.0. An OLMS is defined as an English-speaking school in Quebec, or a French-speaking school outside of Quebec. 967 existing records were identified as an OLMS and 38 new records were added for version 2.1. As the OLMS data were collected more recently than the ODEF data, some facilities had addresses updated to reflect changes. Additionally, latitude and longitude coordinates of OLMS facilities were updated for the matched ODEF records with missing data. CMA information was added with a spatial join using the SF packageFootnote 2 in R for all records with available coordinate data to be consistent with the OLMS. The database is expected to be updated periodically as new open datasets become available. The ODEF is provided as a compressed comma separated values (CSV) file.
This dataset is one of several datasets created as part of the Linkable Open Data Environment (LODE). The LODE is an initiative that aims at enhancing the use and harmonization of open data from authoritative sources by providing a collection of datasets released under a single licence, as well as open-source code to link these datasets together. Access to the LODE datasets and code are available through the Statistics Canada website and can be found at The Linkable Open Data Environment.
Multiple data sources were used to create the ODEF. The data providers, which include multiple levels of government, are provided in the Supplementary material as Table 1, including attribution to each data sources as per the licence requirements. Where applicable, licence versions are also shown. For further information on the individual licences, users should consult directly with the information provided on the open data portals of the various data providers. In addition to openly licensed databases, the ODEF also includes a set of publicly available listings of educational facilities for which permission to include was granted by the data providers.
With the inclusion of the OLMS variable for Version 2.1 of the ODEF, all sources for OLMS information are included in Table 2 in the Supplemental material. For each province and territory where multiple data sources on OLMS status were found, one primary data source was chosen that had the greatest number of records and useful attributes such as grade levels and address information.
In addition to the primary sources listed in Table 2, validation was done by comparing lists to the webpages of official minority language school boards. This led to the addition of a small number of facilities that had been missing from the original data sources. The supplementary sources used are listed in Table 3 in the Supplemental material.
The supplementary material lists either the update frequency or the date each underlying dataset was last updated by the provider (when known), as well as the date each dataset used in the ODEF was downloaded or provided by the data owner. Data were gathered between August 2019 and March 2021 for the ODEF data, and from November 2021 to March 2022 for the OLMS status. Users are cautioned that the download date should not be used to indicate the reference period of the data. If specific information concerning the reference period of data is required, users should contact the appropriate data providers.
An education facility is a physical site at which the primary activity is imparting instruction to a body of students or participants. All education facilities in Canada are in scope for this dataset. These include all levels of education, private and public schools with no exclusions for funding arrangement, operator type, subject area, denomination, student type, location, etc.
As a result of this definition, the database covers facilities such as early childhood education, kindergarten, elementary, secondary, and post-secondary institutions, and specific vocational training centres (such as hairdressing schools). The database does not include virtual educational institutions.
For the OLMS status the target population is restricted to public K-12 official language minority schools. This may include both traditional schools and alternative schools if they are controlled by official language minority school boards or authorities.
Only minimal editing of the original datasets was performed. As work on the experimental ODEF progresses, definitions and thresholds will evolve. Users are reminded that unedited data can be obtained directly from the open data portals or from the various data providers.
The primary processing component for the database comprised reformatting the source data to CSV format and mapping the original dataset attributes to standard variable (column) names. A data dictionary of the variables used is provided in section 6. Data dictionary. To compile the data into a single database, the following was done:
The original data files and fields were converted to standard formats and fields using the custom software OpenTabulate. A limited number of entries were manually edited when it was clear that the parsing had not been done correctly. An example is addresses with hyphenated numbers such as "1035-55 street nw", which may have been interpreted as having a civic number of "1035-55" and a street name of "street nw", rather than a civic number of 1035, and a street name of "55 street nw". While effort was made to ensure that the data is correct, it is possible that the scripts used to process and parse the addresses may unintentionally cause other, undetected, errors. Should any such errors be reported, they will be corrected in future versions of the ODEF.
In general, the data included in the ODEF is what is available from the original sources without imputation. The exception to this is the geocoding of entries missing coordinates, and the imputation of CSD names and ISCED levels, discussed below.
In version 2 of the ODEF, the unique identifier has been changed from an integer to a hash computed from the facility name, address, and source id (if available) of the record.
Records that did not include geocoordinates from the source were geocoded using the ESRI ArcGIS Online (AGOL) geocoder and the OpenStreetMap Nominatim geocoder. The AGOL geocoder returns coordinates, as well as a score and a geocoding type. Only records with a score above 90 and with address type indicating the coordinates were either an address, subaddress, point of interest, or intersection were retained for the final database. Records that could not be geocoded to the level of precision described above were then passed to the Nominatim geocoder. Schools were searched for using school names, city, and province, and were kept if the returned school name was a close match to the original school name. The Geo_Source column indicates if the coordinates of a record were provided by the source or if they were geocoded.
The original data sources use a variety of standards, classifications and nomenclature to describe the education level or grade range. The ODEF uses the International Standard Classification of Education (ISCED) to provide a standard definition of an education level. This required the conversion of a facility's grade range or education level to a corresponding ISCED level.
ISCED levels were derived from the grade range indicated by the data provider if available. Otherwise, education level was converted to a grade range, which was then mapped to ISCED levels. Entries in the original data that did not contain education level information were not assigned to ISCED levels and so these fields are blank in the ODEF.
Table 1 shows the direct mapping of ISCED levels from grade ranges and Table 2 shows the grade ranges in an education level by province and territory. It should be noted that the definition of "kindergarten" (K) as an education level label varies by providers as some of these schools support early childhood education. To avoid false positives, facilities that indicate support for pre-elementary students, as described by an education level string (not a grade range string), were not assigned values for the ISCED010 column. For example, Early Childcare Services in Alberta includes Kindergarten and may also include services for younger children, but was only mapped to ISCED020. Despite some of these facilities supporting childhood education, the notion of pre-elementary appears to vary between data providers and schools. This is shown in Table 2 with the assignment of "pre-elementary" to kindergarten when converted to a grade range.
Variable | Name | ISCED level | Grade range |
---|---|---|---|
Early childhood education | ISCED010 | 010 | Pre-K |
Kindergarten | ISCED020 | 020 | K |
Elementary | ISCED1 | 1 | 1-6 |
Junior secondary | ISCED2 | 2 | 7-9 |
Senior secondary | ISCED3 | 3 | 10-12 |
Post-secondary | ISCED4+ | 4+ | - |
Province / Territory | Pre-elementary / kindergarten | Elementary / primary | Junior high / middle | Senior high |
---|---|---|---|---|
Newfoundland and Labrador,
Prince Edward Island, Nova Scotia, Alberta, Northwest Territories, Nunavut |
K | 1-6 | 7-9 | 10-12 |
New Brunswick | K | 1-5 | 6-8 | 9-12 |
Quebec | K | 1-6 | 7-11 | |
Ontario | K | 1-8 | 9-12 | |
Manitoba | K | 1-4 | 5-8 | 9-12 |
Saskatchewan | K | 1-5 | 6-9 | 10-12 |
British Columbia,
Yukon |
K | 1-7 | 8-12 |
Census subdivision (CSD)Footnote 3 names were derived from geographic coordinates, namely latitude and longitude. These are placed into the corresponding CSDs by linking the coordinate points to the CSD polygons through a spatial join operation using the Python package GeoPandas.Footnote 4
The provided institution type (e.g., public, private, etc.) was used as stated in the source data set without further reinterpretation, reassignment or mapping to a uniform classification. In comparison with the use of ISCED to standardize education levels, there is no known standard for institution type. When the data source did not have a type column but the data source itself was for a particular type (e.g., a file of public schools or a file of Private schools), then the facility type was set manually.
Due to the different standards adopted in the original data, steps taken to standardize the data were liable to produce errors. The key principles of the methodology used were the avoidance of false positives and of significant alterations to the data. The methodology and limitations of each technique are described below. Trivial cleaning techniques, such as removal of whitespace characters and punctuation removal, are omitted from discussion.
The libpostal address parser, an open-source natural language processing solution to parsing addresses, was used to split concatenated address strings into strings corresponding to address variables, such as street name and street number. Occasionally, addresses were split incorrectly due to unconventional formatting of the original address. While effort was made to identify and correct these entries in the final database, some incorrectly parsed entries may have remained undetected. Exceptions are entries with street numbers of the form of two numbers separated by a hyphen or space. Entries of this form usually indicate that the address parser incorrectly parsed a numbered street name (e.g., "123 100 ave" is parsed into the street number "123 100" and the street name "ave", or else that a unit has not been identified correctly (as in "3-100 main st"). Numbers of this form are automatically separated, where the right most number is prepended to the street name if the street name is a variant of the word "street" or "avenue."
For OLMS entries where only a P.O. box address was provided, these addresses were removed and replaced with the civic addresses, which were found through manual web searches.
Finally, a limited number of entries that were not parsed correctly were identified by manual inspection and corrected.
The removal of duplicates was done using the Record Linkage Toolkit package in Python, where Levenshtein and Cosine distances were computed on name and address fields for facilities within the same CSD. Record pairs with string similarity metrics above 0.9 were flagged for inspection and removed if they were determined to be duplicates.
For OLMS entries, record pairs were manually inspected to determine whether the matches indicated true or false duplicates. Using web searches to compare names and addresses between the matched pairs and in some cases, ground-truthing with mapping sites, most record pairs were identified as false duplicates. In addition, several pairs were identified as belonging to the same school but covering different grade ranges – these were indicated separately. In the end, only entries that seemed to be clear duplicates (very similar names, addresses, and equal grade information) were chosen for removal, or facilities with exact matches on names and address information.
This data dictionary below describes the variables of the ODEF.
All education facility data in the ODEF were collected from government data sources, either from open data portals or otherwise public webpages. In general, other than the processing required to harmonize the different sources into one database, the underlying datasets were taken "as is."
A few exceptions apply to OLMS entries. Some entries that did not appear in the original data sources used were added after comparing them to the webpages of official language minority school boards. When schools were missing information such as address or school board, this was filled in through manual searches.
Imputation of ISCED levels is done conservatively to avoid false positives. Consequently, the percentages of ISCED levels with a non-empty value differ by level.
Natural language processing methods are used to do the parsing and separation of address strings into address variables, such as street number and postal code. The methods are reputable for performance and accuracy, but as with all statistical learning methods, they have limitations as well. Poor or unconventional formatting of addresses may result in incorrect parsing. At this stage, no further integration with other address sources was attempted; hence, although address records are generally expected to be correct, residual errors may be present in the current version of the database.
Finally, it should be noted that facility type, which discerns public, private, and other types of institutions, has different interpretations by province and data provider. For example, religious schools may be publicly funded in one jurisdiction but not in another.
The LODE open databases are modelled on ongoing improvement. To provide information on additions, updates, corrections or omissions, or for more information, please contact us at statcan.lode-ecdo.statcan@statcan.gc.ca. Please include the title of the open database in the subject line of the email.
Geography | Month | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
201806 | 201807 | 201808 | 201809 | 201810 | 201811 | 201812 | 201901 | 201902 | 201903 | 201904 | 201905 | 201906 | |
percentage | |||||||||||||
Canada | 0.52 | 0.54 | 0.51 | 0.57 | 0.60 | 0.63 | 0.63 | 0.69 | 0.63 | 0.57 | 0.54 | 0.56 | 0.58 |
Newfoundland and Labrador | 1.24 | 1.50 | 1.48 | 1.27 | 1.53 | 1.25 | 1.35 | 2.14 | 1.84 | 2.36 | 2.04 | 2.34 | 1.75 |
Prince Edward Island | 1.24 | 4.50 | 5.89 | 6.16 | 5.03 | 4.16 | 3.46 | 3.11 | 2.65 | 3.37 | 3.12 | 0.57 | 1.46 |
Nova Scotia | 3.64 | 3.66 | 2.04 | 2.20 | 2.76 | 4.16 | 2.49 | 2.42 | 3.49 | 3.37 | 2.42 | 2.95 | 2.71 |
New Brunswick | 2.33 | 2.95 | 1.59 | 1.43 | 1.46 | 1.41 | 1.48 | 1.66 | 1.18 | 1.78 | 1.96 | 2.11 | 2.21 |
Quebec | 1.10 | 1.10 | 1.00 | 1.21 | 1.20 | 1.33 | 1.17 | 1.21 | 1.14 | 1.01 | 1.26 | 1.08 | 1.48 |
Ontario | 0.96 | 1.00 | 0.96 | 0.96 | 1.05 | 1.10 | 1.15 | 1.29 | 1.11 | 1.00 | 0.93 | 0.97 | 0.95 |
Manitoba | 1.71 | 1.81 | 1.52 | 2.19 | 2.29 | 1.94 | 2.09 | 2.03 | 1.76 | 1.58 | 1.68 | 1.49 | 1.48 |
Saskatchewan | 1.20 | 1.39 | 1.37 | 1.58 | 1.61 | 1.34 | 1.29 | 1.74 | 2.34 | 1.74 | 1.59 | 1.74 | 1.63 |
Alberta | 0.88 | 1.04 | 1.03 | 1.89 | 1.79 | 1.73 | 1.72 | 2.01 | 1.80 | 1.81 | 1.25 | 1.47 | 1.30 |
British Columbia | 1.33 | 1.43 | 1.41 | 1.42 | 1.48 | 1.60 | 1.64 | 1.66 | 1.68 | 1.49 | 1.52 | 1.59 | 1.63 |
Yukon Territory | 4.85 | 4.31 | 3.06 | 3.67 | 4.59 | 4.39 | 4.18 | 3.78 | 3.69 | 3.65 | 3.09 | 3.37 | 4.49 |
Northwest Territories | 1.32 | 1.23 | 0.88 | 0.66 | 0.89 | 0.97 | 0.89 | 0.85 | 0.73 | 1.03 | 0.80 | 1.01 | 0.88 |
Nunavut | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Geography | Month | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
201806 | 201807 | 201808 | 201809 | 201810 | 201811 | 201812 | 201901 | 201902 | 201903 | 201904 | 201905 | 201906 | |
percentage | |||||||||||||
Canada | 0.6 | 0.7 | 0.7 | 0.7 | 0.6 | 0.5 | 0.7 | 0.8 | 0.6 | 0.5 | 0.6 | 0.6 | 0.6 |
Newfoundland and Labrador | 0.5 | 0.5 | 0.3 | 0.2 | 0.5 | 0.4 | 0.3 | 0.6 | 0.5 | 0.2 | 0.4 | 0.5 | 0.1 |
Prince Edward Island | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Nova Scotia | 1.2 | 1.8 | 1.9 | 2.4 | 2.5 | 1.8 | 5.4 | 4.6 | 2.3 | 2.0 | 2.9 | 1.9 | 1.8 |
New Brunswick | 2.0 | 2.0 | 4.9 | 3.0 | 2.4 | 3.3 | 1.3 | 1.1 | 0.8 | 1.1 | 1.0 | 1.4 | 1.5 |
Quebec | 1.6 | 1.9 | 1.9 | 1.8 | 1.3 | 1.5 | 1.3 | 1.9 | 1.3 | 1.5 | 1.7 | 1.4 | 1.4 |
Ontario | 0.9 | 0.9 | 1.0 | 0.9 | 0.9 | 0.8 | 1.1 | 1.3 | 0.9 | 0.8 | 0.9 | 1.0 | 0.9 |
Manitoba | 0.9 | 1.0 | 1.1 | 0.9 | 2.1 | 1.4 | 2.0 | 1.2 | 0.6 | 0.9 | 0.9 | 3.4 | 0.7 |
Saskatchewan | 0.4 | 0.7 | 0.6 | 0.4 | 0.8 | 0.5 | 0.9 | 0.6 | 0.3 | 0.3 | 0.3 | 0.7 | 0.4 |
Alberta | 1.3 | 1.7 | 1.7 | 2.1 | 1.4 | 1.7 | 1.6 | 1.0 | 1.2 | 1.1 | 1.2 | 1.4 | 0.9 |
British Columbia | 1.8 | 2.3 | 1.4 | 1.8 | 1.6 | 1.4 | 1.7 | 2.2 | 1.4 | 1.0 | 1.3 | 1.4 | 1.2 |
Yukon Territory | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Northwest Territories | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Nunavut | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Month | Sales of goods manufactured | Raw materials and components inventories | Goods / work in process inventories | Finished goods manufactured inventories | Unfilled Orders |
---|---|---|---|---|---|
% | |||||
June 2018 | 0.59 | 0.96 | 1.38 | 1.41 | 1.21 |
July 2018 | 0.64 | 0.96 | 1.31 | 1.37 | 1.18 |
August 2018 | 0.61 | 0.91 | 1.25 | 1.36 | 1.13 |
September 2018 | 0.59 | 0.88 | 1.25 | 1.23 | 1.13 |
October 2018 | 0.57 | 0.93 | 1.22 | 1.26 | 1.15 |
November 2018 | 0.59 | 0.89 | 1.24 | 1.24 | 1.18 |
December 2018 | 0.59 | 0.94 | 1.23 | 1.34 | 1.13 |
January 2019 | 0.60 | 0.94 | 1.21 | 1.29 | 1.26 |
February 2019 | 0.62 | 0.93 | 1.22 | 1.26 | 1.13 |
March 2019 | 0.59 | 0.94 | 1.22 | 1.32 | 1.11 |
April 2019 | 0.58 | 0.96 | 1.20 | 1.33 | 1.13 |
May 2019 | 0.60 | 0.94 | 1.20 | 1.31 | 1.07 |
June 2019 | 0.57 | 0.95 | 1.16 | 1.35 | 1.13 |
Characteristics | Data source | |
---|---|---|
Response or edited | Imputed | |
% | ||
Sales of goods manufactured | 92.4 | 7.6 |
Raw materials and components | 84.9 | 15.1 |
Goods / work in process | 88.2 | 11.8 |
Finished goods manufactured | 83.7 | 16.3 |
Unfilled Orders | 93.1 | 6.9 |
Capacity utilization rates | 76.9 | 23.1 |