Text Classification of Public Service Job Advertisements

By: Dominic Demers and Jessica Lachance, Public Service Commission of Canada

Introduction

The Public Service Commission (PSC) is an independent agency mandated to promote and safeguard a non-partisan, merit based public service that is representative of all Canadians. Among its many responsibilities, the PSC also oversees over 50,000 hiring activities that fall under the Public Service Employment Act (PSEA), each year.

This rich data environment includes over a million resumes and 8,000 job advertisements yearly. Some of the data are structured, like the organization name or the position's group and level. But, most human resource (HR) data collected by the PSC is unstructured. The unstructured data, such as job advertisements or screening questions, can be used for analytical purposes.

The PSC's Data Services and Analysis Division is responsible for data requests, statistical studies, surveys, forecasting models and data visualizations for staffing and recruitment activities that fall under the PSEA.

This article will give an overview of two natural language processing (NLP) techniques used by our team to extract valuable insights from two open-ended fields – Educational Requirements and the Area of Selection variables. We'll also explain how they were subsequently used to feed the Applications Forecasting Tool, a data visualization tool that reports on job advertisements.

Applications Forecasting Tool

In 2019, the PSC developed the Applications Forecasting Tool to help managers and HR advisors within the Government of Canada prepare for selection processes. Users can select the characteristics of a job advertisement and get an estimate on the number of candidates, based on similar jobs that were previously advertised.

The first version of the tool only worked with structured data from the job advertisement. But, the PSC received feedback about two open-ended fields users wanted to use to obtain a better estimate of the number of candidates for their selection process. These fields included the level of education in the essential qualifications; and for internal processes, details about the Area of Selection such as the department, location or classification.

As such, the PSC used text classification techniques for the education and Area of Selection fields to structure the information into categories that fed into the Applications Forecasting Tool. These algorithms enabled more precise and useful reporting capabilities for the PSC.

Text Classification

Text classification is a subset of problems that fall under NLP. The goal of text classification is to take open-ended fields and assign each text a label from a limited set of options.

In our case, we explored two different models to reach our goal. For the education variable, we used a rules-based approach using regular expressions. For the Area of Selection, we used a machine learning based approach called Name-entity-recognition (NER).

Although text classification using any model can produce good results, the capability of the algorithm to extract information from text is not always reliable. As such, we had to evaluate the algorithm's efficacy in extracting the correct information. We evaluated the model using a test dataset and examined metrics to determine how the classifier performed.

Evaluating text classification models

To evaluate the performance of our text classification algorithms, we used a confusion matrix. A confusion matrix is a table that describes the performance of the classification model on a set of test data for which the true values are known.

The number of correct and incorrect predictions are summarized in a table, and include count values. It also summarizes the number of errors made by our classifier and, most importantly, error type.

The confusion matrix is comprised of four types of predicted and actual value combinations. In our text classification context, the algorithm will provide a "true" (or "positive") value when the text is predicted as part of the classification. For example, if the text is classified as "high school diploma" it will return "true" (or "positive") for this classification.

The four categories are described below.

Figure 1: Confusion Matrix

Figure 1: Confusion Matrix
Description - Figure 1: Confusion Matrix

Quadrant diagram with four combinations of predicted and actual values.
Positive Predicted Value + Positive Actual Value = True Positive
Positive Predicted Value + Negative Actual Value = False Positive
Negative Predicted Value + Positive Actual Value = False Negative
Negative Predicted Value + Negative Actual Value = True Negative

The True Positive (TP) Combination: The classification is predicted as true and is correct.
The True Negative (TN) Combination: The classification is predicted as false and is correct.
The False Positive (FP) or Type 1 Error: The classification is predicted as true but is incorrect.
The False Negative (FN) or Type 2 Error: The classification is predicted as false but is incorrect.

Using these combinations, we derived the following performance metrics:

  • Accuracy: The percentage of texts that were categorized with the correct classification. Used to determine how many classifications the model got right.
  • Precision TPTP+FP: The percentage of texts correctly classified out of the total number of texts classified as positive. Used to determine the proportion of positive identification that were correct.
  • Recall TPTP+FN: The percentage of actual positive values that are predicted correctly. Used to determine the proportion of actual positives that were correctly identified.
  • F1 Score: The harmonic mean of precision and recall.

In the context of this article, these statistics will be used to evaluate the performance of classifying two variables – Educational Requirement and the Area of Selection.

Educational Requirement field

In accordance with the PSEA, the Treasury Board Secretariat established Qualification standards for the core public administration by occupation group or classification. The qualification standards give the minimum educational requirements for each occupational group. Job advertisements for positions under the PSEA must include this merit criteria.

Managers generally use the qualification standard as their essential requirement. But they have the ability to set higher educational levels when required. For example, a hiring manager might require that an EC-06 senior policy analyst have a masters degree, even though the minimum requirement is a bachelors degree.

We might expect less candidates that have a masters instead of a bachelors. Parsing the level of education would allow us to give users of the Applications Forecasting Tool more relevant estimates and historical job advertisements.

Method

There are just over 100 qualification standards across all occupational groups which are also written in natural language. We decided that these standards could be summarized as belonging to one of eight education levels:

  • Some high school
  • High school
  • Some post-secondary
  • Post-secondary
  • Professional degree (e.g. Law degree, medical degree)
  • Master's degree
  • Ph.D. or above
  • Education unknown/not listed

To label the job advertisements according to education level, we used regular expressions to find key phrases and then apply the label. Regular expressions are a sequence of characters that specify a pattern in text. To analyze the education level we:

  • found key phrases, using regular expressions, which signal a type of education
  • mapped these phrases to a common level
  • labeled the education requirements text with one of these common levels

In total, we used 30 different rules to map the job descriptions to the eight education levels. These rules were created manually, using an iterative process. We started with regular expressions that capture the sentence structure and key phrases used in many qualification standards. Then, we added additional rules to capture cases which did not follow the qualification standards.

Here's a visual representation of what this looks like:

Figure 2: Educational requirement classification

Figure 2: Educational requirement classification

Description - Figure 2: Educational requirement classification

**Please read section *Other information*

EDUCATION – COMMON TO ALL STREAMS

Successful completion of two years of an acceptable post-secondary educational program in computer science, information technology, information management or other specialty relevant to the position to be staffed.

-Indeterminate period incumbents of positions in the CS group on May 10, 1999, who do not possess the education prescribed above, are deemed to meet the minimum education standards based on their education, training and/or experience; They must be accepted as having met the minimum education standard whenever this standard is called for when staffing positions in the CS group.

-It is a recognized educational institution (e.g., community college, CÉGEP or university) that determines if the courses taken by a candidate correspond to two years of a post-secondary program at the institution.

IMPORTANT:

-It is the responsibility of the candidates to provide proof od their education. Note that your original diploma will be required during the process;

-Candidates with foreign educational credentials are required to provide proof of Canada equivalency. Please consult the Canadian Information Centre for International Credentials for further information. Any applicable fees are the responsibility of the candidate. Candidates who are unable to provide proof that they meet this essential qualification as requested will be eliminated from the process.

…years of post-secondary…

Some post-secondary

In this image, the first section represents our input. The segment highlighted in green states the relevant portion of the text related to the educational requirement. "Successful completion of two year of an acceptable post-secondary educational program in computer science, information technology, information management or other speciality relevant to the position to be staffed".

Then the second block represents the rule which was applied to the text using regular expressions. The text was flagged containing the phrase "… years of … post-secondary".

This flag, and the absence of a flag from a higher qualification (e.g. "degree", "doctorate") means this job advertisement was labelled as having the level of education as "Some post-secondary".

Model evaluation

To evaluate the model, we extracted a sample of 1,000 advertisements from the 2019-2020 Fiscal Year and manually labelled the level of education. The table below presents the precision, recall and f1-score of our rules-based algorithm, for each of the eight levels of education.

Table 1: Educational requirements model evaluation results
  Sample size Precision Recall F1-score
Education level unknown/not listed 45 97.7% 95.6% 96.6%
Some high school 30 100.0% 100.0% 100.0%
High school 418 99.3% 98.3% 98.8%
Some post-secondary 72 94.4% 94.4% 94.4%
Post-secondary 391 96.0% 97.7% 96.8%
Professional degree 17 100.0% 88.2% 93.8%
Master's degree 17 83.3% 88.2% 85.7%
Ph. D or above 10 100.0% 90.0% 94.7%

Results

We applied the algorithm to a total of 18,055 job advertisements between April 1, 2016 and March 31, 2019. The following table provides a breakdown of the EX-01 job advertisements, by the level of education derived from the algorithm. As shown below, the vast majority require either a high school education or a post-secondary education.

Table 2: Educational requirement for EX-01 positions (April 1, 2016, to March 31, 2019)
Educational Requirement Number of Job Advertisements %  Total
Post-secondary 676 83%
Master's degree 81 10%
Some post-secondary 27 3%
Education level unknown/not listed 16 2%
High school 13 2%
Professional degree 2 0%
Total 815 100%

Using this methodology, when accessing the AFT to estimate the number of job applications, users can filter results on this new education field. For instance, since April 1, 2015, 921 EX-01 jobs were advertised with a median of 30 applicants. Out of those advertisements, 806 required a post-secondary degree and had a median of 32 applicants.

Area of Selection field section

Background

In accordance with PSEA article 34 (1), for the purpose of eligibility in an appointment process, an organisation may limit the Area of Selection for internal job processes by establishing geographic, organizational, or occupational criteria. This restriction is written in the "Who can apply" field of a job advertisement.

Having a restricted Area of Selection will reduce the pool of potential applicants. Users of the Applications Forecasting tool wanted to know how many applicants they could expect if they only limited their Area of Selection to at-level employees in their department, as oppose to all public servants in Canada.

Method

Our objective was to parse the Area of Selection field to extract the department(s), location(s), and level(s) mentioned by using a technique called name-entity recognition (NER). An NER model is an NLP technique that identifies “entities” in a block of text, such as proper nouns (a person’s name, a country) or category of things (animals, vehicles).

In our case, the entities extracted are

  • organizations (e.g. “Transport Canada”, “the Federal Public Service”),
  • locations (e.g. “Canada”, “Atlantic Region”, “a 40 km radius of Winnipeg, MB”)
  • occupation classifications (e.g. “EC-04”, “EX-01”)

To apply the NER model we used spaCy, a free, open-source library used for advanced NLP in Python.

SpaCy's NER algorithm includes the entities “ORG” (organization), “LOC” (location) and “GPE” (Geopolitical).

To reduce the amount of manual tagging, we took an iterative approach to building our training set. First, we used SpaCy's default algorithm to tag a random sample of 1000 Area of Selections. Then, we made the following changes:

  1. Merged the “LOC” and “GPE” tags into one “LOC” tag
  2. Added a “LEVEL” tag which identifies occupational classifications
  3. Corrected any other issues with the “ORG” and “LOC” tags

Building off this, we created an additional 200 training examples, which were targeted to include additional examples of the “LEVEL” tag, and other cases the initial algorithm consistently failed to identify.

With the training set ready, the SpaCy NER algorithm performs the following tasks:

  1. Creates a prediction model using a portion of the labeled training data
  2. Sends an unlabeled version of another portion of the training data to model and predicts the entities
  3. Compares predicted labels to true labels
  4. Updates model to account for incorrect labels. The amount of change between models is called the gradient.
  5. Repeat until gradient is small and model predictions change very little between iterations

This process resulted in a final model that can identify the different criteria in an Area of Selection. The following image illustrates an example of the tagging the model performed:

Figure 3: Area of Selection classification

Figure 3: Area of Selection classification
Description - Figure 3: Area of Selection classification

Employees of the public service at the PM-04 or an equivalent classification who occupy a position within 40km of Edmonton, Alberta. Employees of the public service ORG at the PM-04 LEVEL or an equivalent classification who occupy a position within 40km of Edmonton, Alberta LOC.

At the top of the image, we have the complete text of the Area of Selection, then at the bottom of the image, we have our three “entities” highlighted. “the public service” is labelled as ORG, “PM-04” is labelled as LEVEL and “within 40km of Edmonton, Alberta” is labelled “LOC”

Model evaluation

We evaluated the model using a random sample of 465 Area of Selection statements which we manually labeled. The following table shows the precision and recall scores for each entity typeFootnote 1

 
Entity tag Precision Recall F1-score
ORG 92.6% 90.8% 91.7%
LOC 80.2% 74.9% 77.5%
LEVEL 95.0% 76.0% 84.4%

Results

Using the results of the model, we produced the following exploratory analysis. This analysis is based on of 13,362 internal job postings between April 1, 2016 and March 31, 2019.

Figure 4: Venn diagram of Area of Selection field, by organization, occupational group and geography

Figure 4: Venn diagram of Area of Selection field, by organization, occupational group and geography
Description - Figure 4: Venn diagram of Area of Selection field, by organization, occupational group and geography

Venn diagram of an Area of Selection field split into three.

Organizational (Dep't.) = 6.6% Organizational & Occupational share 0.4%
Occupational = 1.6% Occupational & Geographic share 2.2%
Geographic = 41.5% Geographic & Organizational share 37.9%
All three share 0.9% Open area of selection = 8.9%

What we found is that most internal advertisements chose to use at least one of the filters outlined in the PSEA and that most of the areas of selection with a geographic filter were for “Persons employed by the Public Service occupying a position in the National Capital Region (NCR)”.

However, we realized that some areas of selection proved to be harder to parse. These included:

1) Employees of Transport Canada who occupy a position in Calgary, Edmonton, Saskatoon, Winnipeg, Whitehorse, Yellowknife or Churchill.

2) Should an insufficient number of applicants be identified, persons employed in the Public Service, who occupy a position within 40km of Winnipeg, Manitoba or within 40km of Edmonton or Calgary, Alberta may be considered without re-advertising. Therefore, applicants in this expanded area of selection are encouraged to apply.

Our model performed well, but due to multi-criteria areas, we decided to use our analysis with a broader set of categories. Previously in the Applications Forecasting Tool, users could only select “internal job advertisement” or “external job advertisement”. Now, users have more precision for internal job advertisements. They can select:

  • Internal job advertisements, open to all public servants
  • Internal job advertisements, open to public servants in the NCR
  • Internal job advertisement, other areas of selection

This addition improved our model and allowed users to search a narrower set of advertisements to find any that matched their intended selection process.

Conclusion

Open-ended fields are a valuable way of collecting information and shouldn't be excluded from forms or surveys. It allows for a catch-all response when questions don't allow for users to give information within a fixed set of choices.


But this flexibility will come at the cost of accuracy of the classifications. Classification systems can generate the right predictions (true positives and true negatives), but can also make the wrong ones (false positives, false negatives). Cross validating the performance of your algorithm will be essential in determining if the classifications are sufficiently accurate for your reporting purposes.

This article showed methods to structure information from open-ended fields for reporting purposes in the Application Forecasting Tool. The categories derived from the area of selection and level of education fields were used to populate to drop-down menus allowing users to fine-tune their search results.

You're encouraged to visit the Application Forecasting Tool, or our other data visualization tools on the PSC's Data Visualization Hub.

Register for the Data Science Network's Meet the Data Scientist Presentation

If you have any questions about this article or would like to discuss this further, we invite you to our new Meet the Data Scientist presentation series where the author will be presenting this topic to DSN readers and members.

Tuesday, October 18
2:00 to 3:00 p.m. EDT
MS Teams – link will be provided to the registrants by email

Register for the Data Science Network's Meet the Data Scientist Presentation. We hope to see you there!

Date modified:

Police-Reported Racialized and Indigenous Identity Statistics via the Uniform Crime Reporting Survey: Report and Draft Recommendations

September 1, 2022

Today, Statistics Canada released a report and draft recommendations about the collection of data on Indigenous and racialized identity through the Uniform Crime Reporting Survey.

This report builds on the increasing demand for better data on people's diverse experiences in the justice system and seeks to shed light on the different treatment and overrepresentation of Indigenous and racialized people in the Canadian criminal justice system.

The six draft recommendations included in this report were developed alongside community organizations, Indigenous groups and police services through a comprehensive engagement process that began in 2021. Statistics Canada looks forward to continuing to work with stakeholders to advance police-reported data on Indigenous and racialized groups.

"Disaggregated data are a crucial part of decision making. Today's recommendations are another step toward improving the collection and quality of data on Indigenous and racialized groups in official police-reported crime statistics," said Anil Arora, Chief Statistician of Canada. "We thank our partners, including the Canadian Association of Chiefs of Police, for helping to meet the information needs of the justice community."

This project is part of Statistics Canada's Disaggregated Data Action Plan, which will lead to detailed statistical information that highlights the experiences of specific population groups, such as women, Indigenous peoples, racialized populations and people living with disabilities.

Contact

Media Relations
Statistics Canada
STATCAN.mediahotline-ligneinfomedias.STATCAN@statcan.gc.ca

If necessary, please make address label corrections in the boxes below.

  • Legal name
  • Business name
  • Title of contact
  • First name of contact
  • Last name of contact
  • Address (number and street)
  • City
  • Province/territory or state
  • Country
  • Postal code/zip code
  • Language preference
    • English
    • French

This information is collected under the authority of the Statistics Act, Revised Statutes of Canada, 1985, Chapter S-19.

Completion of this questionnaire is a legal requirement under this act.

Survey purpose

To obtain information on the supply of and demand for energy in Canada. This information serves as an important indicator of Canadian economic performance, is used by all levels of government in establishing informed policies in the energy area and, in the case of public utilities, is used by governmental agencies to fulfil their regulatory responsibilities. The private sector likewise uses this information in the corporate decision-making process. Your information may also be used by Statistics Canada for other statistical and research purposes.

Confidentiality

Statistics Canada is prohibited by law from releasing any information it collects which could identify any person, business, or organization, unless consent has been given by the respondent or as permitted by the Statistics Act. Statistics Canada will use the information from this survey for statistical and research purposes.

Security of emails and faxes

Statistics Canada advises you that there could be a risk of disclosure during facsimile or e-mail. However, upon receipt, Statistics Canada will provide the guaranteed level of protection afforded to all information collected under the authority of the Statistics Act.

Data-sharing agreements

To reduce respondent burden, Statistics Canada has entered into data sharing agreements with provincial and territorial statistical agencies and other government organizations, which have agreed to keep the data confidential and use them only for statistical purposes.

Reporting instructions

Please refer to the reporting instruction before completing this report.

Reporting Period

  • Month
  • Year

Does this establishment ship or receive products by pipeline, tanker or barge?

  • Yes
  • No

Operations (Cubic Metres)

Instructions:

  1. For product definitions and detailed survey instructions please consult the provided references.
  2. All values should be reported in cubic metres. All values should be positive except for Column H (Losses and Adjustments During Month), which can be negative.
  3. Each bolded product line should be balanced. When products are balanced, Stocks Beginning of Month + Receipts + Production = Inputs + Shipments + Fuel Uses + Losses and Adjustments + Stocks End of Month. A warning will appear if the values are not balanced.
  4. The Losses and Adjustments column for each bolded product line should be under 5% of (Stocks Beginning of Month + Receipts + Production). A warning will appear if the value is too high.
  5. Total Inputs and Production (Line 79) should be within 5%
2023 Monthly Refined Petroleum Products
Item Description Stocks Beginning of Month Receipts During Month Inputs During Month Production During Month Shipments During Month Fuel Uses During Month Losses and Adjustments During Month Stocks End of Month
Crude oil, Total (including synthetic crude oil)                
Conventional crude oil - light                
Conventional crude oil - heavy                
Crude bitumen                
Lease condensate                
Synthetic crude oil                
Hydrogen                
Other hydrocarbons                
Renewable fuels, Total                
Fuel Ethanol (denatured)                
Biodiesel Fuel (FAME)                
Renewable Diesel Fuel (HDRD/HVO)                
Other renewable fuels                
Oxygenates (excluding fuel ethanol), Total                
Ethyl tertiary butyl ether (ETBE)                
Other oxygenates                
Hydrocarbon gas liquids, Total                
Ethane and ethylene, Total                
Ethane                
Ethylene                
Propane and propylene, Total                
Propane                
Propylene                
Normal butane and butylene, Total                
Normal butane                
Butylene                
Isobutane and isobutylene, Total                
Isobutane                
Isobutylene                
Pentanes plus                
Unfinished oils, Total (excluding synthetic crude oil)                
Naphthas and lighter                
Kerosene and light gas oils                
Heavy gas oils                
Residuum                
Finished motor gasoline, Total                
Blended with fuel ethanol                
Other finished motor gasoline                
Motor gasoline blending components, Total                
Blendstock for oxygenate blending (BOB)                
Gasoline treated as blendstock (GTAB)                
Other motor gasoline blending components                
Finished aviation gasoline                
Aviation gasoline blending components                
Special naphthas (solvents)                
Kerosene-type jet fuel                
Kerosene                
Distillate fuel oil, Total                
Diesel fuel oil                
Light fuel oil (No. 2 and No. 3 heating fuel oils)                
Residual fuel oil, Total (No. 4, No.5 and No. 6 fuel oils)                
Under 1.00% sulphur                
Over 1.00% sulphur                
Lubricants, Total                
Naphthenic                
Parafinic                
Asphalt                
Wax                
Petroleum coke                
Still gas                
Petrochemical feedstocks, Total                
Naphtha less than 205°C (401°F) end-point                
Other oils greater than or equal to 205°C (401°F) end-point                
Miscellaneous products                
Total                

Comments: Identify any unusual aspects of your reporting month's operations.

Retail Commodity Survey: CVs for Total Sales (June 2022)

Retail Commodity Survey: CVs for Total Sales May 2022
Table summary
This table displays the results of Retail Commodity Survey: CVs for Total Sales (May 2022). The information is grouped by NAPCS-CANADA (appearing as row headers), and Month (appearing as column headers).
NAPCS-CANADA Month
202203 202204 202205 202206
Total commodities, retail trade commissions and miscellaneous services 0.63 0.67 0.63 0.61
Retail Services (except commissions) [561] 0.62 0.67 0.63 0.61
Food at retail [56111] 1.00 0.94 0.56 0.53
Soft drinks and alcoholic beverages, at retail [56112] 0.64 0.63 0.59 0.60
Cannabis products, at retail [56113] 0.00 0.00 0.00 0.00
Clothing at retail [56121] 1.16 1.05 1.00 0.78
Footwear at retail [56122] 1.45 1.76 1.51 1.21
Jewellery and watches, luggage and briefcases, at retail [56123] 7.35 7.38 5.44 5.94
Home furniture, furnishings, housewares, appliances and electronics, at retail [56131] 1.25 1.14 1.31 0.99
Sporting and leisure products (except publications, audio and video recordings, and game software), at retail [56141] 2.16 2.09 1.60 1.89
Publications at retail [56142] 5.87 5.82 5.62 6.18
Audio and video recordings, and game software, at retail [56143] 0.49 0.62 0.31 1.07
Motor vehicles at retail [56151] 2.08 2.33 2.21 2.12
Recreational vehicles at retail [56152] 4.14 5.72 6.99 3.09
Motor vehicle parts, accessories and supplies, at retail [56153] 1.75 1.74 1.83 1.83
Automotive and household fuels, at retail [56161] 2.02 1.68 1.86 1.67
Home health products at retail [56171] 2.12 2.39 2.54 2.50
Infant care, personal and beauty products, at retail [56172] 2.22 2.07 1.97 2.18
Hardware, tools, renovation and lawn and garden products, at retail [56181] 2.17 2.81 1.60 2.41
Miscellaneous products at retail [56191] 2.08 3.02 3.12 2.93
Total retail trade commissions and miscellaneous services Footnote 1 2.04 1.66 1.84 1.91

Footnotes

Footnote 1

Comprises the following North American Product Classification System (NAPCS): 51411, 51412, 53112, 56211, 57111, 58111, 58121, 58122, 58131, 58141, 72332, 833111, 841, 85131 and 851511.

Return to footnote 1 referrer

National Travel Survey: C.V.s for Visit-Expenditures by Duration of Visit, Main Trip Purpose and Country or Region of Expenditures – Q1 2022

National Travel Survey: C.V.s for Visit-Expenditures by Duration of Visit, Main Trip Purpose and Country or Region of Expenditures, including expenditures at origin and those for air commercial transportation in Canada, in Thousands of Dollars (x 1,000)
Table summary
This table displays the results of C.V.s for Visit-Expenditures by Duration of Visit, Main Trip Purpose and Country or Region of Expenditures. The information is grouped by Duration of trip (appearing as row headers), Main Trip Purpose, Country or Region of Expenditures (Total, Canada, United States, Overseas) calculated using Visit-Expenditures in Thousands of Dollars (x 1,000) and c.v. as units of measure (appearing as column headers).
Duration of Visit Main Trip Purpose Country or Region of Expenditures
Total Canada United States Overseas
$ '000 C.V. $ '000 C.V. $ '000 C.V. $ '000 C.V.
Total Duration Total Main Trip Purpose 15,762,153 A 9,966,054 A 3,133,898 B 2,662,201 B
Holiday, leisure or recreation 9,147,889 A 4,806,951 A 2,418,158 C 1,922,780 B
Visit friends or relatives 3,568,020 A 2,664,614 A 355,216 B 548,190 C
Personal conference, convention or trade show 100,193 C 91,220 C 8,973 E ..  
Shopping, non-routine 802,844 B 733,256 B 67,373 E 2,215 E
Other personal reasons 802,281 B 668,406 B 56,424 D 77,451 E
Business conference, convention or trade show 491,550 C 262,687 D 140,442 D 88,421 E
Other business 849,376 B 738,920 B 87,312 D 23,144 E
Same-Day Total Main Trip Purpose 3,543,156 A 3,393,910 A 148,573 D 673 E
Holiday, leisure or recreation 1,238,924 B 1,169,446 B 68,805 E 673 E
Visit friends or relatives 863,552 B 859,969 B 3,583 E ..  
Personal conference, convention or trade show 20,198 D 20,198 D ..   ..  
Shopping, non-routine 723,154 B 656,331 B 66,823 E ..  
Other personal reasons 357,516 B 357,483 B 33 E ..  
Business conference, convention or trade show 58,726 E 58,419 E 307 E ..  
Other business 281,086 C 272,064 C 9,021 E ..  
Overnight Total Main Trip Purpose 12,218,997 A 6,572,144 A 2,985,325 B 2,661,528 B
Holiday, leisure or recreation 7,908,965 B 3,637,505 A 2,349,353 C 1,922,107 B
Visit friends or relatives 2,704,468 B 1,804,645 B 351,633 B 548,190 C
Personal conference, convention or trade show 79,995 D 71,022 D 8,973 E ..  
Shopping, non-routine 79,690 C 76,926 C 550 E 2,215 E
Other personal reasons 444,765 B 310,923 B 56,390 D 77,451 E
Business conference, convention or trade show 432,824 C 204,267 C 140,136 D 88,421 E
Other business 568,290 B 466,855 B 78,291 D 23,144 E
..
data not available

Estimates contained in this table have been assigned a letter to indicate their coefficient of variation (c.v.) (expressed as a percentage). The letter grades represent the following coefficients of variation:

A
c.v. between or equal to 0.00% and 5.00% and means Excellent.
B
c.v. between or equal to 5.01% and 15.00% and means Very good.
C
c.v. between or equal to 15.01% and 25.00% and means Good.
D
c.v. between or equal to 25.01% and 35.00% and means Acceptable.
E
c.v. greater than 35.00% and means Use with caution.

National Travel Survey: C.V.s for Person-Trips by Duration of Trip, Main Trip Purpose and Country or Region of Trip Destination – Q1 2022

National Travel Survey: C.V.s for Person-Trips by Duration of Trip, Main Trip Purpose and Country or Region of Trip Destination – Q1 2022
Table summary
This table displays the results of C.V.s for Person-Trips by Duration of Trip, Main Trip Purpose and Country or Region of Trip Destination. The information is grouped by Duration of trip (appearing as row headers), Main Trip Purpose, Country or Region of Trip Destination (Total, Canada, United States, Overseas) calculated using Person-Trips in Thousands (× 1,000) and C.V. as a units of measure (appearing as column headers).
Duration of Trip Main Trip Purpose Country or Region of Trip Destination
Total Canada United States Overseas
Person-Trips (x 1,000) C.V. Person-Trips (x 1,000) C.V. Person-Trips (x 1,000) C.V. Person-Trips (x 1,000) C.V.
Total Duration Total Main Trip Purpose 46,502 A 43,459 A 1,730 B 1,313 A
Holiday, leisure or recreation 17,289 A 15,499 A 880 B 910 A
Visit friends or relatives 18,749 A 18,009 A 402 B 339 B
Personal conference, convention or trade show 276 C 261 C 15 E ..  
Shopping, non-routine 3,250 B 3,047 B 202 E 1 E
Other personal reasons 3,694 B 3,616 B 47 D 31 D
Business conference, convention or trade show 444 D 366 D 54 D 24 E
Other business 2,801 B 2,661 B 131 D 8 E
Same-Day Total Main Trip Purpose 30,319 A 29,876 A 443 C ..  
Holiday, leisure or recreation 9,657 A 9,516 A 141 E ..  
Visit friends or relatives 12,074 A 12,044 A 31 E ..  
Personal conference, convention or trade show 151 C 151 C ..   ..  
Shopping, non-routine 3,065 B 2,865 B 200 E ..  
Other personal reasons 2,971 B 2,969 B 2 E ..  
Business conference, convention or trade show 203 E 198 E 4 E ..  
Other business 2,199 B 2,134 B 65 E ..  
Overnight Total Main Trip Purpose 16,183 A 13,583 A 1,288 A 1,313 A
Holiday, leisure or recreation 7,632 A 5,983 A 739 B 910 A
Visit friends or relatives 6,675 A 5,965 A 371 B 339 B
Personal conference, convention or trade show 125 C 110 C 15 E ..  
Shopping, non-routine 185 C 182 D 2 E 1 E
Other personal reasons 723 B 647 B 45 D 31 D
Business conference, convention or trade show 241 C 168 D 50 D 24 E
Other business 602 B 527 C 66 D 8 E
..
data not available

Estimates contained in this table have been assigned a letter to indicate their coefficient of variation (c.v.) (expressed as a percentage). The letter grades represent the following coefficients of variation:

A
c.v. between or equal to 0.00% and 5.00% and means Excellent
B
c.v. between or equal to 5.01% and 15.00% and means Very good.
C
c.v. between or equal to 15.01% and 25.00% and means Good.
D
c.v. between or equal to 25.01% and 35.00% and means Acceptable.
E
c.v. greater than 35.00% and means Use with caution.

Website modernization initiative: Text-to-speech feature

Opened: August 2022
Closed: January 2023
Results posted: March 2023

Consultative engagement objectives

Statistics Canada is piloting two text-to-speech (TTS) features on selected pages of its website. TTS is an assistive technology on the rise that provides users with another way to consume the content published online by making it audible.

We collected feedback to help gain an understanding of needs and expectations when using text to speech technology, the tools’ functionalities, and areas in need of improvements.

Consultation methodology

This consultative engagement initiative is now closed.

Statistics Canada conducted virtual group discussions in both official languages with organisations (government and non-government), academics and members of the general public of various age groups. Participants were asked to respond to a series of questions and provide feedback on the TTS features.

How participants got involved

Feedback was sought from visitors to the Statistics Canada website and data users including persons with visual impairments, learning disabilities, cognitive impairments and other accessibility requirements as well as accessibility organizations, multilingual language users, readers looking for convenience, lower literacy readers, and academics.

Also included were Statistics Canada employees, including management, accessibility, and diversity communities.

Results

The consultative engagement revealed that most participants recognised that this tool is and could be useful to Canadians except for those impacted by blindness or living with sight loss. It also provided valuable recommendations on how to improve the TTS feature to be more relevant and user-friendly.

Statistics Canada thanks participants for their involvement in this consultative engagement initiative. Their insights will guide the next phase of this pilot project and we will continue to consult with Canadians if further enhancements are being implemented.

Individuals who wish to obtain more information on this engagement initiative may contact us by email at consultativeengagement-mobilisationconsultative@statcan.gc.ca

National Travel Survey: Response Rate – Q1 2022

National Travel Survey: Response Rate – Q1 2022
Table summary
This table displays the results of Response Rate. The information is grouped by Province of residence (appearing as row headers), Unweighted and Weighted (appearing as column headers), calculated using percentage unit of measure (appearing as column headers).
Province of residence Unweighted Weighted
Percentage
Newfoundland and Labrador 23.8 21.9
Prince Edward Island 25.2 23.4
Nova Scotia 29.1 26.6
New Brunswick 28.0 24.6
Quebec 32.7 28.8
Ontario 30.7 28.6
Manitoba 32.0 28.9
Saskatchewan 29.4 26.6
Alberta 27.8 25.8
British Columbia 32.1 30.2
Canada 30.1 28.4

Quarterly Survey of Financial Statements: Weighted Asset Response Rate - second quarter 2022

Weighted Asset Response Rate
Table summary
This table displays the results of Weighted Asset Response Rate. The information is grouped by Release date (appearing as row headers), 2020, Q2, Q3, and Q4, and 2021, Q1 and Q2 calculated using percentage units of measure (appearing as column headers).
Release date 2021 2022
Q2 Q3 Q4 Q1 Q2
quarterly (percentage)
August 25, 2022 80.7 79.0 80.9 75.0 55.7
May 25, 2022 80.7 79.0 77.3 56.7 ..
February 23, 2022 77.2 75.6 54.2 .. ..
November 23, 2021 74.5 56.7 .. .. ..
August 24, 2021 60.9 .. .. .. ..
.. not available for a specific reference period
Source: Quarterly Survey of Financial Statements (2501)