Canadian Health Measures Survey - Cycle 6 (2018-2019) Data Accuracy: Average of the measured body mass index

Canadian Health Measures Survey - Cycle 6 (2018-2019) - full sample
Average of the measured body mass index (kg/m2)
  Average(kg/m2) c.v.(%)
ages 3 to 5
Both sexes 15.86 0.6
ages 6 to 11
Males 17.37 1.7
Females 17.17 1.7
ages 12 to 19
Males 23.40 4.2
Females 21.93 1.4
ages 20 to 39
Males 26.93 2.0
Females 25.26 2.9
ages 40 to 59
Males 28.42 2.6
Females 27.05 2.4
ages 60 to 79
Males 27.89 3.1
Females 27.83 1.6

Data Science Network for the Federal Public Service (DSNFPS)

The information in these articles is provided 'as-is' and Statistics Canada makes no warranty, either expressed or implied, including but not limited to, warranties of merchantability and fitness for a particular purpose. In no event will Statistics Canada be liable for any direct, special, indirect, consequential or other damages, however caused.

Recent articles

Applying Semi-Supervised Machine Learning Classification to Anomaly Detection Exercises: The Case of Sensor Data

Topics covered in this article: Computer vision, Predictive analytics

Applying Semi-Supervised Machine Learning Classification to Anomaly Detection Exercises: The Case of Sensor Data

Source: Courtesy of Housing, Infrastructure and Communities Canada

Discover how the Data Science and Major Bridges and Projects teams at the Federal Government of Canada’s Department of Housing, Infrastructure and Communities (HICC) are revolutionizing bridge health monitoring. This article presents their innovative project that leveraged Machine Learning (ML) to filter noise from sensor data and enable precise monitoring of structural components. By combining ML classification with expert knowledge, the team successfully identified over 714,000 anomalies, and the results contributed to enhancing traditional monitoring methods. Learn about the techniques used and the significant impact on bridge maintenance. Don’t miss this insightful read on the future of infrastructure management!

Continue reading: Applying Semi-Supervised Machine Learning Classification to Anomaly Detection Exercises: The Case of Sensor Data


Applying Random Forest Algorithms to Enhance Expenditure Predictions in Government Grants and Contributions Programs

Topics covered in this article: Ethics and responsible machine learning

Applying Random Forest Algorithms to Enhance Expenditure Predictions in Government Grants and Contributions Programs

This article showcases the successful collaboration between the Data Science team and the Grants and Contributions Centre of Expertise (GCCOE) at Housing, Infrastructure, and Communities Canada (HICC). Leveraging Machine Learning (ML), specifically the Random Forest algorithm, the project aimed to enhance expenditure forecasting for Grants and Contributions (G&C) programs. By integrating data-driven insights with financial expertise, the ML model significantly improved the efficiency and accuracy of the department’s G&C processes. This innovative approach was recognized with the Comptroller General of Canada Innovation Award in December 2024. The article highlights how ML can bolster financial stewardship, streamline operations, and provide a scalable framework for strategic planning across the federal government.

Continue reading: Applying Random Forest Algorithms to Enhance Expenditure Predictions in Government Grants and Contributions Programs


Data to Decisions: Visualizations and ML Modeling of Rental Property Data

Topics covered in this article: Data processing and engineering Computer vision

As per the 2021 census, there were 5-million rental households in Canada, which means roughly one-third of Canadian households are renters. However, much of this rental activity occurs privately, leading to limited and inconsistent data. To bridge this knowledge gap, NorQuest college, acquired processed, analyzed, and visualized rental listings from the stakeholder – Community Data Program, for Ontario.

Continue reading: Data to Decisions: Visualizations and ML Modeling of Rental Property Data


Other recent articles

Browse articles by topic

Computer vision
Data processing and engineering
Predictive analytics
Text analysis and generation
Ethics and responsible machine learning
Other

Co-op student explores the power of Big Data

By: Kathrin Knorr, Simon Fraser University

Editor's note: The following is an edited version of an article originally featured in Simon Fraser University (SFU)'s The Co-op Close-up series. The article was modified and translated by the Data Science Network for the Federal Public Service, and reproduced here with permission from SFU.

The article features Mihir Gajjar, a co-op student working in the Data Science Division at Statistics Canada. He completed a Bachelor of Technology in Information and Communication Technology at Ahmedabad University, India. He recently completed the Professional Master's program in Computer Science at SFU. The article also features his previous supervisor at Statistics Canada, Meredith Thomas.

SFU: Can you tell us about Statistics Canada? What is it like working there?

Mihir Gajjar: I have been working in the amazing Data Science Division at Statistics Canada. In this division, data scientists work with subject-matter analysts, methodologists, and IT specialists to develop big data-processing, machine-learning, and AI (artificial intelligence) strategies.

For me, there are several highlights about the work culture at Statistics Canada, such as the daily scrum meetings with the supervisor and team members where we prioritize the day's work and discuss other important issues. I also like the agile development approach for most of the projects so that each project has a lifespan of four months, and then the project is ready for deployment. We also have weekly machine-learning technical seminars where we learn about advancements in the field and discuss relevant research papers.

SFU: Can you tell us a bit about the project(s) you are working on in your co-op position?

Mihir Gajjar, student at Simon Fraser University (Master's in Computer Science program) and co-op student with Data Science Division.

Photo: D. Taiwo.

Mihir Gajjar: At Statistics Canada, analysts spend a lot of time searching for information about enterprises. With the amount of news growing exponentially, it becomes difficult to manually track all the published information. The project I am working on seeks to automate the tasks of detecting events of interest from news articles and extracting their attributes.

For example, events of interest that are related to enterprises might include mergers and acquisitions, equity markets, and branch openings, whereas event attributes are things like dates and locations of said events. Ultimately, my work allows economic analysts to spend less time on information searches and devote more time to analysis. This multidisciplinary work is a collaboration between teams, including portfolio and accounts managers, methodologists, and other data scientists.

The main technical tasks include finding similarities between articles for ranking, removing duplicates, and text summarization. The goal is to provide subject-matter experts with a dashboard to support the detection and tracking of desired events over a specified time span.

The data for our models consist of 1.5 million news articles from the Dow Jones Data News and Analytics Platform and NewsDesk, a shared governmental system. Exploratory data analysis and basic text pre-processing were used to train various machine-learning models.

SFU: How did the Big Data program prepare you for your co-op position?

Mihir Gajjar: SFU's Big Data program provided me with theoretical as well as practical hands-on experience through lectures and a project-based learning environment. Subjects like machine learning helped me to develop a solid theoretical base while the practical assignments and group projects allowed me to implement the concepts and try out new tools and technologies.

Along with sound technical knowledge, the program equipped me with essential skills, such as working in a team, communicating and sharing ideas with other people, giving presentations, critical thinking, technical writing, and time management.

SFU: What are your most valuable takeaways from this co-op experience?

Mihir Gajjar: Through the project I have been working on, I learned a lot about the practical aspects of working as a data scientist. Part of the project involved extracting data using an external company's Application Programming Interface, which meant weekly meetings with its development team. This helped me learn how to think analytically and design questions which aid in understanding the quality and the depth of the data. I also learned about the importance of fully understanding the user's needs in order to develop a product that meets those requirements.

Working at Statistics Canada gave me exposure to real-world data science projects and taught me how to create and execute a technical plan to achieve the desired goals. This is my first time working as a data scientist and this experience has improved my skills and made me feel confident about working in this role moving forward in my career.

SFU: What do employers say about our students?

Meredith Thomas, Chief, Data Science Division: Mihir is a great fit for this work environment, as he is always open to learning new approaches in technology, and works well independently or in a team setting. Partnered with a senior data scientist, Mihir continues to grow in his time here at Statistics Canada, moving from Natural Language Processing projects to image processing projects with enthusiasm and focus. He is a valued member of our team.

Date modified:

The COVID-19 cloud platform for advanced analytics

By: Allie MacIsaac, Statistics Canada

As Canadians grew increasingly concerned about the impact of COVID-19 on our society and our economy in March 2020, Statistics Canada set to work collecting vital information to support citizens and critical government operations during unprecedented times.

At the same time, analysts, researchers and data scientists across the Government of Canada were faced with another pressing concern…how could they provide much-needed information quickly and securely, while working remotely with limited access to their usual tools and computing infrastructure?

Fast-tracking modernization

As the need for analytical capabilities became increasingly urgent, a team of experts at Statistics Canada came together to fast-track the Data Analytics as a Service (DAaaS) project and explore open source data solutions. The aim was to equip data scientists with the work environment they need to conduct deeper analysis and to provide insights on the impact of COVID-19 in Canada.

The result is the COVID-19 cloud platform for advanced analytics: a virtual data science workspace that integrates data from reliable StatCan sources, extracts information and presents it in a central platform that includes robust presentation and dissemination options.

Not only does this solution meet the needs of data scientists, but it also drives forward modernization at the national statistical agency by helping meet the strategic objectives of the Statistics Canada Data Strategy—including an enhanced focus on data science—at an expedited pace.

A multi-disciplinary tiger team creating a "dream" data science environment

The analytical platform is the result of a collaboration between Statistics Canada's Data Science Division, the IT DAaaS team, the Cloud Team and partners at Microsoft.

Each group had an important role to play. The cloud team laid the foundation for the work, providing a robust containerized foundation using Kubernetes and the underlying Azure infrastructure as a service (IaaS) base. The DAaaS team worked on integrating service components, including the portal, using the underlying services. The Data Science team worked with the other teams to identify open source software to be installed and worked to define pipelines and data flows. By having data science experts working with cloud and platform experts, the team was able to deliver a scalable, accessible platform that met data science needs. The result is an environment with a variety of advanced tools for satellite image processing, natural language processing and automation.

By breaking down barriers internally and externally, the team was able to create a cohesive workbench in a matter of weeks—all while working securely from home. This was done with a user-centric approach to modernize the experience for data users and better meet their rapidly-evolving needs, while providing end-to-end data science support.

"The platform has had a major, positive impact on the way we work. We are able to get better results, work in an agile way and see the benefits of modernization in action," explains Sarah MacKinnon, Assistant Director of the Information Technology Project Delivery team at Statistics Canada.

Inside the workbench you will find a state-of-the-art platform, a "dream data science environment," says Sevgui Erman, Director of the Data Science Division at Statistics Canada. "This environment addresses the high-capacity computing needs of data scientists and meets our needs for collaborative workspaces and tools. The workbench is equipped with tools for continuous integration and continuous development that allow for scalable and reproducible data pipelines, as well as advanced data and model management capabilities."

"You can also build out your workflows using GitHub Actions and Kubeflow Pipelines. With templates for training, validation, preprocessing, and RESTful model serving, and with integrations with Platform as a service offerings like Databricks or managed Data Lakes, the advanced analytics workspace gives you the freedom to harness whatever tools you want, and it gives you a unified layer to use them from," adds Blair Drummond, an analyst with StatCan's Data Science Division and a member of the tiger team.

Peek inside the workbench

The team gathered the best available open source tools to create a workbench that allows users to remotely access data loaded by Statistics Canada—with a focus on COVID-19. This powerful environment employs a full suite of data science and analytics tools, including

  • Jupyter Notebooks for R and Python
  • Linux remote desktop
  • Power BI
  • QGIS
  • R Shiny
  • Pachyderm (data lineage and pipelines)
  • Kubeflow Pipelines
  • MLflow for model tracking, custom web applications
  • self-serve sharable storage
  • and more.

The platform also includes support channels for user feedback and guidance.

The result is that data users are better equipped to analyze the impacts of COVID-19 and share their findings in a secure, confidential manner.

Why open source software? As Blair explains, "Open source software tools give users more flexibility and autonomy over their own work. These tools are accessible and crowd-sourced, meaning that users can also get support and help with analysis." Furthermore, results are reproducible by colleagues in other departments. An approach that incorporates open source software supports collaboration between data scientists that benefits all users.

The platform in action

By leveraging the resource functionality of the platform, data scientists at StatCan have been hard at work as they put the platform to use.

One example is the work done by Kenneth Chu, a Senior Methodologist with StatCan's Data Science Division, who was one of the early adopters of the new platform and tested it's capabilities by performing a massively parallelized statistical analysis that otherwise would not have been feasible with pre-existing computing infrastructure.

Kenneth fitted a hierarchical Bayesian model (to provincial COVID-19 death count time series) that estimated the effects of social distancing measures on COVID-19 transmissibility. There were, however, certain crucial but unknown input parameters, namely, the provincial COVID-19 infection fatality rates (IFR, defined as the conditional probability of dying of COVID-19 given that one is infected with it). Their theoretically straightforward estimates are simply the provincial ratios of the number of COVID-19 deaths to the true number of COVID-19 infections. Unfortunately, the near-complete lack of knowledge of the latter, especially during the early phase of the pandemic, rendered the IFRs highly uncertain.

The parallelized sensitivity analysis involved simply executing the Bayesian analysis independently a reasonably large number of times (200, to be precise), each time sampling the provincial IFRs randomly from the full range of plausible values. Each independent execution required approximately eight hours to complete, using two computing cores. The full sensitivity analysis, executed on DAaaS, thus required 3,200 CPU core hours in total, which would have been impossible with pre-existing infrastructure.

The capacity to execute distributed/massively parallel workflows contributes to StatCan's Big Data infrastructure. In addition, such computing capacity also enables the use of many distribution-free statistical methods (e.g. resampling-, permutation-based ones), which are highly computationally intensive but complement modern complex analytical techniques from machine learning or Bayesian statistics.

Overall, the increased computing capabilities support the agency's mission to provide timely, critical information to Canadians during the unprecedented challenges of the COVID-19 pandemic.

A secure, phased approach

Currently, the COVID-19 analytical platform is accessible to Statistics Canada employees, and to other Government of Canada departments who have research data partnerships with the agency. If you are a data scientist interested in this platform, please reach out to get involved and experience the platform by emailing statcan.analyticalplatform-platformeanalytique.statcan@statcan.gc.ca.

This is part of Statistics Canada's phased approach to grant access to the platform in a secure manner. For the first phase, access to the platform was limited to internal StatCan employees working with publicly available data only. The second phase featured access to unclassified data (publicly available data only) and access to the platform was made available to select Government of Canada employees by invitation. The third phase will feature protected data and use a mix of public and other data sets. Access to this platform will be promoted externally on the StatCan website. Each phase will include the necessary safeguards to ensure a secure environment is maintained at all times, including regular security assessments.

As this project continues to progress, Statistics Canada looks forward to engaging with the data science community and continuing to provide vital information to all Canadians.

Project team and contributors:

Christian Ritter, Statistic Canada; Blair Drummond, Statistics Canada

Date modified:

Meeting - August 20, 2020

Ninth Canadian Statistics Advisory Council (CSAC) Meeting

Date: August 20 2020, 1:00pm to 4:00pm

Location: Virtual meeting

CSAC members

Dr. Teresa Scassa (Chairperson), Anil Arora, Gurmeet Ahluwalia, David Chaundy, Annette Hester, Jan Kestle, Dr. Céline Le Bourdais, Gail Mc Donald, Dr. Howard Ramos, Dr. Michael Wolfson

Statistics Canada guests/support

Melanie Forsberg, Lynn Barr-Telford, Marc Lachance, Jean-Pierre Corbeil

Meeting agenda

Meeting agenda
Time Agenda Item Lead Participant(s)
12:50 - 13:00 Virtual Arrival CSAC Members
13:00 - 13:05 Chairperson introductory remarks Teresa Scassa: Chairperson
13:05 - 14:10 Update from the Chief Statistician of Canada Anil Arora: Chief Statistician of Canada
14:10 - 14:15 Review Annual Report content
In camera discussion
CSAC members and Rosemary Bender
14:15 - 14:25 Health Break
14:25 - 15:10 Review Annual Report content continued
In camera discussion
CSAC members and Rosemary Bender
15:10 - 15:40 Next steps for the Annual Report
In camera discussion
CSAC members and Rosemary Bender
15:40 - 15:55 Future Planning
In camera discussion
CSAC members
15:55 - 16:00 Closing remarks
In camera
Teresa Scassa: Chairperson

Meeting - July 24, 2020

Eighth Canadian Statistics Advisory Council (CSAC) Meeting

Date: July 24 2020, 1:00pm to 4:00pm

Location: Virtual meeting

CSAC members

Dr. Teresa Scassa (Chairperson), Anil Arora,  Gurmeet Ahluwalia, David Chaundy, Annette Hester, Jan Kestle, Dr. Céline Le Bourdais, Gail Mc Donald, Dr. Howard Ramos, Dr. Michael Wolfson

Meeting agenda

Meeting agenda
Time Agenda Item Lead Participant(s)
12:50 - 13:00 Virtual Arrival CSAC Members
13:00 - 13:05 Chairperson introductory remarks Teresa Scassa: Chairperson
13:05 - 13:10 Update from the Chief Statistician of Canada Anil Arora: Chief Statistician of Canada
13:10 - 13:55 Discussion with the Chair of the Advisory Committee on Ethnocultural and Immigration Statistics
In camera discussion

Morton Weinfeld: Chair of the Advisory Committee on Ethnocultural and immigration statistics
Lynn Barr-Telford: Assistant Chief Statistician
Marc Lachance: Acting, Director General
Jean-Pierre Corbeil: Assistant Director

13:55 - 14:05 Health Break
14:05 - 15:40 Update on Annual Report
In camera discussion
CSAC members and Rosemary Bender
15:40 - 15:55 Future Planning
In camera discussion
CSAC members
15:55 - 16:00 Closing remarks
In camera
Teresa Scassa: Chairperson

Price index differential range

 
Price index differential range Post classification level
190.0 + 16
185.0 - 189.9 15
180.0 - 184.9 14
175.0 - 179.9 13
170.0 - 174.9 12
165.0 - 169.9 11
160.0 - 164.9 10
155.0 - 159.9 9
150.0 - 154.9 8
145.0 - 149.9 7
140.0 - 144.9 6
135.0 - 139.9 5
130.0 - 134.9 4
125.0 - 129.9 3
120.0 - 124.9 2
115.0 - 119.9 1
Adapted from IPGHD, Appendix H – Criteria for Determining Levels (March 1, 2017)

Survey On Outstanding Treasury Bills And Short-Term Papers

Original term one year or less

Month End: October 2020

Canadian dollar equivalent of outstanding debt denominated in foreign currency
  Canadian Dollar Foreign Currency *
(Par-value - thousands of dollars)
Outstanding treasury bills of which: $
(Var. # 7071)
$
(Var. # 7081)
sold directly to chartered banks $
(Var. # 7072)
$
(Var. # 7082)
sold directly to provincial govt. accounts $
(Var. # 7073)
$
(Var. # 7083)
Outstanding short-term paper of which: $
(Var. # 7074)
$
(Var. # 7084)
sold directly to chartered banks $
(Var. # 7075)
$
(Var. # 7085)
sold directly to provincial govt. accounts $
(Var. # 7076)
$
(Var. # 7086)

Signature/Name:

*Enter Canadian dollar equivalent of outstanding debt denominated in foreign currency.

This survey covers short-term treasury bill and paper borrowings, not investment in such paper.

Include only treasury bills and paper with an original term of one year or less.

Please file a report each month whether or not you have paper outstanding at the particular month-end.

Reports should be submitted immediately, addressed to:

Statistics Canada
Operations and Integration Division
Jean Talon Building, 2nd floor, B-19
170 Tunney's Pasture
Ottawa, Ontario
K1A OT6

Participation is mandatory under the authority of the Statistics Act, which ensures that all information will be kept confidential and used only for statistical purposes. We do not release any information that could identify an organization, unless consent has been given, or as permitted by the act.

To reduce respondent burden, Statistics Canada has entered into data-sharing agreements with provincial and territorial statistical agencies and other government organizations, which have agreed to keep the data confidential and use them only for statistical purposes. Statistics Canada will only share information with those organizations that have demonstrated a requirement to use the data.

Communiqué: Canadian Statistics Advisory Council Report - Toward a Stronger National Statistical System

October 20, 2020 – Ottawa, ON – Statistics Canada

Today, the Canadian Statistics Advisory Council (CSAC) issued its first report (CSAC 2020 Annual Report - Towards a Stronger National Statistical System) on the state of the country’s statistical system to the Minister of Innovation, Science and Industry. The release of this report coincides with World Statistics Day. “Our report recognizes how, in response to the COVID-19 pandemic, Statistics Canada’s modernization efforts have helped the agency pivot to meet many of the country’s statistical needs,” says CSAC member Dr. Howard Ramos. It also highlights the importance of accelerating those efforts to bridge crucial data gaps to overcome the statistical challenges that are facing both Statistics Canada as an agency and Canada as a nation.

Decision makers were hampered by a lack of timely, consistent and disaggregated data in areas such as health care and on racialized Canadians and Indigenous people. This situation served to highlight the broader need for high-quality statistical information to address nationwide health issues and socioeconomic inequities. Collecting these data while respecting the privacy of Canadians’ personal information remains of key importance.

The council’s mandate is to advise the Minister of Innovation, Science and Industry and the Chief Statistician of Canada on any number of issues concerning the relevance, accuracy, accessibility, timeliness, and privacy and confidentiality of the agency’s data.

The report includes five core recommendations: (1) including statistical data requirements in planning federal government programs, (2) addressing critical data gaps, (3) rectifying serious imbalances in funding national statistical programs, (4) ensuring the privacy of Canadians and the need for Canadians to provide data to Statistics Canada, and (5) modernizing microdata access.

Through the course of its work, the council found that, as shown by the pandemic, Statistics Canada’s central role as an independent national statistical organization has never been more critical to meet the country’s needs for timely and high-quality statistics. The pandemic has shown that nationwide data are key for decision makers, governments and the general public to understand and address important social, health, economic, environmental and energy issues facing Canadians. CSAC member Jan Kestle notes, “Bringing together data from different levels of government, and private sources, is necessary to get a complete and up-to-date picture of the social and economic well-being of Canadians. Collaboration across jurisdictions is complicated. But this challenge must be met head-on for Canada to fill data gaps and ensure a strong foundation for decision making.

Serious shortcomings in the timeliness, completeness and quality of Canadian health care and health outcome data have greatly impaired the ability of governments at all levels to monitor and assess the evolution of the pandemic, let alone address serious health issues in Canada. The council also saw that the ability to address barriers faced by racialized groups and Indigenous peoples in Canada is seriously hampered by the lack of timely, consistent and disaggregated data. As CSAC member Gail Mc Donald explains, “The year 2020 has brought the issue of systemic racism to the forefront. By truly addressing Indigenous data gaps on the impacts of racism within our society, only then can we harness the power of data to effect change and make a difference.”

To overcome these gaps, stable core funding for Statistics Canada’s programs is essential to having high-quality data and statistical information that represent all regions of Canada and the full range of circumstances of individual Canadians.

The council also found that, going forward, it is important for researchers, decision makers and communities to be able to access the data they provide to the agency. The modernization of Statistics Canada’s microdata access infrastructure is a long-awaited initiative that will greatly improve the quality and depth of research and analysis in Canada across all sectors. However, its timeline for full implementation is too long and should be accelerated. As CSAC member Dr. Céline Le Bourdais observes, “The COVID-19 pandemic has reinforced the need for Statistics Canada to continue to modernize its infrastructure and rapidly move toward new distributed modes of data access. This will ensure that duly authorized researchers are able to pursue timely analyses on the pressing challenges faced by society.”

Canadians have provided personal data to Statistics Canada for over 100 years, and there should be no conflict between respect for the privacy of Canadians and the need for Canadians to provide data to Statistics Canada. The council found that this is also key to ensuring a robust statistical system and a stronger country.

Contact info and expertise of CSAC spokespeople

Dr. Howard Ramos
Media release CSAC facilitator
Mobile: 902-402-9893
Email: howard.ramos@uwo.ca
English/français
Availability on October 20, 2020: 8:00 a.m. to 4:00 p.m. EDT
Expertise: General insight on the report, focus on race and ethnic data, data access, balance of privacy and need for data

Jan Kestle
Mobile: 647-988-2834
Email: jan.kestle@environicsanalytics.com
Availability on October 20, 2020: 8:00 a.m. to 6:00 p.m. EDT
Expertise: The importance of high-quality data and evidence-driven decision making, modernizing methods for the production of official statistics, national data strategy, privacy and security of data

Dr. Céline Le Bourdais
Mobile: 514-770-3714
Email: celine.lebourdais@mcgill.ca
Français/English
Availability on October 20, 2020: 8:00 a.m. to 2:30 p.m. EDT
Expertise: General insight on the report, focus on data access and addressing data gaps and imbalances in funding

Gail Mc Donald
Mobile: 514-970-8254
Email: gail.mcd@sympatico.ca
English
Availability on October 20, 2020: 11:00 a.m. to 3:00 p.m.
Expertise: General insight on the report, focus on Indigenous data, capacity development and governance

Members of the CSAC