Data analysis

Skip to main content
Skip to footer

Language selection

Français

Search and menus

Search and menus

Search

Skip to filters. View results.

Results

All (18)

All (18) (0 to 10 of 18 results)

1. Longitudinal Immigration Database (IMDB) Technical Report, 2022
Articles and reports: 11-633-X2024001
Description: The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 35 years.
Release date: 2024-01-22
2. Confidentiality Vetting Support: Dominance and homogeneity using SAS
Stats in brief: 89-20-00082021001
Description: This video is part of the confidentiality vetting support series and presents examples of how to use SAS to perform the dominance and homogeneity test while using the Census.
Release date: 2022-04-29
3. Confidentiality Vetting Support: Proportion and round tool using SAS
Stats in brief: 89-20-00082021002
Description: This video is part of the confidentiality vetting support series and presents examples of how to use SAS to create proportion output for researchers working with confidential data.
Release date: 2022-04-27
4. Confidentiality Vetting Support: Rounding proportions using Stata
Stats in brief: 89-20-00082021003
Description: This video is part of the confidentiality vetting support series and presents examples of how to use Stata to create proportion output for researchers working with confidential data.
Release date: 2022-04-27
5. Confidentiality Vetting Support: Dominance and homogeneity using the tcensus function (Stata)
Stats in brief: 89-20-00082021004
Description: This video is part of the confidentiality vetting support series and presents examples of how to use Stata to perform the dominance and homogeneity test while using the Census.
Release date: 2022-04-27
6. Confidentiality Vetting Support: Rounding proportions using Rounder An R Shiny App
Stats in brief: 89-20-00082021005
Description: This video is part of the confidentiality vetting support series and presents examples of how to use R to create proportion output for researchers working with confidential data.
Release date: 2022-04-27
7. Confidentiality Vetting Support: Dominance and homogeneity using R
Stats in brief: 89-20-00082021006
Description: This video is part of the confidentiality vetting support series and presents examples of how to use R to perform the dominance and homogeneity test while using the Census.
Release date: 2022-04-27
8. Data science pipelines @ Istat: challenges and solutions Archived
Articles and reports: 11-522-X202100100029
Description:
In line with the path taken by the European Statistical System, Istat is investing on innovative methods to harness Big Data sources and to use them for the production of new and enriched Official Statistics products. Big Data sources are not, in general, directly tractable with traditional statistical techniques, just think of specific data types such as images and texts that are examples of the Variety dimension of Big Data. This motivates and justifies the growing interest of National Statistical Institutes in data science techniques. Istat is currently using data science techniques, including machine learning techniques, in innovation projects and for the publication of experimental statistics. This paper will provide an overview of the main current projects by Istat and will focus on two specific Big Data-based production pipelines, related to the processing of respectively text sources and imagery sources. The paper will highlight the main challenges these two pipelines and the solutions put in place to solve them.

Key Words: Machine Learning; Text Processing; Image Processing; Big Data

Release date: 2021-11-05
9. An Approximate Bayesian Approach to Improving Probability Sample Estimators Using a Supplementary Non-Probability Sample Archived
Articles and reports: 11-522-X202100100008
Description:
Non-probability samples are being increasingly explored by National Statistical Offices as a complement to probability samples. We consider the scenario where the variable of interest and auxiliary variables are observed in both a probability and non-probability sample. Our objective is to use data from the non-probability sample to improve the efficiency of survey-weighted estimates obtained from the probability sample. Recently, Sakshaug, Wisniowski, Ruiz and Blom (2019) and Wisniowski, Sakshaug, Ruiz and Blom (2020) proposed a Bayesian approach to integrating data from both samples for the estimation of model parameters. In their approach, non-probability sample data are used to determine the prior distribution of model parameters, and the posterior distribution is obtained under the assumption that the probability sampling design is ignorable (or not informative). We extend this Bayesian approach to the prediction of finite population parameters under non-ignorable (or informative) sampling by conditioning on appropriate survey-weighted statistics. We illustrate the properties of our predictor through a simulation study.
Key Words: Bayesian prediction; Gibbs sampling; Non-ignorable sampling; Statistical data integration.

Release date: 2021-10-29
10. Statistics Canada British Columbia Opioid Overdose Analytical File: Technical Report
Articles and reports: 11-633-X2021003
Description:
Canada continues to experience an opioid crisis. While there is solid information on the demographic and geographic characteristics of people experiencing fatal and non-fatal opioid overdoses in Canada, there is limited information on the social and economic conditions of those who experience these events. To fill this information gap, Statistics Canada collaborated with existing partnerships in British Columbia, including the BC Coroners Service, BC Stats, the BC Centre for Disease Control and the British Columbia Ministry of Health, to create the Statistics Canada British Columbia Opioid Overdose Analytical File (BC-OOAF).
Release date: 2021-02-17

Data (0)

Data (0) (0 results)

No content available at this time.

Analysis (17)

Analysis (17) (0 to 10 of 17 results)

1. Longitudinal Immigration Database (IMDB) Technical Report, 2022
Articles and reports: 11-633-X2024001
Description: The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 35 years.
Release date: 2024-01-22
2. Confidentiality Vetting Support: Dominance and homogeneity using SAS
Stats in brief: 89-20-00082021001
Description: This video is part of the confidentiality vetting support series and presents examples of how to use SAS to perform the dominance and homogeneity test while using the Census.
Release date: 2022-04-29
3. Confidentiality Vetting Support: Proportion and round tool using SAS
Stats in brief: 89-20-00082021002
Description: This video is part of the confidentiality vetting support series and presents examples of how to use SAS to create proportion output for researchers working with confidential data.
Release date: 2022-04-27
4. Confidentiality Vetting Support: Rounding proportions using Stata
Stats in brief: 89-20-00082021003
Description: This video is part of the confidentiality vetting support series and presents examples of how to use Stata to create proportion output for researchers working with confidential data.
Release date: 2022-04-27
5. Confidentiality Vetting Support: Dominance and homogeneity using the tcensus function (Stata)
Stats in brief: 89-20-00082021004
Description: This video is part of the confidentiality vetting support series and presents examples of how to use Stata to perform the dominance and homogeneity test while using the Census.
Release date: 2022-04-27
6. Confidentiality Vetting Support: Rounding proportions using Rounder An R Shiny App
Stats in brief: 89-20-00082021005
Description: This video is part of the confidentiality vetting support series and presents examples of how to use R to create proportion output for researchers working with confidential data.
Release date: 2022-04-27
7. Confidentiality Vetting Support: Dominance and homogeneity using R
Stats in brief: 89-20-00082021006
Description: This video is part of the confidentiality vetting support series and presents examples of how to use R to perform the dominance and homogeneity test while using the Census.
Release date: 2022-04-27
8. Data science pipelines @ Istat: challenges and solutions Archived
Articles and reports: 11-522-X202100100029
Description:
In line with the path taken by the European Statistical System, Istat is investing on innovative methods to harness Big Data sources and to use them for the production of new and enriched Official Statistics products. Big Data sources are not, in general, directly tractable with traditional statistical techniques, just think of specific data types such as images and texts that are examples of the Variety dimension of Big Data. This motivates and justifies the growing interest of National Statistical Institutes in data science techniques. Istat is currently using data science techniques, including machine learning techniques, in innovation projects and for the publication of experimental statistics. This paper will provide an overview of the main current projects by Istat and will focus on two specific Big Data-based production pipelines, related to the processing of respectively text sources and imagery sources. The paper will highlight the main challenges these two pipelines and the solutions put in place to solve them.

Key Words: Machine Learning; Text Processing; Image Processing; Big Data

Release date: 2021-11-05
9. An Approximate Bayesian Approach to Improving Probability Sample Estimators Using a Supplementary Non-Probability Sample Archived
Articles and reports: 11-522-X202100100008
Description:
Non-probability samples are being increasingly explored by National Statistical Offices as a complement to probability samples. We consider the scenario where the variable of interest and auxiliary variables are observed in both a probability and non-probability sample. Our objective is to use data from the non-probability sample to improve the efficiency of survey-weighted estimates obtained from the probability sample. Recently, Sakshaug, Wisniowski, Ruiz and Blom (2019) and Wisniowski, Sakshaug, Ruiz and Blom (2020) proposed a Bayesian approach to integrating data from both samples for the estimation of model parameters. In their approach, non-probability sample data are used to determine the prior distribution of model parameters, and the posterior distribution is obtained under the assumption that the probability sampling design is ignorable (or not informative). We extend this Bayesian approach to the prediction of finite population parameters under non-ignorable (or informative) sampling by conditioning on appropriate survey-weighted statistics. We illustrate the properties of our predictor through a simulation study.
Key Words: Bayesian prediction; Gibbs sampling; Non-ignorable sampling; Statistical data integration.

Release date: 2021-10-29
10. Statistics Canada British Columbia Opioid Overdose Analytical File: Technical Report
Articles and reports: 11-633-X2021003
Description:
Canada continues to experience an opioid crisis. While there is solid information on the demographic and geographic characteristics of people experiencing fatal and non-fatal opioid overdoses in Canada, there is limited information on the social and economic conditions of those who experience these events. To fill this information gap, Statistics Canada collaborated with existing partnerships in British Columbia, including the BC Coroners Service, BC Stats, the BC Centre for Disease Control and the British Columbia Ministry of Health, to create the Statistics Canada British Columbia Opioid Overdose Analytical File (BC-OOAF).
Release date: 2021-02-17

Reference (1)

Reference (1) ((1 result))

1. SLID Derived Variables: Demographic, Cultural and Geographic Archived
Surveys and statistical programs – Documentation: 75F0002M1994018
Description:
This document describes the demographic, cultural and geographic derived variables for the Survey of Labour and Income Dynamics (SLID).
Release date: 1995-12-30

Report a problem or mistake on this page

Date modified:: 2024-04-25