Statistical techniques

Results

All (162) (1 to 10 of 162 results)

  • Articles and reports: 11-522-X202200100008
    Description: The publication of more disaggregated data can increase transparency and provide important information on underrepresented groups. Developing more readily available access options increases the amount of information available to and produced by researchers. Increasing the breadth and depth of the information released allows for a better representation of the Canadian population, but also puts a greater responsibility on Statistics Canada to do this in a way that preserves confidentiality, and thus it is helpful to develop tools which allow Statistics Canada to quantify the risk from the additional data granularity. In an effort to evaluate the risk of a database reconstruction attack on Statistics Canada’s published Census data, this investigation follows the strategy of the US Census Bureau, which outlined a method to use a Boolean satisfiability (SAT) solver to reconstruct individual attributes of residents of a hypothetical US Census block, based only on a table of summary statistics. The technique is expanded to attempt to reconstruct a small fraction of Statistics Canada’s Census microdata. This paper will discuss the findings of the investigation, the challenges involved in mounting a reconstruction attack, and the effect of an existing confidentiality measure in mitigating these attacks. Furthermore, the existing strategy is compared to other potential methods used to protect data – in particular, releasing tabular data perturbed by some random mechanism, such as those suggested by differential privacy.
    Release date: 2024-03-25

  • Articles and reports: 11-522-X202200100014
    Description: Ethnic minorities are often underrepresented in survey research, due to the challenges many researchers face in including these populations. While some studies discuss several methods in comparison, few have directly compared these methods empirically, leaving researchers seeking to include ethnic minorities in their studies unsure of their best options. In this article, I briefly review the methodological and ethical reasons for increasing ethnic minority representation in social science research, as well as challenges of doing so. I then present findings from ten studies which empirically compare methods of sampling and/or recruiting ethnic minority individuals. Finally, I discuss some implications for future research.
    Release date: 2024-03-25

  • Articles and reports: 12-001-X202300200005
    Description: Population undercoverage is one of the main hurdles faced by statistical analysis with non-probability survey samples. We discuss two typical scenarios of undercoverage, namely, stochastic undercoverage and deterministic undercoverage. We argue that existing estimation methods under the positivity assumption on the propensity scores (i.e., the participation probabilities) can be directly applied to handle the scenario of stochastic undercoverage. We explore strategies for mitigating biases in estimating the mean of the target population under deterministic undercoverage. In particular, we examine a split population approach based on a convex hull formulation, and construct estimators with reduced biases. A doubly robust estimator can be constructed if a follow-up subsample of the reference probability survey with measurements on the study variable becomes feasible. The performance of six competing estimators is investigated through a simulation study, and issues that require further investigation are briefly discussed.
    Release date: 2024-01-03

  • Articles and reports: 11-633-X2023003
    Description: This paper spans the academic work and estimation strategies used in national statistics offices. It addresses the issue of producing fine, grid-level geography estimates for Canada by exploring the measurement of subprovincial and subterritorial gross domestic product using Yukon as a test case.
    Release date: 2023-12-15

  • Articles and reports: 12-001-X202300100001
    Description: Recent work in survey domain estimation allows for estimation of population domain means under a priori assumptions expressed in terms of linear inequality constraints. For example, it might be known that the population means are non-decreasing along ordered domains. Imposing the constraints has been shown to provide estimators with smaller variance and tighter confidence intervals. In this paper we consider a formal test of the null hypothesis that all the constraints are binding, versus the alternative that at least one constraint is non-binding. The test of constant versus increasing domain means is a special case. The power of the test is substantially better than the test with the same null hypothesis and an unconstrained alternative. The new test is used with data from the National Survey of College Graduates, to show that salaries are positively related to the subject’s father’s educational level, across fields of study and over several years of cohorts.
    Release date: 2023-06-30

  • Articles and reports: 12-001-X202300100002
    Description: We consider regression analysis in the context of data integration. To combine partial information from external sources, we employ the idea of model calibration, which introduces a “working” reduced model based on the observed covariates. The working reduced model is not necessarily correctly specified but can be a useful device to incorporate the partial information from the external data. The actual implementation is based on a novel application of the information projection and model calibration weighting. The proposed method is particularly attractive for combining information from several sources with different missing patterns. The proposed method is applied to a real data example combining survey data from the Korean National Health and Nutrition Examination Survey and big data from the National Health Insurance Sharing Service in Korea.
    Release date: 2023-06-30

  • Articles and reports: 11-637-X202200100007
    Description: As the seventh goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to ensure access to affordable, reliable, sustainable and modern energy for all by 2030. This 2022 infographic provides an overview of indicators underlying the seventh Sustainable Development Goal in support of affordable and clean energy, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2022-12-13

  • Articles and reports: 11-637-X202200100008
    Description: As the eighth goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to promote sustained, inclusive and sustainable economic growth, full and productive employment and decent work for all by 2030. This 2022 infographic provides an overview of indicators underlying the eighth Sustainable Development Goal in support of decent work and economic growth, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2022-12-13

  • Articles and reports: 11-637-X202200100009
    Description: As the ninth goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to build resilient infrastructure, promote inclusive and sustainable industrialization and foster innovation by 2030. This 2022 infographic provides an overview of indicators underlying the ninth Sustainable Development Goal in support of industry, innovation and infrastructure, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2022-12-13

  • Articles and reports: 11-637-X202200100010
    Description: As the tenth goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to reduce inequalities within and among countries by 2030. This 2022 infographic provides an overview of indicators underlying the tenth Sustainable Development Goal in support of reduced inequalities, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2022-12-13
