Statistical techniques
Filter results by
Search HelpKeyword(s)
Type
Survey or statistical program
Results
All (40)
All (40) (0 to 10 of 40 results)
- Articles and reports: 11-522-X202200100014Description: Ethnic minorities are often underrepresented in survey research, due to the challenges many researchers face in including these populations. While some studies discuss several methods in comparison, few have directly compared these methods empirically, leaving researchers seeking to include ethnic minorities in their studies unsure of their best options. In this article, I briefly review the methodological and ethical reasons for increasing ethnic minority representation in social science research, as well as challenges of doing so. I then present findings from ten studies which empirically compare methods of sampling and/or recruiting ethnic minority individuals. Finally, I discuss some implications for future research.Release date: 2024-03-25
- Articles and reports: 11-633-X2023003Description: This paper spans the academic work and estimation strategies used in national statistics offices. It addresses the issue of producing fine, grid-level geography estimates for Canada by exploring the measurement of subprovincial and subterritorial gross domestic product using Yukon as a test case.Release date: 2023-12-15
- Articles and reports: 12-001-X202300100001Description: Recent work in survey domain estimation allows for estimation of population domain means under a priori assumptions expressed in terms of linear inequality constraints. For example, it might be known that the population means are non-decreasing along ordered domains. Imposing the constraints has been shown to provide estimators with smaller variance and tighter confidence intervals. In this paper we consider a formal test of the null hypothesis that all the constraints are binding, versus the alternative that at least one constraint is non-binding. The test of constant versus increasing domain means is a special case. The power of the test is substantially better than the test with the same null hypothesis and an unconstrained alternative. The new test is used with data from the National Survey of College Graduates, to show that salaries are positively related to the subject’s father’s educational level, across fields of study and over several years of cohorts.Release date: 2023-06-30
- Stats in brief: 89-20-00062022005Description:
In this video, you will learn the answers to the following questions: What are the different types of error? What are the types of error that lead to statistical bias? Where during the data journey statistical bias can occur?
Release date: 2022-10-17 - Stats in brief: 89-20-00062022002Description:
This video will break down what it means to be FAIR in terms of data and metadata, and how each pillar of FAIR serves to guide data users and producers alike, as they navigate their way through the data journey, in order to gain maximum, long term value.
Release date: 2022-05-24 - Stats in brief: 89-20-00062022003Description:
By the end of this video you will understand what confidence intervals are, why we use them, and what factors have an impact on them.
Release date: 2022-05-24 - Articles and reports: 11-522-X202100100027Description:
Privacy concerns are a barrier to applying remote analytics, including machine learning, on sensitive data via the cloud. In this work, we use a leveled fully Homomorphic Encryption scheme to train an end-to-end supervised machine learning algorithm to classify texts while protecting the privacy of the input data points. We train our single-layer neural network on a large simulated dataset, providing a practical solution to a real-world multi-class text classification task. To improve both accuracy and training time, we train an ensemble of such classifiers in parallel using ciphertext packing.
Key Words: Privacy Preservation, Machine Learning, Encryption
Release date: 2021-10-29 - Articles and reports: 11-633-X2018018Description:
This paper describes the data sources and methods used to backcast provincial and territorial income-based gross domestic product (GDP), expenditure-based GDP, real gross domestic income, unemployment rates, depreciation rates and urbanization rates. Nevertheless, estimates can be produced that are very close and which are useful for understanding the evolution of the provincial and territorial economies. Instrumental variable techniques are used to estimate the historical movements of these economic variables back to 1950.
Release date: 2018-11-02 - 9. Growth Rates Preservation (GRP) temporal benchmarking: Drawbacks and alternative solutions ArchivedArticles and reports: 12-001-X201800154927Description:
Benchmarking monthly or quarterly series to annual data is a common practice in many National Statistical Institutes. The benchmarking problem arises when time series data for the same target variable are measured at different frequencies and there is a need to remove discrepancies between the sums of the sub-annual values and their annual benchmarks. Several benchmarking methods are available in the literature. The Growth Rates Preservation (GRP) benchmarking procedure is often considered the best method. It is often claimed that this procedure is grounded on an ideal movement preservation principle. However, we show that there are important drawbacks to GRP, relevant for practical applications, that are unknown in the literature. Alternative benchmarking models will be considered that do not suffer from some of GRP’s side effects.
Release date: 2018-06-21 - 10. Canadian Cancer Treatment Linkage Project ArchivedArticles and reports: 11-633-X2018016Description:
Record linkage has been identified as a potential mechanism to add treatment information to the Canadian Cancer Registry (CCR). The purpose of the Canadian Cancer Treatment Linkage Project (CCTLP) pilot is to add surgical treatment data to the CCR. The Discharge Abstract Database (DAD) and the National Ambulatory Care Reporting System (NACRS) were linked to the CCR, and surgical treatment data were extracted. The project was funded through the Cancer Data Development Initiative (CDDI) of the Canadian Partnership Against Cancer (CPAC).
The CCTLP was developed as a feasibility study in which patient records from the CCR would be linked to surgical treatment records in the DAD and NACRS databases, maintained by the Canadian Institute for Health Information. The target cohort to whom surgical treatment data would be linked was patients aged 19 or older registered on the CCR (2010 through 2012). The linkage was completed in Statistics Canada’s Social Data Linkage Environment (SDLE).
Release date: 2018-03-27
Data (0)
Data (0) (0 results)
No content available at this time.
Analysis (38)
Analysis (38) (0 to 10 of 38 results)
- Articles and reports: 11-522-X202200100014Description: Ethnic minorities are often underrepresented in survey research, due to the challenges many researchers face in including these populations. While some studies discuss several methods in comparison, few have directly compared these methods empirically, leaving researchers seeking to include ethnic minorities in their studies unsure of their best options. In this article, I briefly review the methodological and ethical reasons for increasing ethnic minority representation in social science research, as well as challenges of doing so. I then present findings from ten studies which empirically compare methods of sampling and/or recruiting ethnic minority individuals. Finally, I discuss some implications for future research.Release date: 2024-03-25
- Articles and reports: 11-633-X2023003Description: This paper spans the academic work and estimation strategies used in national statistics offices. It addresses the issue of producing fine, grid-level geography estimates for Canada by exploring the measurement of subprovincial and subterritorial gross domestic product using Yukon as a test case.Release date: 2023-12-15
- Articles and reports: 12-001-X202300100001Description: Recent work in survey domain estimation allows for estimation of population domain means under a priori assumptions expressed in terms of linear inequality constraints. For example, it might be known that the population means are non-decreasing along ordered domains. Imposing the constraints has been shown to provide estimators with smaller variance and tighter confidence intervals. In this paper we consider a formal test of the null hypothesis that all the constraints are binding, versus the alternative that at least one constraint is non-binding. The test of constant versus increasing domain means is a special case. The power of the test is substantially better than the test with the same null hypothesis and an unconstrained alternative. The new test is used with data from the National Survey of College Graduates, to show that salaries are positively related to the subject’s father’s educational level, across fields of study and over several years of cohorts.Release date: 2023-06-30
- Stats in brief: 89-20-00062022005Description:
In this video, you will learn the answers to the following questions: What are the different types of error? What are the types of error that lead to statistical bias? Where during the data journey statistical bias can occur?
Release date: 2022-10-17 - Stats in brief: 89-20-00062022002Description:
This video will break down what it means to be FAIR in terms of data and metadata, and how each pillar of FAIR serves to guide data users and producers alike, as they navigate their way through the data journey, in order to gain maximum, long term value.
Release date: 2022-05-24 - Stats in brief: 89-20-00062022003Description:
By the end of this video you will understand what confidence intervals are, why we use them, and what factors have an impact on them.
Release date: 2022-05-24 - Articles and reports: 11-522-X202100100027Description:
Privacy concerns are a barrier to applying remote analytics, including machine learning, on sensitive data via the cloud. In this work, we use a leveled fully Homomorphic Encryption scheme to train an end-to-end supervised machine learning algorithm to classify texts while protecting the privacy of the input data points. We train our single-layer neural network on a large simulated dataset, providing a practical solution to a real-world multi-class text classification task. To improve both accuracy and training time, we train an ensemble of such classifiers in parallel using ciphertext packing.
Key Words: Privacy Preservation, Machine Learning, Encryption
Release date: 2021-10-29 - Articles and reports: 11-633-X2018018Description:
This paper describes the data sources and methods used to backcast provincial and territorial income-based gross domestic product (GDP), expenditure-based GDP, real gross domestic income, unemployment rates, depreciation rates and urbanization rates. Nevertheless, estimates can be produced that are very close and which are useful for understanding the evolution of the provincial and territorial economies. Instrumental variable techniques are used to estimate the historical movements of these economic variables back to 1950.
Release date: 2018-11-02 - 9. Growth Rates Preservation (GRP) temporal benchmarking: Drawbacks and alternative solutions ArchivedArticles and reports: 12-001-X201800154927Description:
Benchmarking monthly or quarterly series to annual data is a common practice in many National Statistical Institutes. The benchmarking problem arises when time series data for the same target variable are measured at different frequencies and there is a need to remove discrepancies between the sums of the sub-annual values and their annual benchmarks. Several benchmarking methods are available in the literature. The Growth Rates Preservation (GRP) benchmarking procedure is often considered the best method. It is often claimed that this procedure is grounded on an ideal movement preservation principle. However, we show that there are important drawbacks to GRP, relevant for practical applications, that are unknown in the literature. Alternative benchmarking models will be considered that do not suffer from some of GRP’s side effects.
Release date: 2018-06-21 - 10. Canadian Cancer Treatment Linkage Project ArchivedArticles and reports: 11-633-X2018016Description:
Record linkage has been identified as a potential mechanism to add treatment information to the Canadian Cancer Registry (CCR). The purpose of the Canadian Cancer Treatment Linkage Project (CCTLP) pilot is to add surgical treatment data to the CCR. The Discharge Abstract Database (DAD) and the National Ambulatory Care Reporting System (NACRS) were linked to the CCR, and surgical treatment data were extracted. The project was funded through the Cancer Data Development Initiative (CDDI) of the Canadian Partnership Against Cancer (CPAC).
The CCTLP was developed as a feasibility study in which patient records from the CCR would be linked to surgical treatment records in the DAD and NACRS databases, maintained by the Canadian Institute for Health Information. The target cohort to whom surgical treatment data would be linked was patients aged 19 or older registered on the CCR (2010 through 2012). The linkage was completed in Statistics Canada’s Social Data Linkage Environment (SDLE).
Release date: 2018-03-27
Reference (2)
Reference (2) ((2 results))
- Surveys and statistical programs – Documentation: 11-522-X20050019476Description:
The paper will show how, using data published by Statistics Canada and available from member libraries of the CREPUQ, a linkage approach using postal codes makes it possible to link the data from the outcomes file to a set of contextual variables. These variables could then contribute to producing, on an exploratory basis, a better index to explain the varied outcomes of students from schools. In terms of the impact, the proposed index could show more effectively the limitations of ranking students and schools when this information is not given sufficient weight.
Release date: 2007-03-02 - Surveys and statistical programs – Documentation: 81-595-M2003005Geography: CanadaDescription:
This paper develops technical procedures that may enable ministries of education to link provincial tests with national and international tests in order to compare standards and report results on a common scale.
Release date: 2003-05-29
- Date modified: