Survey of Digital Technology and Internet Use (SDTIU)
Detailed information for 2023
Status:
Active
Frequency:
Occasional
Record number:
4225
The purpose of the 2023 Survey of Digital Technology and Internet Use (SDTIU) is to measure the impact of digital technologies, including the Internet and specific information and communication technologies (ICTs), on the operations of Canadian enterprises.
Data release - September 17, 2024
Description
The 2023 Survey of Digital Technology and Internet Use (SDTIU) is designed to measure the impact of digital technologies on the operations of Canadian enterprises.
The information gathered helps to better understand how enterprises use the Internet, including their online presence, involvement in e-commerce, use of specific information and communication technologies (ICTs) and interaction with federal government online services. The survey also examines skills and employment in ICT-related jobs. The data from this survey are used by government departments to develop policies and programs that help improve Canada's innovation system and strengthen the overall economy.
The SDTIU is sponsored by Innovation, Science and Economic Development Canada (ISED). Numerous other government departments also provided input during the questionnaire content development phase.
The results from this survey are also monitored by international organizations such as the Organization for Economic Co-operation and Development (OECD) for benchmarking purposes and to study the development and the influence of the digital economy.
Reference period: The 12-month calendar year
Collection period: November of the reference year through March of the next year
Subjects
- Business and government internet use
- Information and communications technology
Data sources and methodology
Target population
The target population was derived from Statistics Canada's Business Register (BR). The BR is an information database on the Canadian business population and serves as a frame for all Statistics Canada business surveys. It is a structured list of businesses engaged in the production of goods and services in Canada.
This survey covers enterprises operating in Canada in almost all industrial sectors. The industries in the target population are based on the 2022 North American Industry Classification System (NAICS). Sectors 21, 22, 23, 31, 32, 33, 41, 44, 45, 48, 49, 51, 52, 53, 54, 55, 56, 61, 62, 71, 72 and 81 were included. The following entities were excluded to arrive at the target population:
- Government entities;
- Enterprises with less than 1 employee;
- Agriculture, forestry, fishing and hunting (11), specialty trade contractors (238), head offices (551114), private households (814), and public administration (91);
- Enterprises from sectors 23, 48, 49, 53, 54, 56, 61, 62, 71, 72 and 81 with a revenue less than $100K;
- Enterprises from sectors 21, 22, 31, 32, 33, 41, 44, 45, 51, 52 and 55 with a revenue less than $250K.
Instrument design
The survey data will be collected using an electronic questionnaire.
The questionnaire had minor revisions compared to the previous version in 2021 to better meet the policy needs of the sponsoring partner Innovation, Science and Economic Development Canada (ISED) and to better align with surveys conducted by other national statistical offices. Subject matter experts, private businesses and external stakeholders were also consulted during the content development process.
Cognitive testing of the questionnaire content was carried in conjunction with the Questionnaire Design Resource Center based at Statistics Canada in both official languages. For the 2019 iteration, the entire questionnaire was tested through one-on-one interviews with potential respondents that took place in Gatineau, Montreal and Toronto. For the 2021 iteration, the revised content was tested through one-on-one telephone interviews with potential respondents. For the 2023 iteration, the revised content was tested through interviews with potential respondents over video call. The resulting comments and analysis of these interviews led to revisions of the questionnaire to make the questions more relevant to respondents and easier to answer.
Sampling
This is a sample survey with a cross-sectional design.
The survey frame was constructed by selecting all enterprises from the BR that met the definition of the target population. There were 752,587 enterprises within the target population in August 2023.
Enterprises were stratified by industry and two sets of size categories to meet the data needs of both domestic and international stakeholders. The size categories are based on the number of employees of the enterprise. The first set of size categories uses the following definitions: very small (1 to 4 employees), small (5-19 employees), medium (20 to 99 employees outside the manufacturing sector and 20 to 499 employees in the manufacturing sector), and large (100 or more employees outside the manufacturing sector and 500 or more employees in the manufacturing sector). The second set of size categories uses the following definitions: very small (1 to 9 employees) small (10-49 employees), medium (50-249 employees), and large (250 or more employees).
The overall size of the survey sample was determined based on the following:
- An expected standard error of 6.5% for the two sets of overlapping domains (NAICS level by size category for the two sets of size definitions) for a reported proportion of 50%;
- NAICS levels specified at the NAICS2 for all sectors 21, 22, 23, 31, 32, 33, 41, 44, 45, 48, 49, 51, 52, 53, 54, 55, 56, 61, 62, 71, 72 and 81;
- A response rate of 65%.
A proportional allocation of the sample was done in each of the stratum defined by the NAICS level, and the sets of size categories. The probability of a particular enterprise being selected in the survey sample is determined by the number of enterprises selected in the stratum and the population size of the stratum.
The final sample size was 9,705 enterprises.
Data sources
Data collection for this reference period: 2023-11-20 to 2024-03-28
Responding to this survey is mandatory.
Data are collected directly from survey respondents.
Data are collected through an electronic questionnaire. Businesses are initially contacted by telephone during a pre-contact phase to identify an appropriate contact within the enterprise to respond to the survey.
Follow-up because of non-response, inconsistent or missing data is done by phone on a priority basis.
View the Questionnaire(s) and reporting guide(s) .
Error detection
Error detection is an integral part of both collection and data processing activities. Automated edits are applied to data records during collection to identify and correct reporting errors.
The processing phase of the survey was for the most part concerned with applying edits for consistency and validity to each record. Consistency edits ensure that data reported in one question does not contradict information reported in another question. Validity edits ensure that the data reported is valid (e.g., percentages do not exceed 100%).
Outlier detection checks were also conducted for key variables during data processing. Some outliers that could not be validated were replaced by imputed values.
Imputation
The imputation of non-responses and erroneous records was performed using the nearest neighbour donor imputation procedure in the BANFF generalized system. This procedure uses a nearest neighbour approach to find, for each record requiring imputation, a valid record that is most similar to it and that will ensure that the imputed record does not violate any of the logical flows and input restrictions of the questionnaire.
Similar records are found by defining imputation classes which take into account other variables that are correlated with the missing or erroneous values. When a nearest neighbour cannot be found for some records in the most specific imputation classes, the definition of the imputation classes are expanded and the data are reprocessed. This imputation processing continues using a predetermined sequence until nearest neighbour donors are assigned to all records requiring imputation.
Estimation
The response values for sampled units were multiplied by a final sampling weight in order to provide an estimate for the entire population. The final weight was calculated using several factors, including the probability for a unit to be selected in the sample and an adjustment to represent the units that could not be contacted or that refused to respond. Using a statistical technique called calibration, the final set of weights was adjusted in such a way that the sample represents as closely as possible the entire population.
Sampling error was measured by the standard error (SE) for proportions and by the coefficient of variation (CV) for numerical responses. These measures represent the proportion of the estimate that comes from the variability associated with sampling. The SEs and CVs were calculated and are indicated in the data tables by quality flags.
Quality evaluation
Prior to the data release, combined survey results are analyzed for comparability. This analysis includes a detailed review of:
- individual responses (especially for the largest organizations);
- coherence with results from other surveys and studies related to digital technologies and Internet use, including previous iterations of the survey; and
- information from other external sources (e.g. annual reports, news articles).
Disclosure control
Statistics Canada is prohibited by law from releasing any information it collects that could identify any person, business, or organization, unless consent has been given by the respondent or as permitted by the Statistics Act. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.
In order to prevent any data disclosure, confidentiality analysis is done using the Statistics Canada Generalized Disclosure Control System (G-Confid). G-Confid is used for primary suppression (direct disclosure) as well as for secondary suppression (residual disclosure). Direct disclosure occurs when the value in a tabulation cell is composed of or dominated by few enterprises while residual disclosure occurs when confidential information can be derived indirectly by piecing together information from different sources or data series.
Revisions and seasonal adjustment
This methodology does not apply to this survey.
Data accuracy
Errors may occur for various reasons during the collection and processing of the data. For example, non-response is one possible source of error. Under or over-coverage of the population, differences in interpretation of questions and mistakes in recording and processing data are other possible errors. To the maximum extent possible, these errors are minimized through careful design of the survey questionnaire and verification of the survey data.
The data accuracy indicators used for the SDTIU are the standard error and the coefficient of variation. The standard error is a commonly used statistical measure indicating the error of an estimate associated with sampling. The coefficient of variation is the standard error expressed as a percentage of the estimate.
Data quality indicators for the survey are based on the standard error (SE) or the coefficient of variation (CV), and the imputation rates. Quality indicators indicate the following for SE: A is excellent (SE up to 2.5%); B is very good (SE 2.5% up to 5.0%); C is good (SE 5.0% up to 7.5%); D is acceptable (SE 7.5% up to 10.0%); E is use with caution (SE 10.0% up to 12.5%); and F is too unreliable to be published (SE 12.5% or higher). Quality indicators indicate the following for CV: A is excellent (CV up to 5%); B is very good (CV 5% up to 10%); C is good (CV 10% up to 15%); D is acceptable (CV 15% up to 20%); E is use with caution (CV 20% up to 25%); and F is too unreliable to be published (SE 25% or higher).
Response rates:
The response rate at the estimation phase was 61%.
Non-response bias:
In addition to increasing variance, non-response can result in biased estimates if non-respondents have different characteristics from respondents. Non-response is addressed through survey design, respondent follow-up, reweighting, and verification and validation of microdata. Other indicators of quality such as the response rate are also provided.
Coverage error:
The Business Register was used as the frame.
- Date modified: