Canadian Housing Statistics Program (CHSP)
Detailed information for 2019
The Canadian Housing Statistics Program is an innovative data project that leverages existing data sources and transforms them into new and timely indicators on Canadian housing.
Data release - May 22, 2020 (New Brunswick data); October 28, 2020 (Nova Scotia, Ontario and British Columbia data)
Statistics Canada was mandated to create a residential property database: a comprehensive repository of data that covers numerous aspects of the housing sector. The database, under the responsibility of the Canadian Housing Statistics Program (CHSP), will ultimately include all residential properties in Canada and their owners.
The CHSP residential property database was developed by combining data from multiple sources (e.g., property assessment rolls, land titles, Census of Population, etc.) and provides detailed information at the property and owner levels.
The database, initialized in 2017, continues to be expanded with new geographies and variables and is expected to contain information for all properties in every census subdivision nationwide by December 2022.
Collection period: Ongoing
- Housing and dwelling characteristics
- Rental and leasing and real estate
Data sources and methodology
At this time, the CHSP data file consists of a complete list of residential properties and residential property owners in the provinces of Nova Scotia, New Brunswick, Ontario and British Columbia, with varying data vintages to the extent that data have been made available by data providers.
The CHSP database does not currently contain information about non-residential properties, residential properties on Indian reserves, or collective dwellings (e.g., nursing homes, jails or staff residences). Properties with mixed residential and non-residential portions are included, but the property characteristics reported in the CHSP reflect only the residential portions of mixed properties.
This methodology type does not apply to this statistical program.
This methodology type does not apply to this statistical program.
Data are extracted from administrative files.
The CHSP leverages existing data from provincial and territorial land registries, property assessment programs and other administrative data files to create a database of all residential properties in Canada.
Property-level data are obtained from land registries and property assessment programs. Owner-level information is also derived from land registries and property assessment programs, and a variety of owner characteristics are linked from tax data, the Business Register, the Census of Population, and the Longitudinal Immigration Database. This owner information is supplemented with indicators of residency in the economic territory of Canada, which are obtained by linkage to various data sources, including tax and the Census of Population data.
The municipalities covered in the source data are assigned to census subdivisions (CSD) which are updated on a yearly basis by Statistics Canada's Standard Geographical Classification System. Some CSD types are out of scope, such as Indian Reserves. Values for such CSDs are not part of the estimates.
The record linkage process is implemented using custom software developed at Statistics Canada. G-Link, part of Statistics Canada's suite of generalized systems, was used to perform probabilistic record linkage, while SAS and Mix-Match software were used to perform deterministic linkage.
A range of data sources are used to determine whether or not property owners are residents of Canada. Key amongst these factors is linking an owner to recent Canadian tax data activity; when linking to tax data is successful, an owner is highly likely to be considered a resident of Canada. However, additional criteria such as an indication of emigration from Canada to a foreign country, or a lack of presence on the last Canadian Census of Population may conversely lead to an owner being designated as a non-resident of Canada.
Data for a given reference year reflect the stock of properties available on the property assessment roll in each province or territory for that year. Each assessment agency applies its own reference date for the creation of municipal assessment rolls. "Assessment value" refers to the assessed value of the property for the purpose of determining property taxes. It is important to note that the assessed value does not necessarily represent the market value.
Concepts and terminology used to describe properties are distinct to each jurisdiction, and CHSP harmonizes these differences as much as possible.
All microdata records contained in the CHSP are verified in order to identify possible errors (e.g., outliers, unexpected values or formatting issues). Validation edits are used to verify that each field contains values that fall within the allowable range for that data element. Correlation edits are used to check the compatibility of different data elements within a record.
Data abnormalities are resolved in collaboration with data providers and by comparing aggregated values available from alternate sources like the Census of Population and tax data.
The CHSP estimates undergo various levels of error detection from internal checks during data production to post development sampling for detection of linkage errors. Data providers are extensively consulted with respect to the concepts and any data abnormalities pertaining to externally obtained files.
Imputation was performed on a subset of the Nova Scotia, Ontario and British Colombia properties to fill missing data on living area.
Estimation methodology is not currently required.
A number of strategies have been developed and implemented to assess data quality and to minimize errors.
The contents of administrative databases containing property information are compared between vintages to ensure consistency over time.
Steps were taken to consolidate and standardize variables originating from various data sources to achieve the best possible matches between records.
The linkage results are extensively reviewed during the linkage process to ensure that the methods used are correct and appropriate. Furthermore, samples of linked records are manually reviewed and estimates of linkage error rates are calculated to ensure that linkages are of high quality.
Linkage quality varies among the provinces and territories as a result of the prevalence of common names and the presence of non-civic addresses such as post office boxes in the source data. The variance in quality for linkage can impact some indicators which are derived from these linked data sets, such as residency ownership and property use.
Other minor data quality issues can also affect linkage quality and linkage quality impacts some derived variables more than others. Although the quality estimates for most variables are very strong, the derived non-resident ownership rate in particular is impacted by variation in linkage quality.
The indicator on property use is also impacted by variation in linkage quality. It is determined by a methodology relying on a range of data, particularly civic-style address data used in an algorithm to link between an owner's property address and stated address of residence. This indicator is not available in some areas which lack civic-style addresses.
Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge and the consent in writing of that person, business or organization. Various confidentiality protections are applied to all data published to prevent the disclosure of any information deemed confidential. As necessary, data are suppressed or rounded to prevent direct or residual disclosure of identifiable data.
The use of the CHSP data is subject to Statistics Canada's privacy and confidentiality constraints to prevent the disclosure of personal information.
Revisions and seasonal adjustment
As the CHSP is a program in development, published data may be subject to revision. When data are released for the first time for a given jurisdiction, the data are considered experimental for that reference year. Consistency and coherence edits may occur when subsequent vintages are released for those new jurisdictions.
Since each Canadian municipality, province or territory has a legislated responsibility for property monitoring and assessment, completeness of the administrative data provided by external sources is considered relatively good.
The CHSP database reflects the current content of the external data provider's registry of residential properties as of the date of extraction, which varies by province and territory.
The CHSP assigns properties to a geographic location using data from property assessment rolls.
Initial investigations are performed to ensure that all properties on the data files are unique. Through internal linkages, duplicate records are identified and then suppressed if owners are listed twice for the same property.
Undercoverage of residential properties may exist for a variety of reasons. For example, properties undergoing unreported changes between assessment periods (e.g., new constructions, demolitions or improvements performed without a building permit) are not captured in the assessment values.
There are no coefficient of variation from sampling as the CHSP is a census of all residential properties in Canada, with data for each province and territory to be added as they become available.
The CHSP is an innovative data project that utilizes new techniques in linkage and processing that may be refined over time leading to improvement in the accuracy and precision of data that is released. The first year of data for each province and territory should be considered preliminary results and may contain a precocity error which may be corrected in future data releases.
The CHSP data are used to produce annual estimates.
- Reference years of the property stock and assessment values, by province