Financial Data and Charitable Donations, Preliminary T1 Family File

Detailed information for 2015

Status:

Active

Frequency:

Annual

Record number:

4106

This activity is conducted for the development and dissemination of annual small area economic data for Canadians.

Data release - February 22, 2017 (First in a series of releases for this reference period.)

Description

This activity is conducted for the development and dissemination of annual small area economic data for Canadians. The data, collected from income tax returns submitted to the Canada Revenue Agency (CRA), provide financial information (RRSP Contributions, RRSP Room, Savings and Investment Income) and the amount of charitable donations reported on the tax file. The data are available for Canada, the provinces and territories and sub-provincial geographic areas (postal areas and selected Census areas). Data are used by financial institutions and charitable organizations to evaluate contributions and support marketing decisions. Academics and researchers use the data for analyses of economic conditions.

Reference period: Calendar year "y" for income and contributions, end of calendar year "y" for age, point in time (usually April of calendar year "y+1") for address information.

Collection period: Income tax returns are filed mainly in the spring following the year of reference. The preliminary T1 file for income year "y" is received from the Canada Revenue Agency (CRA) in September or October of the year "y+1".

Subjects

  • Household, family and personal income
  • Household spending and savings
  • Income, pensions, spending and wealth
  • Pension plans and funds and other retirement income programs

Data sources and methodology

Target population

These data cover all persons who completed a T1 tax return for the year of reference by the date the file was copied for Statistics Canada. This is a preliminary version of the T1 file and therefore the file is missing a certain amount of late tax filers.

Instrument design

This methodology type does not apply to this statistical program.

Sampling

No sampling is done for this statistical program.

Data sources

Data are extracted from administrative files.

The individual T1 tax file is received from the Canada Revenue Agency in early fall following the taxation year. This is a preliminary version of the T1 file and therefore this file is missing a certain amount of late taxfilers. The input file contains records for 26.2 million unique individuals for the 2015 tax year. Taxfilers who died within the year are not counted.

The period of income is the calendar year.

Error detection

During data processing, there is a combination of automated and manual editing. Some variables with a value of 1 (a type of flag for the Canada Revenue Agency) are converted to zero and variables with values above their absolute maximum are corrected automatically. Those with outliers are identified then examined and those identified as erroneous are corrected manually.

Imputation

This methodology type does not apply to this statistical program.

Estimation

The data are aggregated to approximate the standard geographic areas of Statistics Canada. Census metropolitan areas (CMAs) and census agglomerations (CAs) are areas consisting of one or more neighbouring municipalities situated around a major urban core. A CMA must have a total population of at least 100,000 of which 50,000 or more live in the urban core. A CA must have an urban core population of at least 10,000.

Other levels of postal and census geography are also available.

When performing calculations, Canada Revenue Agency (CRA) tax rules are used.

Since this data is based on the entire preliminary T1 file, and is not a sample, data is left unweighted and unadjusted.

Quality evaluation

The estimates are evaluated in several ways:

1. The geography is evaluated by comparing the number of taxfilers and dependents with population estimates from Statistics Canada for the same areas.
2. The demographic information is evaluated in much the same way - by comparisons with estimates from Statistics Canada for the same areas.
3. The income information is evaluated by trend analysis, by comparing to both the preliminary and final T1 files from the previous year, and by comparisons with data from Canadian Income Survey (CIS) whenever possible.
4. When Census or National Household Survey data are available, many comparisons are made -- population, income and demographics.
5. In addition, comparisons are made for income of individuals with annual income data produced by CRA.

Disclosure control

Statistics Canada is prohibited by law from releasing any information it collects which could identify any person, business, or organization, unless consent has been given by the respondent or as permitted by the Statistics Act. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential.

Only a small group of people within the Income Statistics Division of Statistics Canada have access to confidential data. Users must specify their requirements to these people who then carry out the retrievals. Before release, data are subjected to stringent non-disclosure practices:

1. There must be a minimum of 100 taxfilers in any geographic area before any data will be produced.
2. Any cell must represent a minimum of 15 taxfilers, otherwise it is suppressed.
3. Each cell which can be dominated by one tax filer (or one family) is checked for dominance and suppressed if a problem is identified.
4. Once the primary suppressions are made, complementary suppressions are made so that suppressed information cannot be discovered residually. This is an iterative process - each complementary suppression may require an additional complementary suppression. Patterns are created to keep these to a minimum.
5. Finally, the counts and amounts are rounded - counts to the nearest ten, aggregate amounts to the nearest $5,000 and distribution measures such as percentiles to the nearest $10.
6. Averages and percentages are based on rounded counts and amounts to prevent the unravelling of non-disclosure procedures.

Revisions and seasonal adjustment

Once the data are finalized, they are not revised. For analyses, data are sometimes adjusted to constant dollars for comparison with data from other years, but only current dollars are kept on the file.

Data accuracy

The data for these products are derived from an early file from the Canada Revenue Agency. They benefit from timeliness, but lose some accuracy because of it. This preliminary T1 tax file contains about 97% of the records on the file received four to five months later.

The data are unadjusted apart from editing and estimation of missing components to achieve a definition of income that is closer to Statistics Canada's definition of income. There are no coefficients of variation from sampling, as the population studied is nearly a census of filers and the data are neither weighted nor adjusted to compensate for the earliness of the file.

Documentation

Date modified: