Financial Data and Charitable Donations, Preliminary T1 Family File

Detailed information for 2007

Status:

Active

Frequency:

Annual

Record number:

4106

Data release - November 4, 2008 (2007 Charitable Donors); November 5, 2008 (2007 RRSP Contributions); November 6, 2008 (2007 Savers, Investors and Investment Income)

Description

This activity is conducted for the development and dissemination of annual small area economic data for Canadians. The data, collected from income tax returns submitted to the Canada Revenue Agency (CRA), provide financial information (RRSP Contributions, RRSP Room, Savings and Investment Income) and the amount of charitable donations reported on the tax file. The data are available for Canada, the provinces and territories and sub-provincial geographic areas (postal areas and selected Census areas). Data are used by financial institutions and charitable organizations to evaluate contributions and support marketing decisions. Academics and researchers use the data for analyses of economic conditions.

Reference period: Calendar year "y" for income and contributions, end of calendar year "y" for age, point in time (usually April of calendar year "y+1") for address information.

Collection period: Income tax returns are filed mainly in the spring following the year of reference. The preliminary T1 file for income year "y" is received from the Canada Revenue Agency (CRA) in September or October of the year "y+1".

Subjects

  • Household, family and personal income
  • Household spending and savings
  • Income, pensions, spending and wealth
  • Pension plans and funds and other retirement income programs

Data sources and methodology

Target population

These data cover all persons who completed a T1 tax return for the year of reference by the date the file was copied for Statistics Canada. Because this is a preliminary version of the T1 file, it does not include some late filers.

Instrument design

This methodology does not apply.

Sampling

This methodology does not apply.

Data sources

Data collection for this reference period: 2007-01-01 to 2007-12-31

Data are extracted from administrative files.

The individual T1 tax file is received from the Canada Revenue Agency. The file is processed over a one-week period to create the standard tabulations. The input files contain records for 23.9 million unique individuals for the 2007 tax year. Taxfilers who died within the year are not counted.

The period of income is the calendar year.

Error detection

Some automatic editing takes place during processing. Variables with a value of 1 (which the CRA uses as a type of flag) are converted to zero. For tax year 2007, for example, about 4.2% of the records had at least one field changed from 1 to 0. In a marginal number of cases, values above certain absolute maxima are corrected, some negative values are changed to positive, and records with outliers in the key fields are analyzed and corrected if required. Apart from the flag conversion, very few records are changed.
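
The editing rules described above can be sketched in Python. This is only an illustration under stated assumptions: the field names, the maxima and the choice to cap values at the maximum are hypothetical, since the actual correction rules are not specified here.

```python
def edit_record(record, flag_fields, nonnegative_fields, maxima):
    """Return an edited copy of one record plus the list of changed fields.

    All field names and limits passed in are hypothetical examples.
    """
    edited = dict(record)
    changed = []

    # A value of 1 is treated as a CRA flag and converted to zero.
    for field in flag_fields:
        if edited.get(field) == 1:
            edited[field] = 0
            changed.append(field)

    # Negative values in selected fields are changed to positive.
    for field in nonnegative_fields:
        value = edited.get(field)
        if value is not None and value < 0:
            edited[field] = -value
            changed.append(field)

    # Values above an absolute maximum are corrected.
    # Assumption: "corrected" is modelled here as capping at the maximum.
    for field, cap in maxima.items():
        value = edited.get(field)
        if value is not None and value > cap:
            edited[field] = cap
            changed.append(field)

    return edited, changed


record = {"rrsp_contribution": 1, "donations": -250, "investment_income": 5_000_000}
edited, changed = edit_record(
    record,
    flag_fields=["rrsp_contribution"],
    nonnegative_fields=["donations"],
    maxima={"investment_income": 1_000_000},
)
```

Returning the list of changed fields makes it possible to count how many records had at least one field edited, which is the kind of figure quoted above for the 1-to-0 conversion.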

Imputation

This methodology does not apply.

Estimation

This methodology type does not apply to this statistical program.

Quality evaluation

The estimates are evaluated primarily by trend analysis.

Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential.

Only a small group of people within the Division have access to confidential data. Users must specify their requirements to these people who then carry out the retrievals. Beginning with the release of 2007 Financial Products in the fall of 2008, new confidentiality measures will be introduced into the products. As a result of these new measures, much more data will be available to clients. Before release, data are subjected to stringent non-disclosure practices:

1. There must be a minimum of 100 taxfilers in any geographic area before any data will be produced.
2. In the case of extremely small counts, a certain distortion will be applied to the corresponding amount and the data are rounded.
3. Each cell which can be dominated by one tax filer (or one family) is checked for dominance. Once a record is identified as dominant, a minimum amount of distortion will be introduced to protect the value of the dominant record. This means that the cell aggregate total will be changed somewhat and marginal and overall table totals will be recalculated.
4. Starting with 2007 data, no complementary suppressions will be done and therefore other data cells will be preserved which were formerly suppressed.
5. Finally, the counts and amounts are rounded -- counts to the nearest ten, aggregate amounts to the nearest $5,000 and distribution measures such as percentiles to the nearest $10.
6. Averages and percentages are based on rounded counts and amounts to prevent the unravelling of non-disclosure procedures.
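
As a rough illustration of rules 1, 5 and 6, the following Python sketch applies the minimum-count threshold and the rounding bases to one geographic area. The function names, the input values and the round-half-up tie rule are assumptions made for the example; the distortion steps in rules 2 and 3 are not modelled.

```python
import math

MIN_TAXFILERS = 100       # rule 1: minimum taxfilers per geographic area
COUNT_BASE = 10           # rule 5: counts to the nearest ten
AMOUNT_BASE = 5000        # rule 5: aggregate amounts to the nearest $5,000
PERCENTILE_BASE = 10      # rule 5: percentiles to the nearest $10


def round_to_nearest(value, base):
    """Round a non-negative value to the nearest multiple of base.

    Assumption: ties are rounded up; the tie rule is not stated in the source.
    """
    return math.floor(value / base + 0.5) * base


def publish_area(count, aggregate_amount, median):
    """Return publishable figures for one area, or None if below the threshold."""
    if count < MIN_TAXFILERS:
        return None
    rounded_count = round_to_nearest(count, COUNT_BASE)
    rounded_amount = round_to_nearest(aggregate_amount, AMOUNT_BASE)
    return {
        "count": rounded_count,
        "amount": rounded_amount,
        "median": round_to_nearest(median, PERCENTILE_BASE),
        # Rule 6: the average is derived from the rounded figures only.
        "average": rounded_amount / rounded_count,
    }


area = publish_area(count=1234, aggregate_amount=12_347_600, median=4_567)
small_area = publish_area(count=99, aggregate_amount=250_000, median=2_000)
```

Computing the average from the rounded count and amount means a user cannot combine the published average with a published total to recover the unrounded figures.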

Revisions and seasonal adjustment

Once the data are finalized, they are not revised. For analyses, data are sometimes adjusted to constant dollars for comparison with data from other years, but only current dollars are kept on the file.

Data accuracy

The data for these products are derived from an early file from the Canada Revenue Agency. They gain timeliness at some cost in accuracy: this preliminary T1 tax file contains about 97% of the records on the file received four to five months later.
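
For a sense of scale, the coverage figure can be turned into an approximate record count. The sketch below assumes that the 23.9 million individuals reported under Data sources refer to the preliminary file and that the 97% figure applies to 2007; both are assumptions, so the result is indicative only.

```python
def implied_final_records(preliminary_records, coverage):
    """Estimate the size of the later file from the preliminary file and its coverage."""
    if not 0 < coverage <= 1:
        raise ValueError("coverage must be a fraction between 0 and 1")
    return preliminary_records / coverage


PRELIMINARY_RECORDS = 23_900_000  # assumed to be the preliminary 2007 file
COVERAGE = 0.97                   # "about 97%" of the later file

implied_final = implied_final_records(PRELIMINARY_RECORDS, COVERAGE)
implied_late = implied_final - PRELIMINARY_RECORDS
print(f"Implied later file: about {implied_final / 1e6:.1f} million records")
print(f"Implied late records: about {implied_late / 1e6:.2f} million")
```

Under these assumptions the later file would hold roughly 24.6 million records, so on the order of 0.7 million late records are absent from the preliminary tabulations.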

The data are unadjusted apart from editing and estimation of missing components to achieve a definition of income that is closer to Statistics Canada's definition of income. There are no coefficients of variation from sampling, as the population studied is nearly a census of filers and the data are neither weighted nor adjusted to compensate for the earliness of the file.
