When you keep in mind that all Census data is derived from the questions asked of the population during the Census-taking process, it is obvious that the inclusion or exclusion of a particular question will have a great effect on the data available for that particular year.
For example, the 2000 Census is the first one which asked a question about grandparents as primary caretakers of grandchildren. Therefore, this 2000 data cannot be compared with data from earlier years, at least not with Census data. (Other organizations may have information on this subject from surveys or other means.) Conversely, the 2000 Census did not ask about the source of a household's water, a question which had appeared on previous Census questionnaires. Therefore data on this topic from a previous Census cannot be compared with 2000 data, since there is no 2000 data (from the Census, at least) on this topic.
Which questions are asked (or not) is largely a matter of Congressional mandate or executive agency requirements, making the seemingly arcane and pedestrian subject of Census methodology subject to all the whims of political maneuvering and varying public opinion.
To complicate matters further, other factors than simply the presence or absence of a question can affect comparability. For example, questions may be switched from the 100% section of the Census questionnaire to the sample section, and vice versa. Some basic questions (age, race, sex) are asked of all residents, while approximately one-sixth of the households in the country, through the 2000 Census, were asked a set of much more detailed questions (income, education level, etc.). The Census Bureau will report data for all of these items, of course, but in comparing one set of data to another, users should be sensitive to whether the data came from a 100% or a sample question.
Note that the 2010 Census did not ask more detailed questions of a sample of the population, but rather refers to the American Community Survey, a continuous sample survey, to provide this kind of data. Therefore comparisons of pre-2010 sample data and the same sorts of information post-2010 will become even more complicated.
Differences in the questions themselves are not the only comparability problem stemming from the actual Census questionnaires. The answers may also be different from Census to Census. Some Census questions leave space for open-ended answers, but for most of them, respondents must choose one (or more) answers from several options.
Since the responses are constrained in this way, the data will be similarly constrained, and these constraints might have differed over time. For example, the answers to a question about disability were greatly expanded in the 2000 Census, in order to gather more detailed information about the nature of the disabled population. The total numbers of disabled people as reported in 2000 should be comparable to total numbers from previous Censuses. But any details on the nature or extent of disabilities will not be.
While not as challenging as comparability problems arising from differences in questions asked, geographical changes can be very important nonetheless.
Metropolitan areas. As you can imagine, in the ten years between Censuses, new cities will attain Metropolitan Area status (although the changing definitions applied to urban areas account for some of the differences, too. And the boundaries of MAs will shift considerably over time as well, making comparability of data for some applications difficult.
Even such supposedly stable geographic entities as counties can be changed over time.
The two main reasons for difficulty in comparing statistics from one Census to another are:
The problems arising from differences in questions are the most complicated and are discussed in the box below. Some issues to be aware of in dealing with geographic changes are presented in the box to the left.