National ZIP Code Crosswalk (1990-2020)
- URL
- https://www.icpsr.umich.edu/web/ICPSR/studies/39431
- Description
ZIP Codes are administrative codes generated by the United States Postal Service (USPS) that refer to the geographic area covered by a specific set of mail delivery routes. The U.S. Census Bureau calculates and distributes aggregated social, economic, and demographic information for the population associated with "ZIP Code Tabulation Areas" (ZCTAs), which are roughly analogous to ZIP Codes and serve as identifiers for specific neighborhoods and communities. These aggregated census data, however, are unable to account for changes in ZIP Code boundaries that occur between decennial censuses, leading to measurement error and missing data problems for scholars who attempt to use the aggregated ZCTA data. The purpose of this crosswalk file is to allow researchers to overcome this limitation, enabling them to appropriately link spatial reference information (ZIP Codes) with characteristics of the populations to which they refer.
Most ZIP Codes do not change boundaries in a decade, but a large enough percentage do as to create a problem with missing or mis-specified data. Boundary changes typically involve one or more of the following three processes, although a small number of cases do not conform to these typologies: (1) two or more existing ZIP Codes are combined to create a single surviving ZIP Code, (2) an existing ZIP Code is divided into multiple resulting ZIP Codes, and (3) boundaries between two or more existing ZIP Codes are altered.
Each of these types of changes alters the geographic area that a ZIP Code refers to, and as such, the spatial unit identified by the ZIP Code includes a different population, with a different array of characteristics. By linking the spatial units associated with ZIP Codes as these boundary changes are enacted, the research team can both prevent the loss of observations due to missing data, and more accurately measure social, demographic, and economic characteristics associated with each ZIP Code.
This data set identifies changes in ZIP Code boundaries between 1990 and 2020, and provides numeric codes that cluster the ZIP Codes into the smallest geographic unit, or group of ZIP Codes, that are consistent across a decade: 1990 - 2000, 2000 - 2010, and 2010 - 2020. This "crosswalk" covers the contiguous United States, Alaska, Hawaii, and the District of Columbia. Since much administrative data is available with ZIP Code as the smallest identifiable geography, ZIP Codes are often used to embed observations from administrative data (patients, businesses, survey respondents, etc.) within their social, demographic, and economic contexts. However, ZIP Code boundaries change over time, resulting in measurement error (matching observations to the wrong contextual unit) or missing data (due to an observation reporting a ZIP Code that did not exist at the beginning of the observational period). These data were collected, and the crosswalk created, in an attempt to resolve these data quality issues.
- Sample
- Format
- Series - completed
- Title
- National ZIP Code Crosswalk (1990-2020)
- Format
- Series - completed