Demographic Research (May 2019)

IRS county-to-county migration data, 1990‒2010

  • Mathew Hauer,
  • James Byars

DOI
https://doi.org/10.4054/DemRes.2019.40.40
Journal volume & issue
Vol. 40
p. 40

Abstract

Read online

Background: The county-to-county migration data of the Internal Revenue Service's (IRS) is an incredible resource for understanding migration in the United States. Produced annually since 1990 in conjunction with the US Census Bureau, the IRS migration data represents 95Š to 98Š of the tax-filing universe and their dependents, making the IRS migration data one of the largest sources of migration data. However, any analysis using the IRS migration data must process at least seven legacy formats of this public data across more than 2000 data files - a serious burden for migration scholars. Objective: To produce a single, flat data file containing complete county-to-county IRS migration flow data and to make the computer code to process the migration data freely available. Methods: This paper uses R to process more than 2,000 IRS migration files into a single, flat data file for use in migration research. Contribution: To encourage and facilitate the use of this data, we provide a single, standardized, flat data file containing county-to-county one-year migration flows for the period 1990-2010 (containing 163,883 dyadic county pairs resulting in 3.2 million county-year observations totaling over 343 million migrants) and provide the full R script to download, process, and flatten the IRS migration data.