Geospatial Health (Nov 2012)

A multifaceted comparison of ArcGIS and MapMarker for automated geocoding

  • Sanjaya Kumar,
  • Ming Liu,
  • Syni-An Hwang

DOI
https://doi.org/10.4081/gh.2012.113
Journal volume & issue
Vol. 7, no. 1
pp. 145 – 151

Abstract

Read online

Geocoding is increasingly being used for public health surveillance and spatial epidemiology studies. Public health departments in the United States of America (USA) often use this approach to investigate disease outbreaks and clusters or assign health records to appropriate geographic units. We evaluated two commonly used geocoding software packages, ArcGIS and MapMarker, for automated geocoding of a large number of residential addresses from health administrative data in New York State, USA to better understand their features, performance and limitations. The comparison was based on three metrics of evaluation: completeness (or match rate), geocode similarity and positional accuracy. Of the 551,798 input addresses, 318,302 (57.7%) were geocoded by MapMarker and 420,813 (76.3%) by the ArcGIS composite address locator. High similarity between the geocodes assigned by the two methods was found, especially in suburban and urban areas. Among addresses with a distance of greater than 100 m between the geocodes assigned by the two packages, the point assigned by ArcGIS was closer to the associated parcel centroid (“true” location) compared with that assigned by MapMarker. In addition, the composite address locator in ArcGIS allows users to fully utilise available reference data, which consequently results in better geocoding results. However, the positional differences found were minimal, and a large majority of addresses were placed on the same locations by both geocoding packages. Using both methods and combining the results can maximise match rates and save the time needed for manual geocoding.

Keywords