Humanities & Social Sciences Communications (Jan 2021)

Semi-automatic mapping of pre-census enumeration areas and population sampling frames

  • Sarchil Qader,
  • Veronique Lefebvre,
  • Andrew Tatem,
  • Utz Pape,
  • Kristen Himelein,
  • Amy Ninneman,
  • Linus Bengtsson,
  • Tomas Bird

DOI
https://doi.org/10.1057/s41599-020-00670-0
Journal volume & issue
Vol. 8, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Enumeration Areas (EAs) are the operational geographic units for the collection and dissemination of census data and are often used as a national sampling frame for various types of surveys. In many poor or conflict-affected countries, EA demarcations are incomplete, outdated, or missing. Even for countries that are stable and prosperous, creating and updating EAs is one of the most challenging yet essential tasks in the preparation for a national census. Commonly, EAs are created by manually digitising small geographic units on high-resolution satellite imagery or physically walking the boundaries of units, both of which are highly time, cost, and labour intensive. In addition, creating EAs requires considering population and area size within each unit. This is an optimisation problem that can best be solved by a computer. Here, for the first time, we produce a semi-automatic mapping of pre-defined census EAs based on high-resolution gridded population and settlement datasets and using publicly available natural and administrative boundaries. We demonstrate the approach in generating rural EAs for Somalia where such mapping is not existent. In addition, we compare our automated approach against manually digitised EAs created in urban areas of Mogadishu and Hargeysa. Our semi-automatically generated EAs are consistent with standard EAs, including having identifiable boundaries for field teams to follow on the ground, and appropriate sizing and population for coverage by an enumerator. Furthermore, our semi-automated urban EAs have no gaps, in contrast, to manually drawn urban EAs. Our work shows the time, labour and cost-saving value of automated EA delineation and points to the potential for broadly available tools suitable for low-income and data-poor settings but applicable to potentially wider contexts.