International Journal of Health Geographics (Oct 2018)

Developing a representative community health survey sampling frame using open-source remote satellite imagery in Mozambique

  • Bradley H. Wagenaar,
  • Orvalho Augusto,
  • Kristjana Ásbjörnsdóttir,
  • Adam Akullian,
  • Nelia Manaca,
  • Falume Chale,
  • Alberto Muanido,
  • Alfredo Covele,
  • Cathy Michel,
  • Sarah Gimbel,
  • Tyler Radford,
  • Blake Girardot,
  • Kenneth Sherr,
  • with input from the INCOMAS Study Team

DOI
https://doi.org/10.1186/s12942-018-0158-4
Journal volume & issue
Vol. 17, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Background Lack of accurate data on the distribution of sub-national populations in low- and middle-income countries impairs planning, monitoring, and evaluation of interventions. Novel, low-cost methods to develop unbiased survey sampling frames at sub-national, sub-provincial, and even sub-district levels are urgently needed. This article details our experience using remote satellite imagery to develop a provincial-level representative community survey sampling frame to evaluate the effects of a 7-year health system intervention in Sofala Province, Mozambique. Methods Mozambique’s most recent census was conducted in 2007, and no data are readily available to generate enumeration areas for representative health survey sampling frames. To remedy this, we partnered with the Humanitarian OpenStreetMap Team to digitize every building in Sofala and Manica provinces (685,189 Sofala; 925,713 Manica) using up-to-date remote satellite imagery, with final results deposited in the open-source OpenStreetMap database. We then created a probability proportional to size sampling frame by overlaying a grid of 2.106 km resolution (0.02 decimal degrees) across each province, and calculating the number of buildings within each grid square. Squares containing buildings were used as our primary sampling unit with replacement. Study teams navigated to the geographic center of each selected square using geographic positioning system coordinates, and then conducted a standard “random walk” procedure to select 20 households for each time a given square was selected. Based on sample size calculations, we targeted a minimum of 1500 households in each province. We selected 88 grids within each province to reach 1760 households, anticipating ongoing conflict and transport issues could preclude the inclusion of some clusters. Results Civil conflict issues forced the exclusion of 8 of 31 subdistricts in Sofala and 15 of 39 subdistricts in Manica. Using Android tablets, Open Data Kit software, and a remote RedCap data capture system, our final sample included 1549 households in Sofala (4669 adults; 4766 children; 33 missing age) and 1538 households in Manica (4422 adults; 4898 children; 33 missing age). Conclusions Other implementation or evaluation teams may consider employing similar methods to track population distributions for health systems planning or the development of representative sampling frames using remote satellite imagery.

Keywords