Scientific Reports (Dec 2023)

Applying a zero-corrected, gravity model estimator reduces bias due to heterogeneity in healthcare utilization in community-scale, passive surveillance datasets of endemic diseases

  • Michelle V. Evans,
  • Felana A. Ihantamalala,
  • Mauricianot Randriamihaja,
  • Andritiana Tsirinomen’ny Aina,
  • Matthew H. Bonds,
  • Karen E. Finnegan,
  • Rado J. L. Rakotonanahary,
  • Mbolatiana Raza-Fanomezanjanahary,
  • Bénédicte Razafinjato,
  • Oméga Raobela,
  • Sahondraritera Herimamy Raholiarimanana,
  • Tiana Harimisa Randrianavalona,
  • Andres Garchitorena

DOI
https://doi.org/10.1038/s41598-023-48390-0
Journal volume & issue
Vol. 13, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Data on population health are vital to evidence-based decision making but are rarely adequately localized or updated in continuous time. They also suffer from low ascertainment rates, particularly in rural areas where barriers to healthcare can cause infrequent touch points with the health system. Here, we demonstrate a novel statistical method to estimate the incidence of endemic diseases at the community level from passive surveillance data collected at primary health centers. The zero-corrected, gravity-model (ZERO-G) estimator explicitly models sampling intensity as a function of health facility characteristics and statistically accounts for extremely low rates of ascertainment. The result is a standardized, real-time estimate of disease incidence at a spatial resolution nearly ten times finer than typically reported by facility-based passive surveillance systems. We assessed the robustness of this method by applying it to a case study of field-collected malaria incidence rates from a rural health district in southeastern Madagascar. The ZERO-G estimator decreased geographic and financial bias in the dataset by over 90% and doubled the agreement rate between spatial patterns in malaria incidence and incidence estimates derived from prevalence surveys. The ZERO-G estimator is a promising method for adjusting passive surveillance data of common, endemic diseases, increasing the availability of continuously updated, high quality surveillance datasets at the community scale.