Nature Communications (Feb 2024)

Geographic pair matching in large-scale cluster randomized trials

  • Benjamin F. Arnold,
  • Francois Rerolle,
  • Christine Tedijanto,
  • Sammy M. Njenga,
  • Mahbubur Rahman,
  • Ayse Ercumen,
  • Andrew Mertens,
  • Amy J. Pickering,
  • Audrie Lin,
  • Charles D. Arnold,
  • Kishor Das,
  • Christine P. Stewart,
  • Clair Null,
  • Stephen P. Luby,
  • John M. Colford,
  • Alan E. Hubbard,
  • Jade Benjamin-Chung

DOI
https://doi.org/10.1038/s41467-024-45152-y
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Cluster randomized trials are often used to study large-scale public health interventions. In large trials, even small improvements in statistical efficiency can have profound impacts on the required sample size and cost. Location integrates many socio-demographic and environmental characteristics into a single, readily available feature. Here we show that pair matching by geographic location leads to substantial gains in statistical efficiency for 14 child health outcomes that span growth, development, and infectious disease through a re-analysis of two large-scale trials of nutritional and environmental interventions in Bangladesh and Kenya. Relative efficiencies from pair matching are ≥1.1 for all outcomes and regularly exceed 2.0, meaning an unmatched trial would need to enroll at least twice as many clusters to achieve the same level of precision as the geographically pair matched design. We also show that geographically pair matched designs enable estimation of fine-scale, spatially varying effect heterogeneity under minimal assumptions. Our results demonstrate broad, substantial benefits of geographic pair matching in large-scale, cluster randomized trials.