iScience (Nov 2024)

Federated difference-in-differences with multiple time periods in DataSHIELD

  • Manuel Huth,
  • Carolina Alvarez Garavito,
  • Lea Seep,
  • Laia Cirera,
  • Francisco Saúte,
  • Elisa Sicuri,
  • Jan Hasenauer

Journal volume & issue
Vol. 27, no. 11
p. 111025

Abstract

Read online

Summary: Difference-in-differences (DID) is a key tool for causal impact evaluation but faces challenges when applied to sensitive data restricted by privacy regulations. Obtaining consent can shrink sample sizes and reduce statistical power, limiting the analysis’s effectiveness. Federated learning addresses these issues by sharing aggregated statistics rather than individual data, though advanced federated DID software is limited. We developed a federated version of the Callaway and Sant’Anna difference-in-differences (CSDID), integrated into the DataSHIELD platform, adhering to stringent privacy protocols. Our approach reproduces key estimates and standard errors while preserving confidentiality. Using simulated and real-world data from a malaria intervention in Mozambique, we demonstrate that federated estimates increase sample sizes, reduce estimation uncertainty, and enable analyses when data owners cannot share treated or untreated group data. Our work contributes to facilitating the evaluation of policy interventions or treatments across centers and borders.

Keywords