Clinical Epidemiology (Jun 2020)
MultiCenter Interrupted Time Series Analysis: Incorporating Within and Between-Center Heterogeneity
Abstract
Joycelyne E Ewusie,1,2 Lehana Thabane,1 Joseph Beyene,1 Sharon E Straus,3 Jemila S Hamid1,2,4 1Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada; 2School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada; 3Li Ka Shing Knowledge Institute of St Michael’s Hospital, Toronto, Ontario, Canada; 4Children’s Hospital of Eastern Ontario, Ottawa, Ontario, CanadaCorrespondence: Jemila S HamidChildren’s Hospital of Eastern Ontario, Ottawa, Ontario, CanadaTel +1 613 737-7600 x 4194Email [email protected]: Segmented regression (SR) is the most common statistical method used in the analysis of interrupted time series (ITS) data. However, this modeling strategy is indicated to produce spurious results when applied to aggregated data. For multicenter ITS studies, data at a given time point are often aggregated across different participants and settings; thus, conventional segmented regression analysis may not be an optimal approach. Our objective is to provide a robust method for analysis of ITS data, while accounting for two sources of heterogeneity, between participants and across sites.Methods: We present a methodological framework within the segmented regression modeling strategy, where we introduced weights to account for between-participant variation and the differences across multiple sites. We empirically compared the proposed weighted segmented regression (wSR) with the conventional SR as well as with a previously published pooled analysis method using data from the Mobility of Vulnerable Elders in Ontario (MOVE-ON) project, a multisite ITS study.Results: Overall, the wSR produced the most precise estimates, where they had the narrowest 95% CI, while the conventional SR method resulted in the least precise estimates. Our method also resulted in increased power. The pooled analysis method and the wSR had comparable results when there were ≤ 4 sites included in the overall analysis and when there was moderate to high between-site heterogeneity as measured by the I2 statistic.Conclusion: Incorporating participant-level and site-level variability led to estimates that were more precise and accurate in determining the magnitude of the effect of an intervention and led to increased statistical power. This underscores the importance of accounting for the inherent variability in aggregated data. Extensive simulations are required to further assess the methods in a wide range of scenarios and outcome types.Keywords: aggregated data, weighted segmented regression, pooled analysis, interrupted time series, multisite studies