Characterising information gains and losses when collecting multiple epidemic model outputs
Katharine Sherratt,
Ajitesh Srivastava,
Kylie Ainslie,
David E. Singh,
Aymar Cublier,
Maria Cristina Marinescu,
Jesus Carretero,
Alberto Cascajo Garcia,
Nicolas Franco,
Lander Willem,
Steven Abrams,
Christel Faes,
Philippe Beutels,
Niel Hens,
Sebastian Müller,
Billy Charlton,
Ricardo Ewert,
Sydney Paltra,
Christian Rakow,
Jakob Rehmann,
Tim Conrad,
Christof Schütte,
Kai Nagel,
Sam Abbott,
Rok Grah,
Rene Niehus,
Bastian Prasse,
Frank Sandmann,
Sebastian Funk
Affiliations
Katharine Sherratt
London School of Hygiene & Tropical Medicine, London, UK; Corresponding author.
Ajitesh Srivastava
University of Southern California, Los Angeles, USA
Kylie Ainslie
Dutch National Institute of Public Health and the Environment (RIVM), Bilthoven, Netherlands; School of Public Health, University of Hong Kong, Hong Kong Special Administrative Region
David E. Singh
Universidad Carlos III de Madrid, Madrid, Spain
Aymar Cublier
Universidad Carlos III de Madrid, Madrid, Spain
Maria Cristina Marinescu
Barcelona Supercomputing Center, Barcelona, Spain
Jesus Carretero
Universidad Carlos III de Madrid, Madrid, Spain
Alberto Cascajo Garcia
Universidad Carlos III de Madrid, Madrid, Spain
Nicolas Franco
University of Namur, Namur, Belgium
Lander Willem
University of Antwerp, Antwerp, Belgium
Steven Abrams
University of Antwerp, Antwerp, Belgium; UHasselt, Hasselt, Belgium
Christel Faes
UHasselt, Hasselt, Belgium
Philippe Beutels
University of Antwerp, Antwerp, Belgium
Niel Hens
University of Antwerp, Antwerp, Belgium; UHasselt, Hasselt, Belgium
Sebastian Müller
Technische Universität Berlin, Berlin, Germany
Billy Charlton
Technische Universität Berlin, Berlin, Germany
Ricardo Ewert
Technische Universität Berlin, Berlin, Germany
Sydney Paltra
Technische Universität Berlin, Berlin, Germany
Christian Rakow
Technische Universität Berlin, Berlin, Germany
Jakob Rehmann
Technische Universität Berlin, Berlin, Germany
Tim Conrad
Zuse Institute Berlin (ZIB), Berlin, Germany
Christof Schütte
Zuse Institute Berlin (ZIB), Berlin, Germany
Kai Nagel
Technische Universität Berlin, Berlin, Germany
Sam Abbott
London School of Hygiene & Tropical Medicine, London, UK
Rok Grah
ECDC, Stockholm, Sweden
Rene Niehus
ECDC, Stockholm, Sweden
Bastian Prasse
ECDC, Stockholm, Sweden
Frank Sandmann
ECDC, Stockholm, Sweden
Sebastian Funk
London School of Hygiene & Tropical Medicine, London, UK
Background: Collaborative comparisons and combinations of epidemic models are used as policy-relevant evidence during epidemic outbreaks. In the process of collecting multiple model projections, such collaborations may gain or lose relevant information. Typically, modellers contribute a probabilistic summary at each time-step. We compared this to directly collecting simulated trajectories. We aimed to explore information on key epidemic quantities; ensemble uncertainty; and performance against data, investigating potential to continuously gain information from a single cross-sectional collection of model results. Methods: We compared projections from the European COVID-19 Scenario Modelling Hub. Five teams modelled incidence in Belgium, the Netherlands, and Spain. We compared July 2022 projections by incidence, peaks, and cumulative totals. We created a probabilistic ensemble drawn from all trajectories, and compared to ensembles from a median across each model’s quantiles, or a linear opinion pool. We measured the predictive accuracy of individual trajectories against observations, using this in a weighted ensemble. We repeated this sequentially against increasing weeks of observed data. We evaluated these ensembles to reflect performance with varying observed data. Results: By collecting modelled trajectories, we showed policy-relevant epidemic characteristics. Trajectories contained a right-skewed distribution well represented by an ensemble of trajectories or a linear opinion pool, but not models’ quantile intervals. Ensembles weighted by performance typically retained the range of plausible incidence over time, and in some cases narrowed this by excluding some epidemic shapes. Conclusions: We observed several information gains from collecting modelled trajectories rather than quantile distributions, including potential for continuously updated information from a single model collection. The value of information gains and losses may vary with each collaborative effort’s aims, depending on the needs of projection users. Understanding the differing information potential of methods to collect model projections can support the accuracy, sustainability, and communication of collaborative infectious disease modelling efforts.