PLoS Computational Biology (Sep 2021)

An integrated framework for building trustworthy data-driven epidemiological models: Application to the COVID-19 outbreak in New York City.

  • Sheng Zhang,
  • Joan Ponce,
  • Zhen Zhang,
  • Guang Lin,
  • George Karniadakis

DOI
https://doi.org/10.1371/journal.pcbi.1009334
Journal volume & issue
Vol. 17, no. 9
p. e1009334

Abstract

Read online

Epidemiological models can provide the dynamic evolution of a pandemic but they are based on many assumptions and parameters that have to be adjusted over the time the pandemic lasts. However, often the available data are not sufficient to identify the model parameters and hence infer the unobserved dynamics. Here, we develop a general framework for building a trustworthy data-driven epidemiological model, consisting of a workflow that integrates data acquisition and event timeline, model development, identifiability analysis, sensitivity analysis, model calibration, model robustness analysis, and projection with uncertainties in different scenarios. In particular, we apply this framework to propose a modified susceptible-exposed-infectious-recovered (SEIR) model, including new compartments and model vaccination in order to project the transmission dynamics of COVID-19 in New York City (NYC). We find that we can uniquely estimate the model parameters and accurately project the daily new infection cases, hospitalizations, and deaths, in agreement with the available data from NYC's government's website. In addition, we employ the calibrated data-driven model to study the effects of vaccination and timing of reopening indoor dining in NYC.