PLoS ONE (Jan 2020)

A machine learning algorithm to increase COVID-19 inpatient diagnostic capacity.

  • David Goodman-Meza,
  • Akos Rudas,
  • Jeffrey N Chiang,
  • Paul C Adamson,
  • Joseph Ebinger,
  • Nancy Sun,
  • Patrick Botting,
  • Jennifer A Fulcher,
  • Faysal G Saab,
  • Rachel Brook,
  • Eleazar Eskin,
  • Ulzee An,
  • Misagh Kordi,
  • Brandon Jew,
  • Brunilda Balliu,
  • Zeyuan Chen,
  • Brian L Hill,
  • Elior Rahmani,
  • Eran Halperin,
  • Vladimir Manuel

DOI
https://doi.org/10.1371/journal.pone.0239474
Journal volume & issue
Vol. 15, no. 9
p. e0239474

Abstract

Read online

Worldwide, testing capacity for SARS-CoV-2 is limited and bottlenecks in the scale up of polymerase chain reaction (PCR-based testing exist. Our aim was to develop and evaluate a machine learning algorithm to diagnose COVID-19 in the inpatient setting. The algorithm was based on basic demographic and laboratory features to serve as a screening tool at hospitals where testing is scarce or unavailable. We used retrospectively collected data from the UCLA Health System in Los Angeles, California. We included all emergency room or inpatient cases receiving SARS-CoV-2 PCR testing who also had a set of ancillary laboratory features (n = 1,455) between 1 March 2020 and 24 May 2020. We tested seven machine learning models and used a combination of those models for the final diagnostic classification. In the test set (n = 392), our combined model had an area under the receiver operator curve of 0.91 (95% confidence interval 0.87-0.96). The model achieved a sensitivity of 0.93 (95% CI 0.85-0.98), specificity of 0.64 (95% CI 0.58-0.69). We found that our machine learning algorithm had excellent diagnostic metrics compared to SARS-CoV-2 PCR. This ensemble machine learning algorithm to diagnose COVID-19 has the potential to be used as a screening tool in hospital settings where PCR testing is scarce or unavailable.