PLoS ONE (Jan 2023)

Machine learning to predict risk for community-onset Staphylococcus aureus infections in children living in southeastern United States.

  • Xiting Lin,
  • Ruijin Geng,
  • Kurt Menke,
  • Mike Edelson,
  • Fengxia Yan,
  • Traci Leong,
  • George S Rust,
  • Lance A Waller,
  • Erica L Johnson,
  • Lilly Cheng Immergluck

DOI
https://doi.org/10.1371/journal.pone.0290375
Journal volume & issue
Vol. 18, no. 9
p. e0290375

Abstract

Read online

Staphylococcus aureus (S. aureus) is known to cause human infections and since the late 1990s, community-onset antibiotic resistant infections (methicillin resistant S. aureus (MRSA)) continue to cause significant infections in the United States. Skin and soft tissue infections (SSTIs) still account for the majority of these in the outpatient setting. Machine learning can predict the location-based risks for community-level S. aureus infections. Multi-year (2002-2016) electronic health records of children <19 years old with S. aureus infections were queried for patient level data for demographic, clinical, and laboratory information. Area level data (Block group) was abstracted from U.S. Census data. A machine learning ecological niche model, maximum entropy (MaxEnt), was applied to assess model performance of specific place-based factors (determined a priori) associated with S. aureus infections; analyses were structured to compare methicillin resistant (MRSA) against methicillin sensitive S. aureus (MSSA) infections. Differences in rates of MRSA and MSSA infections were determined by comparing those which occurred in the early phase (2002-2005) and those in the later phase (2006-2016). Multi-level modeling was applied to identify risks factors for S. aureus infections. Among 16,124 unique patients with community-onset MRSA and MSSA, majority occurred in the most densely populated neighborhoods of Atlanta's metropolitan area. MaxEnt model performance showed the training AUC ranged from 0.771 to 0.824, while the testing AUC ranged from 0.769 to 0.839. Population density was the area variable which contributed the most in predicting S. aureus disease (stratified by CO-MRSA and CO-MSSA) across early and late periods. Race contributed more to CO-MRSA prediction models during the early and late periods than for CO-MSSA. Machine learning accurately predicts which densely populated areas are at highest and lowest risk for community-onset S. aureus infections over a 14-year time span.