Identifying Predictors Associated with Risk of Death or Admission to Intensive Care Unit in Internal Medicine Patients with Sepsis: A Comparison of Statistical Models and Machine Learning Algorithms
Antonio Mirijello,
Andrea Fontana,
Antonio Pio Greco,
Alberto Tosoni,
Angelo D’Agruma,
Maria Labonia,
Massimiliano Copetti,
Pamela Piscitelli,
Salvatore De Cosmo,
on behalf of the Internal Medicine Sepsis Study Group
Affiliations
Antonio Mirijello
Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Andrea Fontana
Unit of Biostatistics, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Antonio Pio Greco
Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Alberto Tosoni
Department of Internal Medicine and Gastroenterology, Fondazione Policlinico Universitario “A. Gemelli” IRCCS, 00168 Rome, Italy
Angelo D’Agruma
Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Maria Labonia
Unit of Microbiology, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Massimiliano Copetti
Unit of Biostatistics, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Pamela Piscitelli
Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
Salvatore De Cosmo
Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy
on behalf of the Internal Medicine Sepsis Study Group
Background: Sepsis is a time-dependent disease: the early recognition of patients at risk for poor outcome is mandatory. Aim: To identify prognostic predictors of the risk of death or admission to intensive care units in a consecutive sample of septic patients, comparing different statistical models and machine learning algorithms. Methods: Retrospective study including 148 patients discharged from an Italian internal medicine unit with a diagnosis of sepsis/septic shock and microbiological identification. Results: Of the total, 37 (25.0%) patients reached the composite outcome. The sequential organ failure assessment (SOFA) score at admission (odds ratio (OR): 1.83; 95% confidence interval (CI): 1.41–2.39; p p p Conclusions: Although structurally different, each model identified similar predictive covariates. The classical multivariable logistic regression model was the most parsimonious and calibrated one, while RPART was the easiest to interpret clinically. Finally, LASSO and RF were the costliest in terms of number of variables identified.