Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit.

Tianzhi Huang; Dejin Le; Lili Yuan; Shoujia Xu; Xiulan Peng

doi:10.1371/journal.pone.0280606

PLoS ONE (Jan 2023)

Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit.

Tianzhi Huang,
Dejin Le,
Lili Yuan,
Shoujia Xu,
Xiulan Peng

Affiliations

Tianzhi Huang
Dejin Le
Lili Yuan
Shoujia Xu
Xiulan Peng

DOI: https://doi.org/10.1371/journal.pone.0280606
Journal volume & issue: Vol. 18, no. 1
p. e0280606

Abstract

Read online

BackgroundsThe in-hospital mortality in lung cancer patients admitted to intensive care unit (ICU) is extremely high. This study intended to adopt machine learning algorithm models to predict in-hospital mortality of critically ill lung cancer for providing relative information in clinical decision-making.MethodsData were extracted from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) for a training cohort and data extracted from the Medical Information Mart for eICU Collaborative Research Database (eICU-CRD) database for a validation cohort. Logistic regression, random forest, decision tree, light gradient boosting machine (LightGBM), eXtreme gradient boosting (XGBoost), and an ensemble (random forest+LightGBM+XGBoost) model were used for prediction of in-hospital mortality and important feature extraction. The AUC (area under receiver operating curve), accuracy, F1 score and recall were used to evaluate the predictive performance of each model. Shapley Additive exPlanations (SHAP) values were calculated to evaluate feature importance of each feature.ResultsOverall, there were 653 (24.8%) in-hospital mortality in the training cohort, and 523 (21.7%) in-hospital mortality in the validation cohort. Among the six machine learning models, the ensemble model achieved the best performance. The top 5 most influential features were the sequential organ failure assessment (SOFA) score, albumin, the oxford acute severity of illness score (OASIS) score, anion gap and bilirubin in random forest and XGBoost model. The SHAP summary plot was used to illustrate the positive or negative effects of the top 15 features attributed to the XGBoost model.ConclusionThe ensemble model performed best and might be applied to forecast in-hospital mortality of critically ill lung cancer patients, and the SOFA score was the most important feature in all models. These results might offer valuable and significant reference for ICU clinicians' decision-making in advance.

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/

About the journal