Exploring a global interpretation mechanism for deep learning networks when predicting sepsis

Ethan A. T. Strickler; Joshua Thomas; Johnson P. Thomas; Bruce Benjamin; Rittika Shamsuddin

doi:10.1038/s41598-023-30091-3

Scientific Reports (Feb 2023)

Exploring a global interpretation mechanism for deep learning networks when predicting sepsis

Ethan A. T. Strickler,
Joshua Thomas,
Johnson P. Thomas,
Bruce Benjamin,
Rittika Shamsuddin

Affiliations

Ethan A. T. Strickler: Physics and Mathematics, East Central University
Joshua Thomas: Department of Internal Medicine, Rush University Medical Center
Johnson P. Thomas: Oklahoma State University
Bruce Benjamin: School of Biomedical Sciences, Center for Health Sciences
Rittika Shamsuddin: Oklahoma State University

DOI: https://doi.org/10.1038/s41598-023-30091-3
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 17

Abstract

Read online

Abstract The purpose of this study is to identify additional clinical features for sepsis detection through the use of a novel mechanism for interpreting black-box machine learning models trained and to provide a suitable evaluation for the mechanism. We use the publicly available dataset from the 2019 PhysioNet Challenge. It has around 40,000 Intensive Care Unit (ICU) patients with 40 physiological variables. Using Long Short-Term Memory (LSTM) as the representative black-box machine learning model, we adapted the Multi-set Classifier to globally interpret the black-box model for concepts it learned about sepsis. To identify relevant features, the result is compared against: (i) features used by a computational sepsis expert, (ii) clinical features from clinical collaborators, (iii) academic features from literature, and (iv) significant features from statistical hypothesis testing. Random Forest was found to be the computational sepsis expert because it had high accuracies for solving both the detection and early detection, and a high degree of overlap with clinical and literature features. Using the proposed interpretation mechanism and the dataset, we identified 17 features that the LSTM used for sepsis classification, 11 of which overlaps with the top 20 features from the Random Forest model, 10 with academic features and 5 with clinical features. Clinical opinion suggests, 3 LSTM features have strong correlation with some clinical features that were not identified by the mechanism. We also found that age, chloride ion concentration, pH and oxygen saturation should be investigated further for connection with developing sepsis. Interpretation mechanisms can bolster the incorporation of state-of-the-art machine learning models into clinical decision support systems, and might help clinicians to address the issue of early sepsis detection. The promising results from this study warrants further investigation into creation of new and improvement of existing interpretation mechanisms for black-box models, and into clinical features that are currently not used in clinical assessment of sepsis.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal