Deep reinforcement learning extracts the optimal sepsis treatment policy from treatment records

Yunho Choi; Songmi Oh; Jin Won Huh; Ho-Taek Joo; Hosu Lee; Wonsang You; Cheng-mok Bae; Jae-Hun Choi; Kyung-Joong Kim

doi:10.1038/s43856-024-00665-x

Communications Medicine (Nov 2024)

Deep reinforcement learning extracts the optimal sepsis treatment policy from treatment records

Yunho Choi,
Songmi Oh,
Jin Won Huh,
Ho-Taek Joo,
Hosu Lee,
Wonsang You,
Cheng-mok Bae,
Jae-Hun Choi,
Kyung-Joong Kim

Affiliations

Yunho Choi: School of Integrated Technology, Gwangju Institute of Science and Technology
Songmi Oh: School of Integrated Technology, Gwangju Institute of Science and Technology
Jin Won Huh: Pulmonary and Critical Care Medicine, Asan Medical Center
Ho-Taek Joo: School of Integrated Technology, Gwangju Institute of Science and Technology
Hosu Lee: Department of Control and Robot Engineering, Gyeongsang National University
Wonsang You: School of Integrated Technology, Gwangju Institute of Science and Technology
Cheng-mok Bae: School of Integrated Technology, Gwangju Institute of Science and Technology
Jae-Hun Choi: Medical Information Lab, Electronics and Telecommunications Research Institute
Kyung-Joong Kim: School of Integrated Technology, Gwangju Institute of Science and Technology

DOI: https://doi.org/10.1038/s43856-024-00665-x
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Sepsis is one of the most life-threatening medical conditions. Therefore, many clinical trials have been conducted to identify optimal treatment strategies for sepsis. However, finding reliable strategies remains challenging due to limited-scale clinical tests. Here we tried to extract the optimal sepsis treatment policy from accumulated treatment records. Methods In this study, with our modified deep reinforcement learning algorithm, we stably generated a patient treatment artificial intelligence model. As training data, 16,744 distinct admissions in tertiary hospitals were used and tested with separate datasets. Model performance was tested by t test and visualization of estimated survival rates. We also analyze model behavior using the confusion matrix, important feature extraction by a random forest decision tree, and treatment behavior comparison to understand how our treatment model achieves high performance. Results Here we show that our treatment model’s policy achieves a significantly higher estimated survival rate (up to 10.03%). We also show that our models’ vasopressor treatment was quite different from that of physicians. Here, we identify that blood urea nitrogen, age, sequential organ failure assessment score, and shock index are the most different factors in dealing with sepsis patients between our model and physicians. Conclusions Our results demonstrate that the patient treatment model can extract potential optimal sepsis treatment policy. We also extract core information about sepsis treatment by analyzing its policy. These results may not apply directly in clinical settings because they were only tested on a database. However, they are expected to serve as important guidelines for further research.

Published in Communications Medicine

ISSN: 2730-664X (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://www.nature.com/commsmed/

About the journal