Individualized decision making in on-scene resuscitation time for out-of-hospital cardiac arrest using reinforcement learning

Dong Hyun Choi; Min Hyuk Lim; Ki Jeong Hong; Young Gyun Kim; Jeong Ho Park; Kyoung Jun Song; Sang Do Shin; Sungwan Kim

doi:10.1038/s41746-024-01278-3

npj Digital Medicine (Oct 2024)

Affiliations

Dong Hyun Choi: Department of Biomedical Engineering, Seoul National University College of Medicine
Min Hyuk Lim: Graduate School of Health Science and Technology, Ulsan National Institute of Science and Technology (UNIST)
Ki Jeong Hong: Department of Emergency Medicine, Seoul National University College of Medicine and Hospital
Young Gyun Kim: Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University
Jeong Ho Park: Department of Emergency Medicine, Seoul National University College of Medicine and Hospital
Kyoung Jun Song: Laboratory of Emergency Medical Services, Seoul National University Hospital Biomedical Research Institute
Sang Do Shin: Department of Emergency Medicine, Seoul National University College of Medicine and Hospital
Sungwan Kim: Department of Biomedical Engineering, Seoul National University College of Medicine

DOI: https://doi.org/10.1038/s41746-024-01278-3
Journal volume & issue: Vol. 7, no. 1, pp. 1–14

Abstract

On-scene resuscitation time is associated with out-of-hospital cardiac arrest (OHCA) outcomes. We developed and validated reinforcement learning models for individualized on-scene resuscitation times, leveraging nationwide Korean data. Adult OHCA patients with a medical cause of arrest were included (N = 73,905). The optimal policy was derived from conservative Q-learning to maximize survival. The on-scene return of spontaneous circulation hazard rates estimated from the Random Survival Forest were used as intermediate rewards to handle sparse rewards, while patients’ historical survival was reflected in the terminal rewards. The optimal policy increased the survival to hospital discharge rate from 9.6% to 12.5% (95% CI: 12.2–12.8) and the good neurological recovery rate from 5.4% to 7.5% (95% CI: 7.3–7.7). The recommended maximum on-scene resuscitation times for patients demonstrated a bimodal distribution, varying with patient, emergency medical services, and OHCA characteristics. Our survival analysis-based approach generates explainable rewards, reducing subjectivity in reinforcement learning.
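The reward design described in the abstract can be illustrated with a toy example. The following is a minimal tabular sketch, not the authors' implementation: it assumes a state that is simply the elapsed on-scene minute, a hypothetical two-action space (continue on scene or transport), and made-up hazard values standing in for the Random Survival Forest estimates. The names `train_cql` and `make_episode` and all hyperparameters are illustrative assumptions. The conservative term is the standard log-sum-exp penalty of conservative Q-learning, which pushes down the values of actions that the logged data never took.

```python
import math
import random

CONTINUE, TRANSPORT = 0, 1  # hypothetical two-action space (assumption)


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def make_episode(hazards, stop_minute, survived):
    """Build one logged episode as (state, action, reward, next_state, done).

    Intermediate reward: the (made-up) ROSC hazard at that minute.
    Terminal reward: 1.0 if the patient survived, else 0.0.
    """
    steps = [(t, CONTINUE, hazards[t], t + 1, False) for t in range(stop_minute)]
    steps.append((stop_minute, TRANSPORT, 1.0 if survived else 0.0, stop_minute, True))
    return steps


def train_cql(transitions, n_states, n_actions=2, alpha=1.0,
              lr=0.1, gamma=0.99, epochs=200, seed=0):
    """Tabular Q-learning with a conservative (log-sum-exp) penalty.

    alpha = 0 reduces this to plain offline Q-learning.
    """
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    data = list(transitions)
    for _ in range(epochs):
        rng.shuffle(data)
        for s, a, r, s_next, done in data:
            target = r if done else r + gamma * max(q[s_next])
            probs = softmax(q[s])  # penalty gradient uses the pre-update values
            # Standard temporal-difference step on the logged action.
            q[s][a] += lr * (target - q[s][a])
            # Gradient step on alpha * (logsumexp_b Q(s, b) - Q(s, a)).
            for b in range(n_actions):
                q[s][b] -= lr * alpha * (probs[b] - (1.0 if b == a else 0.0))
    return q


# Toy data: hazards and outcomes are invented for illustration only.
hazards = [0.05, 0.04, 0.02, 0.01]
dataset = make_episode(hazards, 3, False) + make_episode(hazards, 1, True)
q_table = train_cql(dataset, n_states=4)
```

In this toy dataset, transporting at minute 0 never occurs, so its value is only ever touched by the penalty term; with `alpha=0` it stays at its initial value of zero, while with `alpha>0` it is driven negative. That is the sense in which the learned policy is discouraged from recommending actions the historical data cannot support.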

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/
