A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis

XiaoDan Wu; RuiChang Li; Zhen He; TianZhi Yu; ChangQing Cheng

doi:10.1038/s41746-023-00755-5

npj Digital Medicine (Feb 2023)

Affiliations

XiaoDan Wu: Smart Health Laboratory, Hebei University of Technology
RuiChang Li: Smart Health Laboratory, Hebei University of Technology
Zhen He: College of Management and Economics, Tianjin University
TianZhi Yu: Emergency Department, Tianjin Medical University General Hospital
ChangQing Cheng: Department of Systems Science and Industrial Engineering, State University of New York

Journal volume & issue: Vol. 6, no. 1, pp. 1–12

Abstract

Deep reinforcement learning (DRL) is increasingly being applied to assist clinicians in the real-time treatment of sepsis. Although a value function quantifies how well a policy performs in such decision-making processes, most value-based DRL algorithms cannot estimate the target value function precisely and are not as safe as clinical experts. In this study, we propose a Weighted Dueling Double Deep Q-Network with embedded human Expertise (WD3QNE). A target Q-value function with an adaptive dynamic weight is designed to improve estimation accuracy, and human expertise in decision-making is leveraged. In addition, the random forest algorithm is employed for feature selection to improve model interpretability. We compare our algorithm with state-of-the-art value-function methods in terms of expected return, survival rate, action distribution, and external validation. The results demonstrate that WD3QNE achieves the highest survival rate, 97.81%, on the MIMIC-III dataset. The proposed method is capable of providing reliable treatment decisions with embedded clinician expertise.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/
