Learning Optimal Dynamic Treatment Regime from Observational Clinical Data through Reinforcement Learning

Seyum Abebe; Irene Poli; Roger D. Jones; Debora Slanzi

doi:10.3390/make6030088

Machine Learning and Knowledge Extraction (Jul 2024)

Learning Optimal Dynamic Treatment Regime from Observational Clinical Data through Reinforcement Learning

Seyum Abebe,
Irene Poli,
Roger D. Jones,
Debora Slanzi

Affiliations

Seyum Abebe: European Centre for Living Technology, Ca’ Foscari University of Venice, 30123 Venice, Italy
Irene Poli: European Centre for Living Technology, Ca’ Foscari University of Venice, 30123 Venice, Italy
Roger D. Jones: European Centre for Living Technology, Ca’ Foscari University of Venice, 30123 Venice, Italy
Debora Slanzi: European Centre for Living Technology, Ca’ Foscari University of Venice, 30123 Venice, Italy

DOI: https://doi.org/10.3390/make6030088
Journal volume & issue: Vol. 6, no. 3
pp. 1798 – 1817

Abstract

Read online

In medicine, dynamic treatment regimes (DTRs) have emerged to guide personalized treatment decisions for patients, accounting for their unique characteristics. However, existing methods for determining optimal DTRs face limitations, often due to reliance on linear models unsuitable for complex disease analysis and a focus on outcome prediction over treatment effect estimation. To overcome these challenges, decision tree-based reinforcement learning approaches have been proposed. Our study aims to evaluate the performance and feasibility of such algorithms: tree-based reinforcement learning (T-RL), DTR-Causal Tree (DTR-CT), DTR-Causal Forest (DTR-CF), stochastic tree-based reinforcement learning (SL-RL), and Q-learning with Random Forest. Using real-world clinical data, we conducted experiments to compare algorithm performances. Evaluation metrics included the proportion of correctly assigned patients to recommended treatments and the empirical mean with standard deviation of expected counterfactual outcomes based on estimated optimal treatment strategies. This research not only highlights the potential of decision tree-based reinforcement learning for dynamic treatment regimes but also contributes to advancing personalized medicine by offering nuanced and effective treatment recommendations.

Published in Machine Learning and Knowledge Extraction

ISSN: 2504-4990 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware
Website: https://www.mdpi.com/journal/make

About the journal

Abstract

Keywords