Personalized Treatment Policies with the Novel Buckley-James Q-Learning Algorithm

Jeongjin Lee; Jong-Min Kim

doi:10.3390/axioms13040212

Axioms (Mar 2024)

Personalized Treatment Policies with the Novel Buckley-James Q-Learning Algorithm

Jeongjin Lee,
Jong-Min Kim

Affiliations

Jeongjin Lee: Department of Statistics, Ohio State University, Columbus, OH 43210, USA
Jong-Min Kim: Statistics Discipline, University of Minnesota-Morris, Morris, MN 56267, USA

DOI: https://doi.org/10.3390/axioms13040212
Journal volume & issue: Vol. 13, no. 4
p. 212

Abstract

Read online

This research paper presents the Buckley-James Q-learning (BJ-Q) algorithm, a cutting-edge method designed to optimize personalized treatment strategies, especially in the presence of right censoring. We critically assess the algorithm’s effectiveness in improving patient outcomes and its resilience across various scenarios. Central to our approach is the innovative use of the survival time to impute the reward in Q-learning, employing the Buckley-James method for enhanced accuracy and reliability. Our findings highlight the significant potential of personalized treatment regimens and introduce the BJ-Q learning algorithm as a viable and promising approach. This work marks a substantial advancement in our comprehension of treatment dynamics and offers valuable insights for augmenting patient care in the ever-evolving clinical landscape.

Published in Axioms

ISSN: 2075-1680 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/axioms

About the journal

Abstract

Keywords