Learning and Assessing Optimal Dynamic Treatment Regimes Through Cooperative Imitation Learning

Syed Ihtesham Hussain Shah; Antonio Coronato; Muddasar Naeem; Giuseppe De Pietro

doi:10.1109/ACCESS.2022.3193494

IEEE Access (Jan 2022)

Learning and Assessing Optimal Dynamic Treatment Regimes Through Cooperative Imitation Learning

Syed Ihtesham Hussain Shah,
Antonio Coronato,
Muddasar Naeem,
Giuseppe De Pietro

Affiliations

Syed Ihtesham Hussain Shah: ORCiD; CNR, Institute for High Performance Computing and Networking (ICAR), Napoli, Italy
Antonio Coronato: ORCiD; CNR, Institute for High Performance Computing and Networking (ICAR), Napoli, Italy
Muddasar Naeem: ORCiD; CNR, Institute for High Performance Computing and Networking (ICAR), Napoli, Italy
Giuseppe De Pietro: CNR, Institute for High Performance Computing and Networking (ICAR), Napoli, Italy

DOI: https://doi.org/10.1109/ACCESS.2022.3193494
Journal volume & issue: Vol. 10
pp. 78148 – 78158

Abstract

Read online

Dynamic Treatment Regimes (DTRs) are sets of sequential decision rules that can be adapted over time to treat patients with a specific pathology. DTR consists of alternative treatment paths and any of these treatments can be adapted depending on the patient's characteristics. Reinforcement Learning (RL) and Imitation Learning (IL) approaches have been deployed for obtaining optimal treatment for a patient but, these approaches rely only on positive trajectories (i.e., treatments that concluded with positive responses of the patient). In contrast, negative trajectories (i.e., samples of non-responding treatments) are discarded, although these have valuable information content. We propose a Cooperative Imitation Learning (CIL) method that exploits information from both negative and positive trajectories to learn the optimal DTR. The proposed method reduces the chance of selecting any treatment which results in a negative outcome (negative response of the patient) during the medical examination. To validate our approach, we have considered a well-known DTR which is defined for the treatment of patients with alcohol addiction. Results show that our approach outperforms those that rely only on positive trajectories.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords