Deep advantage learning for optimal dynamic treatment regime

Shuhan Liang; Wenbin Lu; Rui Song

doi:10.1080/24754269.2018.1466096

Statistical Theory and Related Fields (Jan 2018)

Affiliations

Shuhan Liang: North Carolina State University
Wenbin Lu: North Carolina State University
Rui Song: North Carolina State University

DOI: https://doi.org/10.1080/24754269.2018.1466096
Journal volume & issue: Vol. 2, no. 1, pp. 80–88

Abstract

Recently, deep learning has achieved state-of-the-art performance on many difficult tasks. Deep neural networks allow for model flexibility and process features without requiring domain knowledge. Advantage learning (A-learning) is a popular method for estimating dynamic treatment regimes (DTRs). It models the advantage function, which is directly relevant to the optimal treatment decision, and makes no assumptions about the baseline function. However, there is a paucity of literature on deep A-learning. In this paper, we present a deep A-learning approach to estimating the optimal DTR. We use an inverse probability weighting method to estimate the difference between potential outcomes. Parameter sharing in convolutional neural networks (CNNs) greatly reduces the number of parameters, which allows for high scalability. Convexified convolutional neural networks (CCNNs) relax the constraints of CNNs for optimisation purposes. Different CNN and CCNN architectures are implemented for contrast function estimation. Both simulation results and an application to the STAR*D (Sequenced Treatment Alternatives to Relieve Depression) trial indicate that the proposed methods outperform the penalised least squares estimator.
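
To illustrate the inverse probability weighting idea mentioned in the abstract, here is a minimal single-stage sketch in Python. It is not the authors' implementation, and the abstract does not give the exact form of their estimator. The sketch assumes a binary treatment `a` with known propensity `pi` (as in a randomised trial) and uses the standard identity that `y * (a - pi) / (pi * (1 - pi))` has conditional mean equal to the contrast between the two potential outcomes. The function name, the simulated data, and the linear least squares fit (standing in for the CNN or CCNN contrast model) are all illustrative assumptions.

```python
import numpy as np


def ipw_contrast_pseudo_outcome(y, a, propensity):
    """Return IPW-transformed outcomes whose conditional mean given the
    covariates equals E[Y | X, A=1] - E[Y | X, A=0].

    Uses (a - pi) / (pi * (1 - pi)) = a / pi - (1 - a) / (1 - pi).
    """
    y = np.asarray(y, dtype=float)
    a = np.asarray(a, dtype=float)
    pi = np.asarray(propensity, dtype=float)
    return y * (a - pi) / (pi * (1.0 - pi))


# Hypothetical simulated trial: one covariate, randomised treatment.
rng = np.random.default_rng(0)
n = 200_000
x = rng.normal(size=n)
pi = np.full(n, 0.5)                 # known randomisation probability
a = rng.binomial(1, pi)              # treatment indicator in {0, 1}

true_contrast = 1.0 + 2.0 * x        # assumed contrast function (illustrative)
baseline = np.sin(x)                 # baseline function, left unmodelled
y = baseline + a * true_contrast + rng.normal(scale=0.5, size=n)

# Regress the pseudo-outcome on the covariates. A linear model is used here
# purely as a stand-in for a neural-network contrast model.
pseudo = ipw_contrast_pseudo_outcome(y, a, pi)
design = np.column_stack([np.ones(n), x])
coef, *_ = np.linalg.lstsq(design, pseudo, rcond=None)

# Estimated rule: treat whenever the estimated contrast is positive.
recommended = (design @ coef > 0).astype(int)
```

In this sketch the baseline `np.sin(x)` is never modelled, yet the fitted coefficients should land near the assumed intercept and slope of the contrast. This reflects the abstract's point that A-learning makes no assumptions about the baseline function.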

Published in Statistical Theory and Related Fields

ISSN: 2475-4269 (Print); 2475-4277 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Probabilities. Mathematical statistics
Website: https://www.tandfonline.com/TSTF
