JMIR Medical Informatics (Mar 2022)

Predicting Long-term Survival After Allogeneic Hematopoietic Cell Transplantation in Patients With Hematologic Malignancies: Machine Learning–Based Model Development and Validation

  • Eun-Ji Choi,
  • Tae Joon Jun,
  • Han-Seung Park,
  • Jung-Hee Lee,
  • Kyoo-Hyung Lee,
  • Young-Hak Kim,
  • Young-Shin Lee,
  • Young-Ah Kang,
  • Mijin Jeon,
  • Hyeran Kang,
  • Jimin Woo,
  • Je-Hwan Lee

DOI
https://doi.org/10.2196/32313
Journal volume & issue
Vol. 10, no. 3
p. e32313

Abstract

Read online

BackgroundScoring systems developed for predicting survival after allogeneic hematopoietic cell transplantation (HCT) show suboptimal prediction power, and various factors affect posttransplantation outcomes. ObjectiveA prediction model using a machine learning–based algorithm can be an alternative for concurrently applying multiple variables and can reduce potential biases. In this regard, the aim of this study is to establish and validate a machine learning–based predictive model for survival after allogeneic HCT in patients with hematologic malignancies. MethodsData from 1470 patients with hematologic malignancies who underwent allogeneic HCT between December 1993 and June 2020 at Asan Medical Center, Seoul, South Korea, were retrospectively analyzed. Using the gradient boosting machine algorithm, we evaluated a model predicting the 5-year posttransplantation survival through 10-fold cross-validation. ResultsThe prediction model showed good performance with a mean area under the receiver operating characteristic curve of 0.788 (SD 0.03). Furthermore, we developed a risk score predicting probabilities of posttransplantation survival in 294 randomly selected patients, and an agreement between the estimated predicted and observed risks of overall death, nonrelapse mortality, and relapse incidence was observed according to the risk score. Additionally, the calculated score demonstrated the possibility of predicting survival according to the different transplantation-related factors, with the visualization of the importance of each variable. ConclusionsWe developed a machine learning–based model for predicting long-term survival after allogeneic HCT in patients with hematologic malignancies. Our model provides a method for making decisions regarding patient and donor candidates or selecting transplantation-related resources, such as conditioning regimens.