HemaSphere (Jan 2023)

Machine Learning Improves Risk Stratification in Myelofibrosis: An Analysis of the Spanish Registry of Myelofibrosis

  • Adrián Mosquera-Orgueira,
  • Manuel Pérez-Encinas,
  • Alberto Hernández-Sánchez,
  • Teresa González-Martínez,
  • Eduardo Arellano-Rodrigo,
  • Javier Martínez-Elicegui,
  • Ángela Villaverde-Ramiro,
  • José-María Raya,
  • Rosa Ayala,
  • Francisca Ferrer-Marín,
  • María-Laura Fox,
  • Patricia Velez,
  • Elvira Mora,
  • Blanca Xicoy,
  • María-Isabel Mata-Vázquez,
  • María García-Fortes,
  • Anna Angona,
  • Beatriz Cuevas,
  • María-Alicia Senín,
  • Angel Ramírez-Payer,
  • María-José Ramírez,
  • Raúl Pérez-López,
  • Sonia González de Villambrosía,
  • Clara Martínez-Valverde,
  • María-Teresa Gómez-Casares,
  • Carmen García-Hernández,
  • Mercedes Gasior,
  • Beatriz Bellosillo,
  • Juan-Luis Steegmann,
  • Alberto Álvarez-Larrán,
  • Jesús María Hernández-Rivas,
  • Juan Carlos Hernández-Boluda,
  • on behalf of the Spanish MPN Group (GEMFIN).

DOI
https://doi.org/10.1097/HS9.0000000000000818
Journal volume & issue
Vol. 7, no. 1
p. e818

Abstract

Read online

Myelofibrosis (MF) is a myeloproliferative neoplasm (MPN) with heterogeneous clinical course. Allogeneic hematopoietic cell transplantation remains the only curative therapy, but its morbidity and mortality require careful candidate selection. Therefore, accurate disease risk prognostication is critical for treatment decision-making. We obtained registry data from patients diagnosed with MF in 60 Spanish institutions (N = 1386). These were randomly divided into a training set (80%) and a test set (20%). A machine learning (ML) technique (random forest) was used to model overall survival (OS) and leukemia-free survival (LFS) in the training set, and the results were validated in the test set. We derived the AIPSS-MF (Artificial Intelligence Prognostic Scoring System for Myelofibrosis) model, which was based on 8 clinical variables at diagnosis and achieved high accuracy in predicting OS (training set c-index, 0.750; test set c-index, 0.744) and LFS (training set c-index, 0.697; test set c-index, 0.703). No improvement was obtained with the inclusion of MPN driver mutations in the model. We were unable to adequately assess the potential benefit of including adverse cytogenetics or high-risk mutations due to the lack of these data in many patients. AIPSS-MF was superior to the IPSS regardless of MF subtype and age range and outperformed the MYSEC-PM in patients with secondary MF. In conclusion, we have developed a prediction model based exclusively on clinical variables that provides individualized prognostic estimates in patients with primary and secondary MF. The use of AIPSS-MF in combination with predictive models that incorporate genetic information may improve disease risk stratification.