Biomarker Research (Feb 2021)

Machine learning-aided risk stratification in Philadelphia chromosome-positive acute lymphoblastic leukemia

  • Satoshi Nishiwaki,
  • Isamu Sugiura,
  • Daisuke Koyama,
  • Yukiyasu Ozawa,
  • Masahide Osaki,
  • Yuichi Ishikawa,
  • Hitoshi Kiyoi

DOI
https://doi.org/10.1186/s40364-021-00268-x
Journal volume & issue
Vol. 9, no. 1
pp. 1 – 4

Abstract

Read online

Abstract We used the eXtreme Gradient Boosting algorithm, an optimized gradient boosting machine learning library, and established a model to predict events in Philadelphia chromosome-positive acute lymphoblastic leukemia using a machine learning-aided method. A model was constructed using a training set (80%) and prediction was tested using a test set (20%). According to the feature importance score, BCR-ABL lineage, polymerase chain reaction value, age, and white blood cell count were identified as important features. These features were also confirmed by the permutation feature importance for the prediction using the test set. Both event-free survival and overall survival were clearly stratified according to risk groups categorized using these features: 80 and 100% in low risk (two or less factors), 42 and 47% in intermediate risk (three factors), and 0 and 10% in high risk (four factors) at 4 years. Machine learning-aided analysis was able to identify clinically useful prognostic factors using data from a relatively small number of patients.

Keywords