BMC Medical Informatics and Decision Making (Jan 2024)
Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia
Abstract
Abstract Background Acute Myeloid Leukemia (AML) generally has a relatively low survival rate after treatment. There is an urgent need to find new biomarkers that may improve the survival prognosis of patients. Machine-learning tools are more and more widely used in the screening of biomarkers. Methods Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine-Recursive Feature Elimination (SVM-RFE), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), lrFuncs, IdaProfile, caretFuncs, and nbFuncs models were used to screen key genes closely associated with AML. Then, based on the Cancer Genome Atlas (TCGA), pan-cancer analysis was performed to determine the correlation between important genes and AML or other cancers. Finally, the diagnostic value of important genes for AML was verified in different data sets. Results The survival analysis results of the training set showed 26 genes with survival differences. After the intersection of the results of each machine learning method, DNM1, MEIS1, and SUSD3 were selected as key genes for subsequent analysis. The results of the pan-cancer analysis showed that MEIS1 and DNM1 were significantly highly expressed in AML; MEIS1 and SUSD3 are potential risk factors for the prognosis of AML, and DNM1 is a potential protective factor. Three key genes were significantly associated with AML immune subtypes and multiple immune checkpoints in AML. The results of the verification analysis show that DNM1, MEIS1, and SUSD3 have potential diagnostic value for AML. Conclusion Multiple machine learning methods identified DNM1, MEIS1, and SUSD3 can be regarded as prognostic biomarkers for AML.
Keywords