Drug Design, Development and Therapy (May 2025)
Prediction of High-Dose Methotrexate Blood Concentration in Osteosarcoma Patients Using Machine Learning
Jin Zhao,1 Shuqi Dai,2 Jiali He,1 Na Liu,1 Baowanze Zhang,1 Su Li1
1Department of Pharmacy, Cancer Hospital of China Medical University, Liaoning Cancer Hospital & Institute, Shenyang, Liaoning, 110042, People’s Republic of China
2Department of Pharmacy, Qujing Medical College, Qujing, Yunnan, 655000, People’s Republic of China
Correspondence: Su Li, Department of Pharmacy, Cancer Hospital of China Medical University, Liaoning Cancer Hospital & Institute, No. 44, Xiaoheyan Road, Dadong District, Shenyang, Liaoning, 110042, People’s Republic of China, Tel +86-024-81916806, Email [email protected]
Abstract
High-dose methotrexate is a typical chemotherapy regimen that is widely used in the treatment of osteosarcoma. However, the unique dose-response relationship of methotrexate leaves a relatively narrow therapeutic window, which creates a clinical dilemma: either the drug concentration in the patient’s body fails to reach an effective level, or adverse reactions occur because of overexposure. Monitoring and predicting the drug concentration in the patient’s body is therefore both well founded and necessary. Pharmacokinetic models exist, but they often oversimplify patient-specific covariates. This study addresses the unmet need for early exposure prediction through interpretable machine learning, enabling data-driven decisions before toxicity manifests.
Methods: Demographic, drug administration, and laboratory assay data were collected from 68 osteosarcoma patients. The medical data were analyzed with a random forest, which was used to select 10 important features: hydration status, red blood cell distribution width coefficient of variation, platelet distribution width, creatinine, γ-glutamyl transferase, large platelet ratio, serum potassium, lactate dehydrogenase, weight, and prealbumin.
Cross-validation and SHAP (SHapley Additive exPlanations) analysis were then conducted to confirm the robustness and interpretability of the model.
Results: On this basis, 7 machine learning regression models were built to predict the blood concentration of methotrexate. The coefficient of determination (R2), mean squared error (MSE), root mean squared error (RMSE), and mean absolute error (MAE) served as the evaluation metrics. LightGBM was selected as the best prediction model, with R2=0.87, MSE=0.020, RMSE=0.141, and MAE=0.065.
Discussion: This machine learning framework addresses a critical gap in therapeutic monitoring of high-dose methotrexate by predicting blood drug concentrations early and for each individual patient, so that dosing can be personalized based on the predicted concentrations. The interpretability provided by SHAP-derived feature importance enhances clinical utility, supporting a shift from reactive toxicity management to proactive precision dosing in osteosarcoma therapy.
Keywords: high-dose methotrexate, osteosarcoma, machine learning, blood concentration prediction
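For readers less familiar with the four evaluation metrics named in the Results, the following is a minimal sketch of how they are computed from paired observed and predicted concentrations. It is not the authors' code: the function name `regression_metrics` and the sample values are hypothetical and serve only as illustration. The sketch also shows that RMSE is the square root of MSE, which agrees with the reported values (the square root of 0.020 is approximately 0.141).

```python
import math


def regression_metrics(y_true, y_pred):
    """Return R2, MSE, RMSE and MAE for paired observed/predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(r * r for r in residuals) / n       # mean squared error
    mae = sum(abs(r) for r in residuals) / n      # mean absolute error
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    ss_res = mse * n                                    # residual sum of squares
    # R2 is undefined when the observed values have no variance
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return {"R2": r2, "MSE": mse, "RMSE": math.sqrt(mse), "MAE": mae}


# Hypothetical concentrations for illustration only (not study data)
observed = [0.10, 0.25, 0.40, 0.80, 1.20]
predicted = [0.12, 0.20, 0.45, 0.70, 1.25]
metrics = regression_metrics(observed, predicted)
print(metrics)
```

Because MSE and RMSE square the residuals, a few large prediction errors raise them more than they raise MAE, which may be why all three are reported together with R2.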