Frontiers in Pediatrics (Jun 2024)
Machine learning for screening and predicting the availability of medications for children: a cross-sectional survey study
Abstract
ObjectiveThe aim of the study was to explore the factors influencing the availability of medications for children, and establish a machine learning model to provide an empirical basis for the subsequent formulation and improvement of relevant policies.MethodsDesign: Cross-sectional survey. Setting: 12 provinces, China. Medical doctors from 25 public hospitals were enrolled. All data were randomly divided into a training set and a validation set at a ratio of 7:3. Three prediction models, namely random forest (RF), logistic regression (LR), and extreme gradient boosting (XGBoost), were developed and compared. The receiver operating characteristic curve (ROC) and the associated area under the curve (AUC) were used to evaluate the three models. A nomogram and clinical impact curve (CIC) for availability of medication were developed.ResultsFifteen of 29 factors in the database that were most likely to be selected were considered to establish the prediction model. The XGBoost model (AUC = 0.915) demonstrated better performance than the RF model (AUC = 0.902) and the LR model (AUC = 0.890). According to the Shapley additive explanation values, the five factors that most significantly affected the availability of medications for children in the XGboost model were as follows: the relatively small number of specialized dosage forms for children; unaffordable medications for children; public education on the accessibility and safety of medication for children; uneven distribution of medical resources, leading to insufficient access to medication for children; and years of service as a doctor. The CIC was used to assess the practical applicability of the factor prediction nomogram.ConclusionsThe XGBoost model can be used to establish a prediction model to screen the factors associated with the availability of medications for children. The most important contributing factors to the models were the following: the relatively small number of specialized dosage forms for children; unaffordable medications for children; public education on the accessibility and safety of medication for children; uneven distribution of medical resources, leading to insufficient access to medication for children; and years of service as a doctor.
Keywords