Digital Health (Dec 2023)
Construction of a predictive model based on MIV-SVR for prognosis and length of stay in patients with traumatic brain injury: Retrospective cohort study
Abstract
Objective To investigate the mean impact value (MIV) method for discerning the most efficacious input variables for the machine learning (ML) model. Subsequently, various ML algorithms are harnessed to formulate a more accurate predictive model that can forecast both the prognosis and the length of hospital stay for patients suffering from traumatic brain injury (TBI). Design Retrospective cohort study. Participants The study retrospectively accrued data from 1128 cases of patients who sought medical intervention at the Neurosurgery Center of the Second Affiliated Hospital of Anhui Medical University, within the timeframe spanning from May 2017 to May 2022. Methods We performed a retrospective analysis of patient data obtained from the Neurosurgery Center of the Second Hospital of Anhui Medical University, covering the period from May 2017 to May 2022. Following meticulous data filtration and partitioning, 70% of the data were allocated for model training, while the remaining 30% served for model evaluation. During the construction phase of the ML models, a gamut of 11 independent variables—including, but not limited to, in-hospital complications and patient age—were utilized as input variables. Conversely, the length of stay (LOS) and the Glasgow Outcome Scale (GOS) scores were designated as output variables. The model architecture was initially refined through the MIV methodology to identify optimal input variables, whereupon five distinct predictive models were constructed, encompassing support vector regression (SVR), convolutional neural networks (CNN), backpropagation (BP) neural networks, artificial neural networks (ANN) and logistic regression (LR). Ultimately, SVR emerged as the most proficient predictive model and was further authenticated through an external dataset obtained from the First Hospital of Anhui Medical University. Results Upon incorporating the optimal input variables as ascertained through MIV, it was observed that the SVR model exhibited remarkable predictive prowess. Specifically, the Mean Absolute Percentage Error (MAPE) of the SVR model in predicting the GOS score in the test dataset is only 6.30%, and the MAPE in the external validation set is only 7.61%. In terms of predicting hospitalization time, the accuracy of the test and external validation sets were 9.28% and 7.91%, respectively. This error indicator is significantly lower than the error of other prediction models, thus proving the excellent efficacy and clinical reliability of the MIV-optimized SVR model. Conclusion This study unequivocally substantiates that the incorporation of MIV for selecting optimal input variables can substantially augment the predictive accuracy of machine learning models. Among the models examined, the MIV-SVR model emerged as the most accurate and clinically applicable, thereby rendering it highly conducive for future clinical decision-making processes.