Diagnostics (Jul 2024)

Predictive Modeling of COVID-19 Readmissions: Insights from Machine Learning and Deep Learning Approaches

  • Wei Kit Loo,
  • Wingates Voon,
  • Anwar Suhaimi,
  • Cindy Shuan Ju Teh,
  • Yee Kai Tee,
  • Yan Chai Hum,
  • Khairunnisa Hasikin,
  • Kareen Teo,
  • Hang Cheng Ong,
  • Khin Wee Lai

DOI
https://doi.org/10.3390/diagnostics14141511
Journal volume & issue
Vol. 14, no. 14
p. 1511

Abstract

Read online

This project employs artificial intelligence, including machine learning and deep learning, to assess COVID-19 readmission risk in Malaysia. It offers tools to mitigate healthcare resource strain and enhance patient outcomes. This study outlines a methodology for classifying COVID-19 readmissions. It starts with dataset description and pre-processing, while the data balancing was computed through Random Oversampling, Borderline SMOTE, and Adaptive Synthetic Sampling. Nine machine learning and ten deep learning techniques are applied, with five-fold cross-validation for evaluation. Optuna is used for hyperparameter selection, while the consistency in training hyperparameters is maintained. Evaluation metrics encompass accuracy, AUC, and training/inference times. Results were based on stratified five-fold cross-validation and different data-balancing methods. Notably, CatBoost consistently excelled in accuracy and AUC across all tables. Using ROS, CatBoost achieved the highest accuracy (0.9882 ± 0.0020) with an AUC of 1.0000 ± 0.0000. CatBoost maintained its superiority in BSMOTE and ADASYN as well. Deep learning approaches performed well, with SAINT leading in ROS and TabNet leading in BSMOTE and ADASYN. Decision Tree ensembles like Random Forest and XGBoost consistently showed strong performance.

Keywords