Brain Sciences (Oct 2024)

Feature Contributions and Predictive Accuracy in Modeling Adolescent Daytime Sleepiness Using Machine Learning: The MeLiSA Study

  • Mohammed A. Mamun,
  • Jannatul Mawa Misti,
  • Md Emran Hasan,
  • Firoj Al-Mamun,
  • Moneerah Mohammad ALmerab,
  • Johurul Islam,
  • Mohammad Muhit,
  • David Gozal

DOI
https://doi.org/10.3390/brainsci14101015
Journal volume & issue
Vol. 14, no. 10
p. 1015

Abstract

Read online

Background: Excessive daytime sleepiness (EDS) among adolescents poses significant risks to academic performance, mental health, and overall well-being. This study examines the prevalence and risk factors of EDS in adolescents in Bangladesh and utilizes machine learning approaches to predict the risk of EDS. Methods: A cross-sectional study was conducted among 1496 adolescents using a structured questionnaire. Data were collected through a two-stage stratified cluster sampling method. Chi-square tests and logistic regression analyses were performed using SPSS. Machine learning models, including Categorical Boosting (CatBoost), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), and Gradient Boosting Machine (GBM), were employed to identify and predict EDS risk factors using Python and Google Colab. Results: The prevalence of EDS in the cohort was 11.6%. SHAP values from the CatBoost model identified self-rated health status, gender, and depression as the most significant predictors of EDS. Among the models, GBM achieved the highest accuracy (90.15%) and precision (88.81%), while CatBoost had comparable accuracy (89.48%) and the lowest log loss (0.25). ROC-AUC analysis showed that CatBoost and GBM performed robustly in distinguishing between EDS and non-EDS cases, with AUC scores of 0.86. Both models demonstrated the superior predictive performance for EDS compared to others. Conclusions: The study emphasizes the role of health and demographic factors in predicting EDS among adolescents in Bangladesh. Machine learning techniques offer valuable insights into the relative contribution of these factors, and can guide targeted interventions. Future research should include longitudinal and interventional studies in diverse settings to improve generalizability and develop effective strategies for managing EDS among adolescents.

Keywords