Scientific Reports (Apr 2023)
Application of various machine learning techniques to predict obstructive sleep apnea syndrome severity
Abstract
As the incidence of obstructive sleep apnea syndrome (OSAS) increases worldwide, there is a growing need for a new screening method that can compensate for the shortcomings of the traditional diagnostic method, polysomnography (PSG). In this study, data from 4014 patients were analyzed with both supervised and unsupervised learning methods. Clustering was conducted with hierarchical agglomerative clustering, K-means, bisecting K-means, and a Gaussian mixture model, and feature engineering was carried out using both methods drawn from medical research and machine learning techniques. For classification, we used the gradient boosting models XGBoost, LightGBM, and CatBoost, together with Random Forest, to predict the severity of OSAS. The developed model showed high performance, with classification accuracies of 88%, 88%, and 91% at three OSAS severity thresholds: Apnea-Hypopnea Index (AHI) ≥ 5, AHI ≥ 15, and AHI ≥ 30, respectively. These results indicate that machine learning has sufficient potential for predicting OSAS severity.
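The three severity thresholds can be read as three separate binary classification targets derived from the same AHI measurement. The following is a minimal sketch of that reading, not the authors' pipeline: the cutoffs 5, 15, and 30 come from the abstract, while the function names and the sample AHI values are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only. The cutoffs (5, 15, 30) are taken from the abstract;
# the helper names and any sample values are assumptions, not the study's code.
AHI_THRESHOLDS = (5, 15, 30)


def binarize_ahi(ahi_values, threshold):
    """Return 1 for each patient whose AHI is >= threshold, otherwise 0."""
    return [1 if ahi >= threshold else 0 for ahi in ahi_values]


def labels_per_threshold(ahi_values):
    """Build one binary label vector per severity threshold."""
    return {t: binarize_ahi(ahi_values, t) for t in AHI_THRESHOLDS}


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("label vectors must have the same length")
    if not y_true:
        raise ValueError("label vectors must not be empty")
    matches = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return matches / len(y_true)
```

Under this framing, a classifier would be trained and scored once per threshold, which is consistent with the abstract reporting one accuracy figure for each of the three cutoffs.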