Development of machine learning models for predicting depressive symptoms in knee osteoarthritis patients

Dan Li; Han Lu; Junhui Wu; Hongbo Chen; Meidi Shen; Beibei Tong; Wen Zeng; Weixuan Wang; Shaomei Shang

doi:10.1038/s41598-024-79601-x

Scientific Reports (Nov 2024)

Affiliations

Dan Li: Nursing School, Peking University Health Science Center
Han Lu: Nursing School, Peking University Health Science Center
Junhui Wu: Nursing School, Peking University Health Science Center
Hongbo Chen: Peking University Third Hospital
Meidi Shen: Nursing School, Peking University Health Science Center
Beibei Tong: Nursing School, Peking University Health Science Center
Wen Zeng: Nursing School, Peking University Health Science Center
Weixuan Wang: Nursing School, Peking University Health Science Center
Shaomei Shang: Nursing School, Peking University Health Science Center

DOI: https://doi.org/10.1038/s41598-024-79601-x
Journal volume & issue: Vol. 14, no. 1, pp. 1–16

Abstract

Knee osteoarthritis (KOA) combined with depressive symptoms is prevalent and leads to poor outcomes and significant financial burdens. However, practical tools for identifying at-risk patients remain limited, and a robust prediction model is needed to address this gap. This study aims to develop and validate a predictive model to identify KOA patients at risk of developing depressive symptoms. Data from the China Health and Retirement Longitudinal Study (CHARLS) were used for model development, and data from the Osteoarthritis Initiative (OAI) were used for external validation. Eighteen potential predictors were selected using LASSO regression. Four machine learning models (logistic regression, decision tree, random forest, and artificial neural network) were developed. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), calibration curves, and decision curve analysis. The most important features were extracted from the optimal model on external validation. A total of 469 individuals were included, with 70% used for training and 30% for testing. The random forest model achieved the best performance, with an AUC of 0.928 in the test set, outperforming the logistic regression (AUC 0.622), decision tree (AUC 0.611), and neural network (AUC 0.868) models. External validation yielded an AUC of 0.877 (95% CI: 0.864–0.889) for the adjusted random forest model. Pain severity was the most significant predictor, followed by the five-time sit-to-stand test (FTSST) and sleep problems. This study is the first in China to apply a predictive model for depressive symptoms in KOA patients, offering a practical tool for early risk identification using routinely available data.
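The workflow summarised in the abstract (L1-penalised feature selection, a 70/30 split, four classifiers compared by AUC) can be illustrated with a minimal scikit-learn sketch. This is not the authors' code. The data are synthetic, and every hyperparameter, the use of L1-penalised logistic regression as the LASSO step, and the bootstrap percentile interval for the AUC are assumptions made only for illustration; the abstract does not state how the confidence interval was computed.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): 469 rows to mirror the reported
# sample size; the 30 candidate features are arbitrary, not CHARLS variables.
X, y = make_classification(n_samples=469, n_features=30, n_informative=8,
                           random_state=0)

# 70% training / 30% testing, stratified on the outcome.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, stratify=y, random_state=0)

# LASSO-style selection (assumption): keep features whose coefficient in an
# L1-penalised logistic regression, fitted on the training set, is non-zero.
scaler = StandardScaler().fit(X_train)
selector = LogisticRegression(penalty="l1", solver="liblinear", C=0.1)
selector.fit(scaler.transform(X_train), y_train)
selected = np.flatnonzero(selector.coef_.ravel() != 0)
if selected.size == 0:  # fall back to all features if the penalty removes everything
    selected = np.arange(X.shape[1])

# The four model families named in the abstract, with illustrative settings.
models = {
    "logistic_regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)),
    "decision_tree": DecisionTreeClassifier(max_depth=4, random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=200, random_state=0),
    "neural_network": make_pipeline(
        StandardScaler(),
        MLPClassifier(hidden_layer_sizes=(16,), max_iter=2000, random_state=0)),
}

aucs, probs = {}, {}
for name, model in models.items():
    model.fit(X_train[:, selected], y_train)
    probs[name] = model.predict_proba(X_test[:, selected])[:, 1]
    aucs[name] = roc_auc_score(y_test, probs[name])

# Bootstrap percentile 95% interval for the random forest test-set AUC.
rng = np.random.default_rng(0)
boot = []
for _ in range(500):
    idx = rng.integers(0, len(y_test), len(y_test))
    if np.unique(y_test[idx]).size < 2:  # AUC needs both classes present
        continue
    boot.append(roc_auc_score(y_test[idx], probs["random_forest"][idx]))
ci_low, ci_high = np.percentile(boot, [2.5, 97.5])

for name, auc in aucs.items():
    print(f"{name}: AUC = {auc:.3f}")
print(f"random_forest 95% bootstrap CI: {ci_low:.3f} to {ci_high:.3f}")
```

Because the data are synthetic, the printed AUCs have no relation to the values reported in the abstract. Fitting the selector on the training split only avoids leaking test-set information into feature selection.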

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
