Predicting non-suicidal self-injury among Chinese adolescents: The application of ten algorithms of machine learning

Wei Chen; Yujing Gao; Shiyin Xiao

Heliyon (Sep 2024)

Predicting non-suicidal self-injury among Chinese adolescents: The application of ten algorithms of machine learning

Wei Chen,
Yujing Gao,
Shiyin Xiao

Affiliations

Wei Chen: School of Psychology, Guizhou Normal University, Guiyang, China; Inner Mongolia Student Bullying Prevention Research Center, Tongliao, China; Corresponding author.School of Psychology, Guizhou Normal University, Guiyang, China
Yujing Gao: School of Psychology, Guizhou Normal University, Guiyang, China; Inner Mongolia Student Bullying Prevention Research Center, Tongliao, China
Shiyin Xiao: School of Psychology, Guizhou Normal University, Guiyang, China; Inner Mongolia Student Bullying Prevention Research Center, Tongliao, China

Journal volume & issue: Vol. 10, no. 18
p. e37723

Abstract

Read online

Background and aims: High non-suicidal self-injury (NSSI) prevalence among adolescents is a global health issue. However, current prediction models for adolescent NSSI rely on a limited set of algorithms, resulting in biased predictions. Therefore, the aim of this study is to develop multiple machine learning models to enhance prediction accuracy and mitigate biases among Chinese adolescents. Methods: A total of 4487 junior and senior high school students in China were recruited. Multiple algorithms were included, such as logistic regression, decision tree, support vector machine, Naive Bayes, multi-layer perceptron, K-nearest neighbors, and ensemble learning algorithm like random forest, bagging, AdaBoost, and stacking to build predictive models. Data processing techniques, including standardization and the synthetic minority oversampling technique, were employed to optimize the predictive model. The model was trained on 70 % of the data, reserving 30 % for testing. Results: The ten prediction models achieved a good performance, with area under the receiver operating characteristic curve (AUC) scores above 0.700 in the test set. The stacking and random forest models achieved AUC scores of 0.904 and 0.898, respectively. The prediction performance of the Naive Bayes model was relatively poor. The top five important variables were resilience, bully, suicidal ideation, internet addiction, and depression. Conclusions: The ensemble machine learning algorithm showed promising results predicting NSSI among adolescents. Such algorithms should be recommended for future NSSI research to enhance predictive accuracy. Identification of important features in NSSI prediction can help develop screening protocols and lay a foundation for clinical diagnosis and intervention in adolescent populations.

Published in Heliyon

ISSN: 2405-8440 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General); Social Sciences: Social sciences (General)
Website: https://www.cell.com/heliyon/home

About the journal

Abstract

Keywords