Risk Management and Healthcare Policy (Nov 2023)
Using a Decision Tree Algorithm Predictive Model for Sperm Count Assessment and Risk Factors in Health Screening Population
Abstract
Hung-Hsiang Huang,1,* Chi-Jie Lu,2– 4,* Mao-Jhen Jhou,4 Tzu-Chi Liu,4 Chih-Te Yang,5 Shang-Ju Hsieh,1 Wen-Jen Yang,6 Hsiao-Chun Chang,1 Ming-Shu Chen7 1Department of Urology, Surgery, Far Eastern Memorial Hospital, New Taipei City, 220, Taiwan; 2Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City, 242, Taiwan; 3Department of Information Management, Fu Jen Catholic University, New Taipei City, 242, Taiwan; 4Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, 242, Taiwan; 5Department of Business Administration, Tamkang University, New Taipei City, 251, Taiwan; 6Health Screening Center, Chi Hsin Clinic, Taipei City, 104, Taiwan; 7Department of Healthcare Administration, Asia Eastern University of Science and Technology, New Taipei City, 220, Taiwan*These authors contributed equally to this workCorrespondence: Ming-Shu Chen; Hsiao-Chun Chang, Email [email protected]; [email protected]: Approximately 20% of couples face infertility challenges and struggle to conceive naturally. Despite advances in artificial reproduction, its success hinges on sperm quality. Our previous study used five machine learning (ML) algorithms, random forest, stochastic gradient boosting, least absolute shrinkage and selection operator regression, ridge regression, and extreme gradient boosting, to model health data from 1375 Taiwanese males and identified ten risk factors affecting sperm count.Methods: We employed the CART algorithm to generate decision trees using identified risk factors to predict healthy sperm counts. Four error metrics, SMAPE, RAE, RRSE, and RMSE, were used to evaluate the decision trees. We identified the top five decision trees based on their low errors and discussed in detail the tree with the least error.Results: The decision tree featuring the least error, comprising BMI, UA, ST, T-Cho/HDL-C ratio, and BUN, corroborated the negative impacts of metabolic syndrome, particularly high BMI, on sperm count, while emphasizing the link between good sleep and male fertility. Our study also sheds light on the potentially significant influence of high BUN on spermatogenesis. Two novel risk factors, T-Cho/HDL-C and UA, warrant further investigation.Conclusion: The ML algorithm established a predictive model for healthcare personnel to assess low sperm counts. Refinement of the model using additional data is crucial for improved precision. The risk factors identified offer avenues for future investigations.Keywords: sperm count, metabolic syndrome, sleep time, food metabolite, decision tree