Development and head-to-head comparison of machine-learning models to identify patients requiring prostate biopsy

Shuanbao Yu; Jin Tao; Biao Dong; Yafeng Fan; Haopeng Du; Haotian Deng; Jinshan Cui; Guodong Hong; Xuepei Zhang

doi:10.1186/s12894-021-00849-w

BMC Urology (May 2021)

Development and head-to-head comparison of machine-learning models to identify patients requiring prostate biopsy

Shuanbao Yu,
Jin Tao,
Biao Dong,
Yafeng Fan,
Haopeng Du,
Haotian Deng,
Jinshan Cui,
Guodong Hong,
Xuepei Zhang

Affiliations

Shuanbao Yu: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Jin Tao: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Biao Dong: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Yafeng Fan: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Haopeng Du: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Haotian Deng: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Jinshan Cui: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Guodong Hong: Department of Urology, The First Affiliated Hospital of Zhengzhou University
Xuepei Zhang: Department of Urology, The First Affiliated Hospital of Zhengzhou University

DOI: https://doi.org/10.1186/s12894-021-00849-w
Journal volume & issue: Vol. 21, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background Machine learning has many attractive theoretic properties, specifically, the ability to handle non predefined relations. Additionally, studies have validated the clinical utility of mpMRI for the detection and localization of CSPCa (Gleason score ≥ 3 + 4). In this study, we sought to develop and compare machine-learning models incorporating mpMRI parameters with traditional logistic regression analysis for prediction of PCa (Gleason score ≥ 3 + 3) and CSPCa on initial biopsy. Methods A total of 688 patients with no prior prostate cancer diagnosis and tPSA ≤ 50 ng/ml, who underwent mpMRI and prostate biopsy were included between 2016 and 2020. We used four supervised machine-learning algorithms in a hypothesis-free manner to build models to predict PCa and CSPCa. The machine-learning models were compared to the logistic regression analysis using AUC, calibration plot, and decision curve analysis. Results The artificial neural network (ANN), support vector machine (SVM), and random forest (RF) yielded similar diagnostic accuracy with logistic regression, while classification and regression tree (CART, AUC = 0.834 and 0.867) had significantly lower diagnostic accuracy than logistic regression (AUC = 0.894 and 0.917) in prediction of PCa and CSPCa (all P < 0.05). However, the CART illustrated best calibration for PCa (SSR = 0.027) and CSPCa (SSR = 0.033). The ANN, SVM, RF, and LR for PCa had higher net benefit than CART across the threshold probabilities above 5%, and the five models for CSPCa displayed similar net benefit across the threshold probabilities below 40%. The RF (53% and 57%, respectively) and SVM (52% and 55%, respectively) for PCa and CSPCa spared more unnecessary biopsies than logistic regression (35% and 47%, respectively) at 95% sensitivity for detection of CSPCa. Conclusion Machine-learning models (SVM and RF) yielded similar diagnostic accuracy and net benefit, while spared more biopsies at 95% sensitivity for detection of CSPCa, compared with logistic regression. However, no method achieved desired performance. All methods should continue to be explored and used in complementary ways.

Published in BMC Urology

ISSN: 1471-2490 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Specialties of internal medicine: Diseases of the genitourinary system. Urology
Website: http://bmcurol.biomedcentral.com

About the journal

Abstract

Keywords