Frontiers in Oncology (Aug 2022)

Machine learning model for the prediction of prostate cancer in patients with low prostate-specific antigen levels: A multicenter retrospective analysis

  • Xiaobin Deng,
  • Xiaobin Deng,
  • Tianyu Li,
  • Tianyu Li,
  • Tianyu Li,
  • Linjian Mo,
  • Linjian Mo,
  • Linjian Mo,
  • Fubo Wang,
  • Fubo Wang,
  • Fubo Wang,
  • Jin Ji,
  • Xing He,
  • Bashir Hussein Mohamud,
  • Bashir Hussein Mohamud,
  • Swadhin Pradhan,
  • Swadhin Pradhan,
  • Jiwen Cheng,
  • Jiwen Cheng,
  • Jiwen Cheng

DOI
https://doi.org/10.3389/fonc.2022.985940
Journal volume & issue
Vol. 12

Abstract

Read online

ObjectiveThe aim of this study was to develop a predictive model to improve the accuracy of prostate cancer (PCa) detection in patients with prostate specific antigen (PSA) levels ≤20 ng/mL at the initial puncture biopsy.MethodsA total of 146 patients (46 with Pca, 31.5%) with PSA ≤20 ng/mL who had undergone transrectal ultrasound-guided 12+X prostate puncture biopsy with clear pathological results at the First Affiliated Hospital of Guangxi Medical University (November 2015 to December 2021) were retrospectively evaluated. The validation group was 116 patients drawn from Changhai Hospital(52 with Pca, 44.8%). Age, body mass index (BMI), serum PSA, PSA-derived indices, several peripheral blood biomarkers, and ultrasound findings were considered as predictive factors and were analyzed by logistic regression. Significant predictors (P < 0.05) were included in five machine learning algorithm models. The performance of the models was evaluated by receiver operating characteristic curves. Decision curve analysis (DCA) was performed to estimate the clinical utility of the models. Ten-fold cross-validation was applied in the training process.ResultsProstate-specific antigen density, alanine transaminase-to-aspartate transaminase ratio, BMI, and urine red blood cell levels were identified as independent predictors for the differential diagnosis of PCa according to multivariate logistic regression analysis. The RandomForest model exhibited the best predictive performance and had the highest net benefit when compared with the other algorithms, with an area under the curve of 0.871. In addition, DCA had the highest net benefit across the whole range of cut-off points examined.ConclusionThe RandomForest-based model generated showed good prediction ability for the risk of PCa. Thus, this model could help urologists in the treatment decision-making process.

Keywords