Journal of the Egyptian National Cancer Institute (Mar 2025)

Prediction of prostate biopsy outcomes at different cut-offs of prostate-specific antigen using machine learning: a multicenter study

  • Mostafa A. Arafa,
  • Karim H. Farhat,
  • Sherin F. Aly,
  • Farrukh K. Khan,
  • Alaa Mokhtar,
  • Abdulaziz M. Althunayan,
  • Waleed Al-Taweel,
  • Sultan S. Al-Khateeb,
  • Sami Azhari,
  • Danny M. Rabah

DOI
https://doi.org/10.1186/s43046-025-00265-3
Journal volume & issue
Vol. 37, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background Machine learning (ML) is a significant area of artificial intelligence, which can improve the accuracy of predictive or diagnostic models for differentiating between prostate biopsy outcomes. This study aims to develop a novel decision-support ML model for classifying patients with biopsy-negative (cancer-free), clinically significant, and non-clinically significant prostate cancer across two prostate-specific antigen (PSA) cut-offs ≤ 10 ng/ml and > 10 ng/ml. Methods The data for the current study were retrieved from the records of two main hospitals in Riyadh, Saudi Arabia from July 2018 through July 2024. Six machine learning algorithms were employed, and the dataset was randomly divided into a training set and a validation set at a ratio of 8:2. The following metrics were used as performance indicators across the six algorithms: Accuracy, Precision, Recall, F1-score, and area under the curve. Recent data from the two hospitals was utilized for external validation. Results The metrics for Random Forest, Extra Tree, and Decision Tree algorithms showed excellent capability in classifying the outcomes of prostate biopsy for the two PSA cut-offs. However, the metrics for the PSA cut-off > 10 ng/ml are higher than those for PSA ≤ 10 ng/ml. For the three-class classification, the accuracy and area under the curve for the cut-off > 10 ng/ml were 0.96 and 0.99, respectively. While for the cut-off ≤ 10 ng/ml they were 0.92 and 0.94 for Random Forest and 0.94 and 0.95 for the Extra Tree algorithm. The metrics of non-clinically significant and biopsy-negative cases outperformed those of clinically significant cases. Conclusion ML models are proving to be effective tools in differentiating between prostate biopsy outcomes, enhancing diagnostic accuracy, and potentially transforming clinical practices in prostate cancer management.

Keywords