PLoS ONE (Jan 2023)

Optimization and validation of 18F-DCFPyL PET radiomics-based machine learning models in intermediate- to high-risk primary prostate cancer.

  • Wietske I Luining,
  • Daniela E Oprea-Lager,
  • André N Vis,
  • Reindert J A van Moorselaar,
  • Remco J J Knol,
  • Maurits Wondergem,
  • Ronald Boellaard,
  • Matthijs C F Cysouw

DOI
https://doi.org/10.1371/journal.pone.0293672
Journal volume & issue
Vol. 18, no. 11
p. e0293672

Abstract

Read online

IntroductionRadiomics extracted from prostate-specific membrane antigen (PSMA)-PET modeled with machine learning (ML) may be used for prediction of disease risk. However, validation of previously proposed approaches is lacking. We aimed to optimize and validate ML models based on 18F-DCFPyL-PET radiomics for the prediction of lymph-node involvement (LNI), extracapsular extension (ECE), and postoperative Gleason score (GS) in primary prostate cancer (PCa) patients.MethodsPatients with intermediate- to high-risk PCa who underwent 18F-DCFPyL-PET/CT before radical prostatectomy with pelvic lymph-node dissection were evaluated. The training dataset included 72 patients, the internal validation dataset 24 patients, and the external validation dataset 27 patients. PSMA-avid intra-prostatic lesions were delineated semi-automatically on PET and 480 radiomics features were extracted. Conventional PET-metrics were derived for comparative analysis. Segmentation, preprocessing, and ML methods were optimized in repeated 5-fold cross-validation (CV) on the training dataset. The trained models were tested on the combined validation dataset. Combat harmonization was applied to external radiomics data. Model performance was assessed using the receiver-operating-characteristics curve (AUC).ResultsThe CV-AUCs in the training dataset were 0.88, 0.79 and 0.84 for LNI, ECE, and GS, respectively. In the combined validation dataset, the ML models could significantly predict GS with an AUC of 0.78 (p0.05) and ECE (0.66, p>0.05), but a lower AUC for GS (0.73, pConclusionIn internal and external validation, 18F-DCFPyL-PET radiomics-based ML models predicted high postoperative GS but not LNI or ECE in intermediate- to high-risk PCa. Therefore, the clinical benefit seems to be limited. These results underline the need for external and/or multicenter validation of PET radiomics-based ML model analyses to assess their generalizability.