Value of handcrafted and deep radiomic features towards training robust machine learning classifiers for prediction of prostate cancer disease aggressiveness

Ana Rodrigues; Nuno Rodrigues; João Santinha; Maria V. Lisitskaya; Aycan Uysal; Celso Matos; Inês Domingues; Nickolas Papanikolaou

doi:10.1038/s41598-023-33339-0

Scientific Reports (Apr 2023)

Value of handcrafted and deep radiomic features towards training robust machine learning classifiers for prediction of prostate cancer disease aggressiveness

Ana Rodrigues,
Nuno Rodrigues,
João Santinha,
Maria V. Lisitskaya,
Aycan Uysal,
Celso Matos,
Inês Domingues,
Nickolas Papanikolaou

Affiliations

Ana Rodrigues: Champalimaud Research, Champalimaud Foundation
Nuno Rodrigues: Champalimaud Research, Champalimaud Foundation
João Santinha: Champalimaud Research, Champalimaud Foundation
Maria V. Lisitskaya: Cand. of Sci. (Med.), Radiologist at Radiology Department with CT and MRI, Medical Research and Educational Center, Lomonosov Moscow State University
Aycan Uysal: Gulhane Medical School, University of Health Sciences
Celso Matos: Champalimaud Research, Champalimaud Foundation
Inês Domingues: Instituto Politécnico de Coimbra, Instituto Superior de Engenharia, Rua Pedro Nunes-Quinta da Nora
Nickolas Papanikolaou: Champalimaud Research, Champalimaud Foundation

DOI: https://doi.org/10.1038/s41598-023-33339-0
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 10

Abstract

Read online

Abstract There is a growing piece of evidence that artificial intelligence may be helpful in the entire prostate cancer disease continuum. However, building machine learning algorithms robust to inter- and intra-radiologist segmentation variability is still a challenge. With this goal in mind, several model training approaches were compared: removing unstable features according to the intraclass correlation coefficient (ICC); training independently with features extracted from each radiologist’s mask; training with the feature average between both radiologists; extracting radiomic features from the intersection or union of masks; and creating a heterogeneous dataset by randomly selecting one of the radiologists’ masks for each patient. The classifier trained with this last resampled dataset presented with the lowest generalization error, suggesting that training with heterogeneous data leads to the development of the most robust classifiers. On the contrary, removing features with low ICC resulted in the highest generalization error. The selected radiomics dataset, with the randomly chosen radiologists, was concatenated with deep features extracted from neural networks trained to segment the whole prostate. This new hybrid dataset was then used to train a classifier. The results revealed that, even though the hybrid classifier was less overfitted than the one trained with deep features, it still was unable to outperform the radiomics model.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal