Cancer Medicine (Jun 2024)

An explainable machine learning model to solid adnexal masses diagnosis based on clinical data and qualitative ultrasound indicators

  • Annarita Fanizzi,
  • Francesca Arezzo,
  • Gennaro Cormio,
  • Maria Colomba Comes,
  • Gerardo Cazzato,
  • Luca Boldrini,
  • Samantha Bove,
  • Michele Bollino,
  • Anila Kardhashi,
  • Erica Silvestris,
  • Pietro Quarto,
  • Michele Mongelli,
  • Emanuele Naglieri,
  • Rahel Signorile,
  • Vera Loizzi,
  • Raffaella Massafra

DOI
https://doi.org/10.1002/cam4.7425
Journal volume & issue
Vol. 13, no. 12
pp. n/a – n/a

Abstract

Read online

Abstract Background Accurate characterization of newly diagnosed a solid adnexal lesion is a key step in defining the most appropriate therapeutic approach. Despite guidance from the International Ovarian Tumor Analyzes Panel, the evaluation of these lesions can be challenging. Recent studies have demonstrated how machine learning techniques can be applied to clinical data to solve this diagnostic problem. However, ML models can often consider as black‐boxes due to the difficulty of understanding the decision‐making process used by the algorithm to obtain a specific result. Aims For this purpose, we propose an Explainable Artificial Intelligence model trained on clinical characteristics and qualitative ultrasound indicators to predict solid adnexal masses diagnosis. Materials & Methods Since the diagnostic task was a three‐class problem (benign tumor, invasive cancer, or ovarian metastasis), we proposed a waterfall classification model: a first model was trained and validated to discriminate benign versus malignant, a second model was trained to distinguish nonmetastatic versus metastatic malignant lesion which occurs when a patient is predicted to be malignant by the first model. Firstly, a stepwise feature selection procedure was implemented. The classification performances were validated on Leave One Out scheme. Results The accuracy of the three‐class model reaches an overall accuracy of 86.36%, and the precision per‐class of the benign, nonmetastatic malignant, and metastatic malignant classes were 86.96%, 87.27%, and 77.78%, respectively. Discussion: SHapley Additive exPlanations were performed to visually show how the machine learning model made a specific decision. For each patient, the SHAP values expressed how each characteristic contributed to the classification result. Such information represents an added value for the clinical usability of a diagnostic system. Conclusions This is the first work that attempts to design an explainable machine‐learning tool for the histological diagnosis of solid masses of the ovary.

Keywords