An explainable machine learning model to solid adnexal masses diagnosis based on clinical data and qualitative ultrasound indicators

Annarita Fanizzi; Francesca Arezzo; Gennaro Cormio; Maria Colomba Comes; Gerardo Cazzato; Luca Boldrini; Samantha Bove; Michele Bollino; Anila Kardhashi; Erica Silvestris; Pietro Quarto; Michele Mongelli; Emanuele Naglieri; Rahel Signorile; Vera Loizzi; Raffaella Massafra

doi:10.1002/cam4.7425

Cancer Medicine (Jun 2024)

An explainable machine learning model to solid adnexal masses diagnosis based on clinical data and qualitative ultrasound indicators

Annarita Fanizzi,
Francesca Arezzo,
Gennaro Cormio,
Maria Colomba Comes,
Gerardo Cazzato,
Luca Boldrini,
Samantha Bove,
Michele Bollino,
Anila Kardhashi,
Erica Silvestris,
Pietro Quarto,
Michele Mongelli,
Emanuele Naglieri,
Rahel Signorile,
Vera Loizzi,
Raffaella Massafra

Affiliations

Annarita Fanizzi: Laboratorio Biostatistica e Bioinformatica I.R.C.C.S. Istituto Tumori ‘Giovanni Paolo II’ Bari Italy
Francesca Arezzo: Gynecologic Oncology Unit IRCCS Istituto Tumori “Giovanni Paolo II” Bari Italy
Gennaro Cormio: Gynecologic Oncology Unit IRCCS Istituto Tumori “Giovanni Paolo II” Bari Italy
Maria Colomba Comes: Laboratorio Biostatistica e Bioinformatica I.R.C.C.S. Istituto Tumori ‘Giovanni Paolo II’ Bari Italy
Gerardo Cazzato: Section of Molecular Pathology, Department of Emergency and Organ Transplantation University of Bari “Aldo Moro” Bari Italy
Luca Boldrini: Fondazione Policlinico Universitario “A. Gemelli” IRCCS Italy
Samantha Bove: Laboratorio Biostatistica e Bioinformatica I.R.C.C.S. Istituto Tumori ‘Giovanni Paolo II’ Bari Italy
Michele Bollino: Department of Obstetrics and Gynecology, Division of Gynecologic oncology, Skåne University Hospital and Lund University Faculty of Medicine, Clinical Sciences Lund Sweden
Anila Kardhashi: Gynecologic Oncology Unit IRCCS Istituto Tumori “Giovanni Paolo II” Bari Italy
Erica Silvestris: Gynecologic Oncology Unit IRCCS Istituto Tumori “Giovanni Paolo II” Bari Italy
Pietro Quarto: Gynecologic Oncology Unit IRCCS Istituto Tumori “Giovanni Paolo II” Bari Italy
Michele Mongelli: Department of Precision and Regenerative Medicine – Ionian Area University of Bari “Aldo Moro” Bari Italy
Emanuele Naglieri: Medical Oncology Unit, IRCCS Istituto Tumori Giovanni Paolo II Bari Italy
Rahel Signorile: Laboratorio Biostatistica e Bioinformatica I.R.C.C.S. Istituto Tumori ‘Giovanni Paolo II’ Bari Italy
Vera Loizzi: Gynecologic Oncology Unit IRCCS Istituto Tumori “Giovanni Paolo II” Bari Italy
Raffaella Massafra: Laboratorio Biostatistica e Bioinformatica I.R.C.C.S. Istituto Tumori ‘Giovanni Paolo II’ Bari Italy

DOI: https://doi.org/10.1002/cam4.7425
Journal volume & issue: Vol. 13, no. 12
pp. n/a – n/a

Abstract

Read online

Abstract Background Accurate characterization of newly diagnosed a solid adnexal lesion is a key step in defining the most appropriate therapeutic approach. Despite guidance from the International Ovarian Tumor Analyzes Panel, the evaluation of these lesions can be challenging. Recent studies have demonstrated how machine learning techniques can be applied to clinical data to solve this diagnostic problem. However, ML models can often consider as black‐boxes due to the difficulty of understanding the decision‐making process used by the algorithm to obtain a specific result. Aims For this purpose, we propose an Explainable Artificial Intelligence model trained on clinical characteristics and qualitative ultrasound indicators to predict solid adnexal masses diagnosis. Materials & Methods Since the diagnostic task was a three‐class problem (benign tumor, invasive cancer, or ovarian metastasis), we proposed a waterfall classification model: a first model was trained and validated to discriminate benign versus malignant, a second model was trained to distinguish nonmetastatic versus metastatic malignant lesion which occurs when a patient is predicted to be malignant by the first model. Firstly, a stepwise feature selection procedure was implemented. The classification performances were validated on Leave One Out scheme. Results The accuracy of the three‐class model reaches an overall accuracy of 86.36%, and the precision per‐class of the benign, nonmetastatic malignant, and metastatic malignant classes were 86.96%, 87.27%, and 77.78%, respectively. Discussion: SHapley Additive exPlanations were performed to visually show how the machine learning model made a specific decision. For each patient, the SHAP values expressed how each characteristic contributed to the classification result. Such information represents an added value for the clinical usability of a diagnostic system. Conclusions This is the first work that attempts to design an explainable machine‐learning tool for the histological diagnosis of solid masses of the ovary.

Published in Cancer Medicine

ISSN: 2045-7634 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://onlinelibrary.wiley.com/journal/20457634

About the journal

Abstract

Keywords