Frontiers in Oncology (Jul 2023)

An explainable machine learning ensemble model to predict the risk of ovarian cancer in BRCA-mutated patients undergoing risk-reducing salpingo-oophorectomy

  • Maria Colomba Comes,
  • Francesca Arezzo,
  • Francesca Arezzo,
  • Gennaro Cormio,
  • Gennaro Cormio,
  • Samantha Bove,
  • Angela Calabrese,
  • Annarita Fanizzi,
  • Anila Kardhashi,
  • Daniele La Forgia,
  • Francesco Legge,
  • Isabella Romagno,
  • Vera Loizzi,
  • Vera Loizzi,
  • Raffaella Massafra

DOI
https://doi.org/10.3389/fonc.2023.1181792
Journal volume & issue
Vol. 13

Abstract

Read online

IntroductionIt has been estimated that 19,880 new cases of ovarian cancer had been diagnosed in 2022. Most epithelial ovarian cancer are sporadic, while in 15%–25% of cases, there is evidence of a familial or inherited component. Approximately 20%–25% of high-grade serous carcinoma cases are caused by germline mutations in the BRCA1 and BRCA2 genes. However, owing to a lack of effective early detection methods, women with BRCA mutations are recommended to undergo bilateral risk-reducing salpingo-oophorectomy (RRSO) after childbearing. Determining the right timing for this procedure is a difficult decision. It is crucial to find a clinical signature to identify high-risk BRCA-mutated patients and determine the appropriate timing for performing RRSO.MethodsIn this work, clinical data referred to a cohort of 184 patients, of whom 7.6% were affected by adnexal tumors including invasive carcinomas and intraepithelial lesions after RSSO has been analyzed. Thus, we proposed an explainable machine learning (ML) ensemble approach using clinical data commonly collected in clinical practice to early identify BRCA-mutated patients at high risk of ovarian cancer and consequentially establish the correct timing for RRSO.ResultsThe ensemble model was able to handle imbalanced data achieving an accuracy value of 83.2%, a specificity value of 85.3%, a sensitivity value of 57.1%, a G-mean value of 69.8%, and an AUC value of 71.1%.DiscussionIn agreement with the promising results achieved, the application of suitable ML techniques could play a key role in the definition of a BRCA-mutated patient-centric clinical signature for ovarian cancer risk and consequently personalize the management of these patients. As far as we know, this is the first work addressing this task from an ML perspective.

Keywords