Frontiers in Oncology (Sep 2022)

Invasive ductal breast cancer molecular subtype prediction by MRI radiomic and clinical features based on machine learning

  • Weiyong Sheng,
  • Shouli Xia,
  • Yaru Wang,
  • Lizhao Yan,
  • Songqing Ke,
  • Evelyn Mellisa,
  • Fen Gong,
  • Yun Zheng,
  • Tiansheng Tang

DOI
https://doi.org/10.3389/fonc.2022.964605
Journal volume & issue
Vol. 12

Abstract

Read online

BackgroundMost studies of molecular subtype prediction in breast cancer were mainly based on two-dimensional MRI images, the predictive value of three-dimensional volumetric features from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) for predicting breast cancer molecular subtypes has not been thoroughly investigated. This study aimed to look into the role of features derived from DCE-MRI and how they could be combined with clinical data to predict invasive ductal breast cancer molecular subtypes.MethodsFrom January 2019 to December 2021, 190 Chinese women with invasive ductal breast cancer were studied (32 triple-negative, 59 HER2-enriched, and 99 luminal lesions) in this institutional review board-approved retrospective cohort study. The image processing software extracted 1130 quantitative radiomic features from the segmented lesion area, including shape-based, first-order statistical, texture, and wavelet features. Three binary classifications of the subtypes were performed: triple-negative vs. non-triple-negative, HER2-overexpressed vs. non-HER2-overexpressed, and luminal (A + B) vs. non-luminal. For the classification, five machine learning methods (random forest, logistic regression, support vector machine, naïve Bayes, and eXtreme Gradient Boosting) were employed. The classifiers were chosen using the least absolute shrinkage and selection operator method. The area evaluated classification performance under the receiver operating characteristic curve, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean.ResultsEXtreme Gradient Boosting model showed the best performance in luminal and non-luminal groups, with AUC, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean of 0.8282, 0.7524, 0.6542, 0.6964, 0.6086, 0.3458, 0.8524 and 0.7016, respectively. Meanwhile, the random forest model showed the best performance in HER2-overexpressed and non-HER2-overexpressed groups, with AUC, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean of 0.8054, 0.2941, 0.9744, 0.7679, 0.4348, 0.0256, 0.8333 and 0.5353, respectively. Furthermore, eXtreme Gradient Boosting model showed the best performance in the triple-negative and non-triple-negative groups, with AUC, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean of 0.9031, 0.9362, 0.4444, 0.8571, 0.9167, 0.5556, 0.8980 and 0.6450.ConclusionClinical data and three-dimension imaging features from DCE-MRI were identified as potential biomarkers for distinguishing between three molecular subtypes of invasive ductal carcinomas breast cancer. In the future, more extensive studies will be required to evaluate the findings.

Keywords