Applied Sciences (Dec 2021)

Classification of Breast Cancer in Mammograms with Deep Learning Adding a Fifth Class

  • Salvador Castro-Tapia,
  • Celina Lizeth Castañeda-Miranda,
  • Carlos Alberto Olvera-Olvera,
  • Héctor A. Guerrero-Osuna,
  • José Manuel Ortiz-Rodriguez,
  • Ma. del Rosario Martínez-Blanco,
  • Germán Díaz-Florez,
  • Jorge Domingo Mendiola-Santibañez,
  • Luis Octavio Solís-Sánchez

DOI
https://doi.org/10.3390/app112311398
Journal volume & issue
Vol. 11, no. 23
p. 11398

Abstract

Read online

Breast cancer is one of the diseases of most profound concern, with the most prevalence worldwide, where early detections and diagnoses play the leading role against this disease achieved through imaging techniques such as mammography. Radiologists tend to have a high false positive rate for mammography diagnoses and an accuracy of around 82%. Currently, deep learning (DL) techniques have shown promising results in the early detection of breast cancer by generating computer-aided diagnosis (CAD) systems implementing convolutional neural networks (CNNs). This work focuses on applying, evaluating, and comparing the architectures: AlexNet, GoogLeNet, Resnet50, and Vgg19 to classify breast lesions after using transfer learning with fine-tuning and training the CNN with regions extracted from the MIAS and INbreast databases. We analyzed 14 classifiers, involving 4 classes as several researches have done it before, corresponding to benign and malignant microcalcifications and masses, and as our main contribution, we also added a 5th class for the normal tissue of the mammary parenchyma increasing the correct detection; in order to evaluate the architectures with a statistical analysis based on the received operational characteristics (ROC), the area under the curve (AUC), F1 Score, accuracy, precision, sensitivity, and specificity. We generate the best results with the CNN GoogLeNet trained with five classes on a balanced database with an AUC of 99.29%, F1 Score of 91.92%, the accuracy of 91.92%, precision of 92.15%, sensitivity of 91.70%, and specificity of 97.66%, concluding that GoogLeNet is optimal as a classifier in a CAD system to deal with breast cancer.

Keywords