Data (Feb 2023)

Accuracy Assessment of Machine Learning Algorithms Used to Predict Breast Cancer

  • Mohamed Ebrahim,
  • Ahmed Ahmed Hesham Sedky,
  • Saleh Mesbah

DOI
https://doi.org/10.3390/data8020035
Journal volume & issue
Vol. 8, no. 2
p. 35

Abstract

Read online

Machine learning (ML) was used to develop classification models to predict individual tumor patients’ outcomes. Binary classification defined whether the tumor was malignant or benign. This paper presents a comparative analysis of machine learning algorithms used for breast cancer prediction. This study used a dataset obtained from the National Cancer Institute (NIH), USA, which contains 1.7 million data records. Classical and deep learning methods were included in the accuracy assessment. Classical decision tree (DT), linear discriminant (LD), logistic regression (LR), support vector machine (SVM), and ensemble techniques (ET) algorithms were used. Probabilistic neural network (PNN), deep neural network (DNN), and recurrent neural network (RNN) methods were used for comparison. Feature selection and its effect on accuracy were also investigated. The results showed that decision trees and ensemble techniques outperformed the other techniques, as they both achieved a 98.7% accuracy.

Keywords