Computer Methods and Programs in Biomedicine Update (Jan 2023)

Machine learning-based diagnosis of breast cancer utilizing feature optimization technique

  • Khandaker Mohammad Mohi Uddin,
  • Nitish Biswas,
  • Sarreha Tasmin Rikta,
  • Samrat Kumar Dey

Journal volume & issue
Vol. 3
p. 100098

Abstract

Breast cancer is recognized as one of the leading causes of death in women worldwide, after lung cancer. It is a malignant neoplasm that develops from breast cells, and it affects developed and less developed countries alike. Patients can recover if the cancer is detected at an early stage. In recent years, many researchers have proposed machine learning techniques to predict breast cancer with high accuracy. In this research work, the Wisconsin Breast Cancer Dataset (WBCD) from the UCI Machine Learning Repository has been used as a training set to compare the performance of various machine learning techniques. Several machine learning classifiers, namely Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (K-NN), Decision Tree (DT), Naïve Bayes (NB), Logistic Regression (LR), AdaBoost (AB), Gradient Boosting (GB), Multi-Layer Perceptron (MLP), Nearest Cluster Classifier (NCC), and Voting Classifier (VC), have been compared and analyzed for classifying breast tumors as benign or malignant. Various metrics, such as error rate, accuracy, precision, F1-score, and recall, have been used to measure each model's performance. The accuracy of each algorithm has been evaluated to find the most suitable one. The results show that the voting classifier achieves the highest accuracy, 98.77%, with the lowest error rate. Finally, a web application integrating the best model is developed using the Flask micro-framework and React.
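
To make the evaluation described above concrete, the following minimal Python sketch shows how a voting classifier can combine the predictions of several base models and how the listed metrics follow from the resulting confusion counts. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say whether hard or soft voting was used, so hard (majority) voting is assumed here; the function names majority_vote and binary_metrics are hypothetical; and the labels and per-model predictions are made-up toy values, not WBCD data or results from the study.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Hard voting: for each sample, return the label most models predicted.

    predictions_per_model is a list of equal-length label lists, one per model.
    (Assumption: hard voting; the abstract does not specify the voting scheme.)
    """
    voted = []
    for sample_preds in zip(*predictions_per_model):
        voted.append(Counter(sample_preds).most_common(1)[0][0])
    return voted


def binary_metrics(y_true, y_pred, positive="M"):
    """Compute accuracy, error rate, precision, recall and F1 for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {
        "accuracy": accuracy,
        "error_rate": 1.0 - accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Toy data (illustrative only): "M" = malignant, "B" = benign.
y_true = ["M", "M", "M", "B", "B", "B", "B", "B"]
model_a = ["M", "M", "B", "B", "B", "B", "B", "M"]
model_b = ["M", "B", "M", "B", "B", "B", "B", "B"]
model_c = ["M", "M", "M", "B", "B", "M", "B", "M"]

voted = majority_vote([model_a, model_b, model_c])
metrics = binary_metrics(y_true, voted, positive="M")

for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

In this toy example the ensemble corrects individual mistakes that only one base model makes, which is the usual motivation for voting; whether it outperforms every base model on real data depends on how correlated their errors are.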

Keywords