Revista Română de Informatică și Automatică (Jun 2020)
Machine Learning-Based Techniques for Improving Breast Cancer Detection
Abstract
Breast cancer is one of the most common types of cancer diagnosed in women and the second leading cause of cancer mortality after lung cancer. Diagnosis and prediction of cancer development now rely on advanced methods such as Machine Learning. This article presents research results on Machine Learning applied to the classification of medical data. A set of algorithms was used to classify the Breast Cancer Wisconsin (Diagnostic) database. The algorithms were selected to highlight the performance of Machine Learning techniques in terms of accuracy and precision: Support Vector Machines (SVM), k-Nearest Neighbor (kNN), Multilayer Perceptron (MLP), Decision Tree, Gaussian Naïve Bayes and Random Forest. The most representative features were identified from a set of diagnostic images of fine needle aspirate (FNA) samples. The best accuracy, 97.90%, was obtained with the Random Forest algorithm, which opens the way to further refinement of the classification.
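The comparison described in the abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes scikit-learn, whose bundled `load_breast_cancer` dataset is a copy of the Breast Cancer Wisconsin (Diagnostic) data. The 75/25 stratified split, the feature scaling, the default hyperparameters, and the function name `compare_classifiers` are all assumptions, so the numbers it prints should not be expected to match the 97.90% reported above.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, precision_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def compare_classifiers(test_size=0.25, seed=0):
    """Train the six classifiers named in the abstract and score each one.

    Returns a dict mapping model name to (accuracy, precision).
    The split ratio and seed are illustrative assumptions.
    """
    # 569 samples, 30 numeric features computed from FNA images.
    X, y = load_breast_cancer(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, stratify=y, random_state=seed
    )

    # Distance- and gradient-based models are wrapped with a scaler
    # (an assumption; the abstract does not describe preprocessing).
    models = {
        "SVM": make_pipeline(StandardScaler(), SVC()),
        "kNN": make_pipeline(StandardScaler(), KNeighborsClassifier()),
        "MLP": make_pipeline(
            StandardScaler(), MLPClassifier(max_iter=1000, random_state=seed)
        ),
        "Decision Tree": DecisionTreeClassifier(random_state=seed),
        "Gaussian Naive Bayes": GaussianNB(),
        "Random Forest": RandomForestClassifier(random_state=seed),
    }

    results = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        y_pred = model.predict(X_test)
        accuracy = accuracy_score(y_test, y_pred)
        # In scikit-learn's copy of the data, label 0 encodes "malignant",
        # so precision is computed with malignant as the positive class.
        precision = precision_score(y_test, y_pred, pos_label=0)
        results[name] = (accuracy, precision)
    return results


if __name__ == "__main__":
    for name, (accuracy, precision) in compare_classifiers().items():
        print(f"{name:22s} accuracy={accuracy:.4f}  precision={precision:.4f}")
```

Because the scores depend on the split and the seed, a single run like this is only indicative; repeating it with cross-validation would give a steadier ranking of the six models.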
Keywords