International Journal of Data and Network Science (Jan 2024)

Diagnosing diabetes mellitus using machine learning techniques

  • Mazen Alzyoud,
  • Raed Alazaidah,
  • Mohammad Aljaidi,
  • Ghassan Samara,
  • Mais Haj Qasem,
  • Muhammad Khalid,
  • Najah Al-Shanableh

DOI
https://doi.org/10.5267/j.ijdns.2023.10.006
Journal volume & issue
Vol. 8, no. 1
pp. 179 – 188

Abstract

Read online

Diabetes Mellitus (DM) is a frequent condition in which the body's sugar levels are abnormally high for an extended length of time. It is a major cause of death with high mortality rates and the second leading cause of total years lived with disability worldwide. Its seriousness comes from its long-term complications, including nephropathy, retinopathy, and neuropathy leading to kidney failure, poor vision and blindness, and peripheral sensory loss, respectively. Such conditions are life-threatening and affect patients’ quality of life. Therefore, this paper aims to identify the most relevant features in the diagnosis of DM and identify the best classifier that can efficiently diagnose DM based on a set of relevant features. To achieve this, four different feature selection methods have been utilized. Moreover, twelve different classifiers that belong to six learning strategies have been evaluated using two datasets and several evaluation metrics such as Accuracy, Precision, Recall, F1-measure, and ROC area. The obtained results revealed that the correlation attribute evaluation method would be the best choice to handle the task of feature selection and ranking for the considered datasets, especially when considering the Accuracy metric. Furthermore, MultiClassClassifier would be the best classifier to handle Diabetes datasets, especially when considering True Positive, precision, and Recall metrics.