IEEE Access (Jan 2020)

Performance Analysis of Classification Algorithms on Birth Dataset

  • Syed Ali Abbas,
  • Aqeel Ur Rehman,
  • Fiaz Majeed,
  • Abdul Majid,
  • M. Sheraz Arshed Malik,
  • Zaki Hassan Kazmi,
  • Seemab Zafar

DOI
https://doi.org/10.1109/ACCESS.2020.2999899
Journal volume & issue
Vol. 8
pp. 102146 – 102154

Abstract

Read online

Generating intuitions from data using data mining and machine learning algorithms to predict outcomes is useful area of computing. The application area of data mining techniques and machine learning is wide ranging including industries, healthcare, organizations, academics etc. A continuous improvement is witnessed due to an ongoing research, as seen particularly in healthcare. Several researchers have applied machine learning to develop decision support systems, perform analysis of dominant clinical factors, extraction of useful information from hideous patterns in historical data, making predictions and disease classification. Successful researches created opportunities for physicians to take appropriate decision at right time. In current study, we intend to utilize the learning capability of machine learning methods towards the classification of birth data using bagging and boosting classification algorithms. It is obvious that differences in living styles, medical assistances, religious implications and the region you live in collectively affect the residents of that society. This motive has encouraged the researchers to conduct studies at regional levels to comprehensively explore the associated medical factors that contribute towards complications among women during pregnancy. The current study is a comprehensive comparison of bagging and boosting classification algorithms performed on birth data collected from the government hospitals of city Muzaffarabad, Kashmir. The experimental tasks are carried out using caret package in R which is considered an inclusive framework for building machine learning models. Accuracy based results with different evaluation measures are presented. Bagging functions including Adabag and BagFda performed marginally better in terms of accuracy, precision and recall. Improvements are observed in comparison to previous study performed on same dataset.

Keywords