Journal of King Saud University: Science (Apr 2023)

Multi-objective deep learning framework for COVID-19 dataset problems

  • Roa'a Mohammedqasem,
  • Hayder Mohammedqasim,
  • Sardar Asad Ali Biabani,
  • Oguz Ata,
  • Mohammad N. Alomary,
  • Mazen Almehmadi,
  • Ahad Amer Alsairi,
  • Mohammad Azam Ansari

Journal volume & issue
Vol. 35, no. 3
p. 102527

Abstract

Read online

Background: It has been reported that a deadly virus known as COVID-19 has arisen in China and has spread rapidly throughout the country. The globe was shattered, and a large number of people on the planet died. It quickly became an epidemic due to the absence of apparent symptoms and causes for patients, confusion appears due to the lack of sufficient laboratory results, and its intelligent algorithms were used to make decisions on clinical outcomes. Methods: This study developed a new framework for medical datasets with high missing values based on deep-learning optimization models. The robustness of our model is achieved by combining: Data Missing Care (DMC) Framework to overcome the problem of high missing data in medical datasets, and Grid-Search optimization used to develop an improved deep predictive training model for patients with COVID-19 by setting multiple hyperparameters and tuning assessments on three deep learning algorithms: ANN (Artificial Neural Network), CNN (Convolutional Neural Network), and Recurrent Neural Networks (RNN). Results: The experiment results conducted on three medical datasets showed the effectiveness of our hybrid approach and an improvement in accuracy and efficiency since all the evaluation metrics were close to ideal for all deep learning classifiers. We got the best evaluation in terms of accuracy 98%, precession 98.5%, F1-score 98.6%, and ROC Curve (95% to 99%) for the COVID-19 dataset provided by GitHub. The second dataset is also Covid-19 provided by Albert Einstein Hospital with high missing data after applying our approach the accuracy reached more than 91%. Third dataset for Cervical Cancer provided by Kaggle all the evaluation metrics reached more than 95%. Conclusions: The proposed formula for processing this type of data can replace the traditional formats in optimization while providing high accuracy and less time to classify patients. Whereas, the experimental results of our approach, supported by comprehensive statistical analysis, can improve the overall evaluation performance of the problem of classifying medical data sets with high missing values. Therefore, this approach can be used in many areas such as energy management, environment, and medicine.

Keywords