Dyna (May 2023)

Classification of COVID-19 associated symptomatology using machine learning

  • Julian Andres Ramirez-Bautista,
  • Silvia L. Chaparro-Cárdenas ,
  • Wilson Gamboa-Contreras,
  • William Guerrero-Salazar ,
  • Jorge Adalberto Huerta-Ruelas

DOI
https://doi.org/10.15446/dyna.v90n226.105616
Journal volume & issue
Vol. 90, no. 226

Abstract

Read online

The health situation caused by the SARS-Cov2 coronavirus, posed major challenges for the scientific community. Advances in artificial intelligence are a very useful resource, but it is important to determine which symptoms presented by positive cases of infection are the best predictors. A machine learning approach was used with data from 5,434 people, with eleven symptoms: breathing problems, dry cough, sore throat, running nose, history of asthma, chronic lung, headache, heart disease, hypertension, diabetes, and fever. Based on public data from Kaggle with WHO standardized symptoms. A model was developed to detect COVID-19 positive cases using a simple machine learning model. The results of 4 loss functions and by SHAP values, were compared. The best loss function was Binary Cross Entropy, with a single hidden layer configuration with 10 neurons, achieving an F1 score of 0.98 and the model was rated with an area under the curve of 0.99 aucROC.

Keywords