PLoS ONE (Jan 2022)

Determinants of coronavirus disease 2019 infection by artificial intelligence technology: A study of 28 countries.

  • Hsiao-Ya Peng,
  • Yen-Kuang Lin,
  • Phung-Anh Nguyen,
  • Jason C Hsu,
  • Chun-Liang Chou,
  • Chih-Cheng Chang,
  • Chia-Chi Lin,
  • Carlos Lam,
  • Chang-I Chen,
  • Kai-Hsun Wang,
  • Christine Y Lu

DOI
https://doi.org/10.1371/journal.pone.0272546
Journal volume & issue
Vol. 17, no. 8
p. e0272546

Abstract

Read online

ObjectivesThe coronavirus disease 2019 pandemic has affected countries around the world since 2020, and an increasing number of people are being infected. The purpose of this research was to use big data and artificial intelligence technology to find key factors associated with the coronavirus disease 2019 infection. The results can be used as a reference for disease prevention in practice.MethodsThis study obtained data from the "Imperial College London YouGov Covid-19 Behaviour Tracker Open Data Hub", covering a total of 291,780 questionnaire results from 28 countries (April 1~August 31, 2020). Data included basic characteristics, lifestyle habits, disease history, and symptoms of each subject. Four types of machine learning classification models were used, including logistic regression, random forest, support vector machine, and artificial neural network, to build prediction modules. The performance of each module is presented as the area under the receiver operating characteristics curve. Then, this study further processed important factors selected by each module to obtain an overall ranking of determinants.ResultsThis study found that the area under the receiver operating characteristics curve of the prediction modules established by the four machine learning methods were all >0.95, and the RF had the highest performance (area under the receiver operating characteristics curve is 0.988). Top ten factors associated with the coronavirus disease 2019 infection were identified in order of importance: whether the family had been tested, having no symptoms, loss of smell, loss of taste, a history of epilepsy, acquired immune deficiency syndrome, cystic fibrosis, sleeping alone, country, and the number of times leaving home in a day.ConclusionsThis study used big data from 28 countries and artificial intelligence methods to determine the predictors of the coronavirus disease 2019 infection. The findings provide important insights for the coronavirus disease 2019 infection prevention strategies.