Novel large empirical study of deep transfer learning for COVID-19 classification based on CT and X-ray images

Mansour Almutaani; Turki Turki; Y.-H. Taguchi

doi:10.1038/s41598-024-76498-4

Scientific Reports (Nov 2024)

Novel large empirical study of deep transfer learning for COVID-19 classification based on CT and X-ray images

Mansour Almutaani,
Turki Turki,
Y.-H. Taguchi

Affiliations

Mansour Almutaani: Department of Computer Science, King Abdulaziz University
Turki Turki: Department of Computer Science, King Abdulaziz University
Y.-H. Taguchi: Department of Physics, Chuo University

DOI: https://doi.org/10.1038/s41598-024-76498-4
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 17

Abstract

Read online

Abstract The early and highly accurate prediction of COVID-19 based on medical images can speed up the diagnostic process and thereby mitigate disease spread; therefore, developing AI-based models is an inevitable endeavor. The presented work, to our knowledge, is the first to expand the model space and identify a better performing model among 10,000 constructed deep transfer learning (DTL) models as follows. First, we downloaded and processed 4481 CT and X-ray images pertaining to COVID-19 and non-COVID-19 patients, obtained from the Kaggle repository. Second, we provide processed images as inputs to four pre-trained deep learning models (ConvNeXt, EfficientNetV2, DenseNet121, and ResNet34) on more than a million images from the ImageNet database, in which we froze the convolutional and pooling layers pertaining to the feature extraction part while unfreezing and training the densely connected classifier with the Adam optimizer. Third, we generate and take a majority vote of two, three, and four combinations from the four DTL models, resulting in $$\sum\nolimits_{r = 2}^{4} {\left( {\begin{array}{*{20}c} 4 \\ r \\ \end{array} } \right)} = 11$$ DTL models. Then, we combine the 11 DTL models, followed by consecutively generating and taking the majority vote of $$\sum\nolimits_{r = 2}^{11} {\left( {\begin{array}{*{20}c} {11} \\ r \\ \end{array} } \right)} = 2036$$ DTL models. Finally, we select $$7953$$ DTL models from $$\left( {\begin{array}{*{20}c} {2036} \\ 2 \\ \end{array} } \right).$$ Experimental results from the whole datasets using five-fold cross-validation demonstrate that the best generated DTL model, named HC, achieving the best AUC of 0.909 when applied to the CT dataset, while ConvNeXt yielded a higher marginal AUC of 0.933 compared to 0.93 for HX when considering the X-ray dataset. These promising results set the foundation for promoting the large generation of models (LGM) in AI.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords