Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation: A Data Augmentation and Balancing Framework
Arun Kumar Dubey,
Gian Luca Chabert,
Alessandro Carriero,
Alessio Pasche,
Pietro S. C. Danna,
Sushant Agarwal,
Lopamudra Mohanty,
Nillmani,
Neeraj Sharma,
Sarita Yadav,
Achin Jain,
Ashish Kumar,
Mannudeep K. Kalra,
David W. Sobel,
John R. Laird,
Inder M. Singh,
Narpinder Singh,
George Tsoulfas,
Mostafa M. Fouda,
Azra Alizad,
George D. Kitas,
Narendra N. Khanna,
Klaudija Viskovic,
Melita Kukuljan,
Mustafa Al-Maini,
Ayman El-Baz,
Luca Saba,
Jasjit S. Suri
Affiliations
Arun Kumar Dubey
Bharati Vidyapeeth’s College of Engineering, New Delhi 110063, India
Gian Luca Chabert
Department of Radiology, Azienda Ospedaliero Universitaria (A.O.U.), 09123 Cagliari, Italy
Alessandro Carriero
Department of Radiology, Azienda Ospedaliero Universitaria (A.O.U.), 09123 Cagliari, Italy
Alessio Pasche
Department of Radiology, “Maggiore della Carità” Hospital, University of Piemonte Orientale, Via Solaroli 17, 28100 Novara, Italy
Pietro S. C. Danna
Department of Radiology, “Maggiore della Carità” Hospital, University of Piemonte Orientale, Via Solaroli 17, 28100 Novara, Italy
Sushant Agarwal
Advanced Knowledge Engineering Centre, GBTI, Roseville, CA 95661, USA
Lopamudra Mohanty
ABES Engineering College, Ghaziabad 201009, India
Nillmani
School of Biomedical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, India
Neeraj Sharma
School of Biomedical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, India
Sarita Yadav
Bharati Vidyapeeth’s College of Engineering, New Delhi 110063, India
Achin Jain
Bharati Vidyapeeth’s College of Engineering, New Delhi 110063, India
Ashish Kumar
Department of Computer Science Engineering, Bennett University, Greater Noida 201310, India
Mannudeep K. Kalra
Department of Radiology, Massachusetts General Hospital, Boston, MA 02115, USA
David W. Sobel
Men’s Health Centre, Miriam Hospital Providence, Providence, RI 02906, USA
John R. Laird
Heart and Vascular Institute, Adventist Health St. Helena, St. Helena, CA 94574, USA
Inder M. Singh
Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA
Narpinder Singh
Department of Food Science and Technology, Graphic Era, Deemed to be University, Dehradun 248002, India
George Tsoulfas
Department of Surgery, Aristoteleion University of Thessaloniki, 54124 Thessaloniki, Greece
Mostafa M. Fouda
Department of Electrical and Computer Engineering, Idaho State University, Pocatello, ID 83209, USA
Azra Alizad
Department of Physiology & Biomedical Engineering, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
George D. Kitas
Academic Affairs, Dudley Group NHS Foundation Trust, Dudley DY1 2HQ, UK
Narendra N. Khanna
Department of Cardiology, Indraprastha APOLLO Hospitals, New Delhi 110001, India
Klaudija Viskovic
Department of Radiology and Ultrasound, University Hospital for Infectious Diseases, 10000 Zagreb, Croatia
Melita Kukuljan
Department of Interventional and Diagnostic Radiology, Clinical Hospital Center Rijeka, 51000 Rijeka, Croatia
Mustafa Al-Maini
Allergy, Clinical Immunology & Rheumatology Institute, Toronto, ON L4Z 4C4, Canada
Ayman El-Baz
Biomedical Engineering Department, University of Louisville, Louisville, KY 40292, USA
Luca Saba
Department of Radiology, Azienda Ospedaliero Universitaria (A.O.U.), 09123 Cagliari, Italy
Jasjit S. Suri
Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA
Background and motivation: Lung computed tomography (CT) techniques are high-resolution and are well adopted in the intensive care unit (ICU) for COVID-19 disease control classification. Most artificial intelligence (AI) systems do not undergo generalization and are typically overfitted. Such trained AI systems are not practical for clinical settings and therefore do not give accurate results when executed on unseen data sets. We hypothesize that ensemble deep learning (EDL) is superior to deep transfer learning (TL) in both non-augmented and augmented frameworks. Methodology: The system consists of a cascade of quality control, ResNet–UNet-based hybrid deep learning for lung segmentation, and seven models using TL-based classification followed by five types of EDL’s. To prove our hypothesis, five different kinds of data combinations (DC) were designed using a combination of two multicenter cohorts—Croatia (80 COVID) and Italy (72 COVID and 30 controls)—leading to 12,000 CT slices. As part of generalization, the system was tested on unseen data and statistically tested for reliability/stability. Results: Using the K5 (80:20) cross-validation protocol on the balanced and augmented dataset, the five DC datasets improved TL mean accuracy by 3.32%, 6.56%, 12.96%, 47.1%, and 2.78%, respectively. The five EDL systems showed improvements in accuracy of 2.12%, 5.78%, 6.72%, 32.05%, and 2.40%, thus validating our hypothesis. All statistical tests proved positive for reliability and stability. Conclusion: EDL showed superior performance to TL systems for both (a) unbalanced and unaugmented and (b) balanced and augmented datasets for both (i) seen and (ii) unseen paradigms, validating both our hypotheses.