Biased Deep Learning Methods in Detection of COVID-19 Using CT Images: A Challenge Mounted by Subject-Wise-Split ISFCT Dataset

Shiva Parsarad; Narges Saeedizadeh; Ghazaleh Jamalipour Soufi; Shamim Shafieyoon; Farzaneh Hekmatnia; Andrew Parviz Zarei; Samira Soleimany; Amir Yousefi; Hengameh Nazari; Pegah Torabi; Abbas S. Milani; Seyed Ali Madani Tonekaboni; Hossein Rabbani; Ali Hekmatnia; Rahele Kafieh

doi:10.3390/jimaging9080159

Journal of Imaging (Aug 2023)

Biased Deep Learning Methods in Detection of COVID-19 Using CT Images: A Challenge Mounted by Subject-Wise-Split ISFCT Dataset

Shiva Parsarad,
Narges Saeedizadeh,
Ghazaleh Jamalipour Soufi,
Shamim Shafieyoon,
Farzaneh Hekmatnia,
Andrew Parviz Zarei,
Samira Soleimany,
Amir Yousefi,
Hengameh Nazari,
Pegah Torabi,
Abbas S. Milani,
Seyed Ali Madani Tonekaboni,
Hossein Rabbani,
Ali Hekmatnia,
Rahele Kafieh

Affiliations

Shiva Parsarad: Medical Image and Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Narges Saeedizadeh: Medical Image and Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Ghazaleh Jamalipour Soufi: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Shamim Shafieyoon: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Farzaneh Hekmatnia: St. George’s Hospital, London SW17 0RE, UK
Andrew Parviz Zarei: St. George’s Hospital, London SW17 0RE, UK
Samira Soleimany: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Amir Yousefi: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Hengameh Nazari: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Pegah Torabi: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Abbas S. Milani: School of Engineering, University of British Columbia, Kelowna, BC V1V 1V7, Canada
Seyed Ali Madani Tonekaboni: Cyclica Inc., Toronto, ON M5J 1A7, Canada
Hossein Rabbani: Medical Image and Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Ali Hekmatnia: Department of Radiology, School of Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran
Rahele Kafieh: Medical Image and Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan JM76+5M3, Iran

DOI: https://doi.org/10.3390/jimaging9080159
Journal volume & issue: Vol. 9, no. 8
p. 159

Abstract

Read online

Accurate detection of respiratory system damage including COVID-19 is considered one of the crucial applications of deep learning (DL) models using CT images. However, the main shortcoming of the published works has been unreliable reported accuracy and the lack of repeatability with new datasets, mainly due to slice-wise splits of the data, creating dependency between training and test sets due to shared data across the sets. We introduce a new dataset of CT images (ISFCT Dataset) with labels indicating the subject-wise split to train and test our DL algorithms in an unbiased manner. We also use this dataset to validate the real performance of the published works in a subject-wise data split. Another key feature provides more specific labels (eight characteristic lung features) rather than being limited to COVID-19 and healthy labels. We show that the reported high accuracy of the existing models on current slice-wise splits is not repeatable for subject-wise splits, and distribution differences between data splits are demonstrated using t-distribution stochastic neighbor embedding. We indicate that, by examining subject-wise data splitting, less complicated models show competitive results compared to the exiting complicated models, demonstrating that complex models do not necessarily generate accurate and repeatable results.

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging

About the journal

Abstract

Keywords