MNet-10: A robust shallow convolutional neural network model performing ablation study on medical images assessing the effectiveness of applying optimal data augmentation technique

Sidratul Montaha; Sami Azam; A. K. M. Rakibul Haque Rafid; Md. Zahid Hasan; Asif Karim; Khan Md. Hasib; Shobhit K. Patel; Mirjam Jonkman; Zubaer Ibna Mannan

doi:10.3389/fmed.2022.924979

Frontiers in Medicine (Aug 2022)

MNet-10: A robust shallow convolutional neural network model performing ablation study on medical images assessing the effectiveness of applying optimal data augmentation technique

Sidratul Montaha,
Sami Azam,
A. K. M. Rakibul Haque Rafid,
Md. Zahid Hasan,
Asif Karim,
Khan Md. Hasib,
Shobhit K. Patel,
Mirjam Jonkman,
Zubaer Ibna Mannan

Affiliations

Sidratul Montaha: Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
Sami Azam: College of Engineering, IT & Environment, Charles Darwin University, Darwin, NT, Australia
A. K. M. Rakibul Haque Rafid: Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
Md. Zahid Hasan: Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
Asif Karim: College of Engineering, IT & Environment, Charles Darwin University, Darwin, NT, Australia
Khan Md. Hasib: Department of Computer Science and Engineering, Ahsanullah University of Science and Technology, Dhaka, Bangladesh
Shobhit K. Patel: Department of Computer Engineering, Marwadi University, Rajkot, India
Mirjam Jonkman: College of Engineering, IT & Environment, Charles Darwin University, Darwin, NT, Australia
Zubaer Ibna Mannan: Department of Smart Computing, Kyungdong University – Global Campus, Sokcho-si, South Korea

DOI: https://doi.org/10.3389/fmed.2022.924979
Journal volume & issue: Vol. 9

Abstract

Read online

Interpretation of medical images with a computer-aided diagnosis (CAD) system is arduous because of the complex structure of cancerous lesions in different imaging modalities, high degree of resemblance between inter-classes, presence of dissimilar characteristics in intra-classes, scarcity of medical data, and presence of artifacts and noises. In this study, these challenges are addressed by developing a shallow convolutional neural network (CNN) model with optimal configuration performing ablation study by altering layer structure and hyper-parameters and utilizing a suitable augmentation technique. Eight medical datasets with different modalities are investigated where the proposed model, named MNet-10, with low computational complexity is able to yield optimal performance across all datasets. The impact of photometric and geometric augmentation techniques on different datasets is also evaluated. We selected the mammogram dataset to proceed with the ablation study for being one of the most challenging imaging modalities. Before generating the model, the dataset is augmented using the two approaches. A base CNN model is constructed first and applied to both the augmented and non-augmented mammogram datasets where the highest accuracy is obtained with the photometric dataset. Therefore, the architecture and hyper-parameters of the model are determined by performing an ablation study on the base model using the mammogram photometric dataset. Afterward, the robustness of the network and the impact of different augmentation techniques are assessed by training the model with the rest of the seven datasets. We obtain a test accuracy of 97.34% on the mammogram, 98.43% on the skin cancer, 99.54% on the brain tumor magnetic resonance imaging (MRI), 97.29% on the COVID chest X-ray, 96.31% on the tympanic membrane, 99.82% on the chest computed tomography (CT) scan, and 98.75% on the breast cancer ultrasound datasets by photometric augmentation and 96.76% on the breast cancer microscopic biopsy dataset by geometric augmentation. Moreover, some elastic deformation augmentation methods are explored with the proposed model using all the datasets to evaluate their effectiveness. Finally, VGG16, InceptionV3, and ResNet50 were trained on the best-performing augmented datasets, and their performance consistency was compared with that of the MNet-10 model. The findings may aid future researchers in medical data analysis involving ablation studies and augmentation techniques.

Published in Frontiers in Medicine

ISSN: 2296-858X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General)
Website: http://www.frontiersin.org/journals/medicine

About the journal

Abstract

Keywords