Impact of image compression on deep learning-based mammogram classification

Yong-Yeon Jo; Young Sang Choi; Hyun Woo Park; Jae Hyeok Lee; Hyojung Jung; Hyo-Eun Kim; Kyounglan Ko; Chan Wha Lee; Hyo Soung Cha; Yul Hwangbo

doi:10.1038/s41598-021-86726-w

Scientific Reports (Apr 2021)

Affiliations

Yong-Yeon Jo: Healthcare AI Team, National Cancer Center
Young Sang Choi: Healthcare AI Team, National Cancer Center
Hyun Woo Park: Healthcare AI Team, National Cancer Center
Jae Hyeok Lee: Healthcare AI Team, National Cancer Center
Hyojung Jung: Healthcare AI Team, National Cancer Center
Hyo-Eun Kim: Lunit Inc.
Kyounglan Ko: Department of Radiology, National Cancer Center
Chan Wha Lee: Department of Radiology, National Cancer Center
Hyo Soung Cha: Healthcare AI Team, National Cancer Center
Yul Hwangbo: Healthcare AI Team, National Cancer Center

DOI: https://doi.org/10.1038/s41598-021-86726-w
Journal volume & issue: Vol. 11, no. 1, pp. 1–9

Abstract

Image compression is used in several clinical organizations to help address the overhead associated with medical imaging. Compression methods reduce file size by using a compact representation of the original image. This study aimed to analyze the impact of image compression on the performance of deep learning-based models in classifying mammograms as “malignant” (cases that lead to a cancer diagnosis and treatment) or as “normal” and “benign” (non-malignant cases that do not require immediate medical intervention). In this retrospective study, 9111 unique mammograms (5672 normal, 1686 benign, and 1754 malignant cases) were collected from the National Cancer Center in the Republic of Korea. Image compression was applied to the mammograms with compression ratios (CRs) ranging from 15 to 11 K. Convolutional neural networks (CNNs) with three convolutional layers and three fully connected layers were trained on these images to classify a mammogram as malignant or not malignant across a range of CRs, using five-fold cross-validation. Models trained on images with maximum CRs of 5 K had an average area under the receiver operating characteristic curve (AUROC) of 0.87 and an average area under the precision-recall curve (AUPRC) of 0.75 across the five folds and compression ratios. For images compressed with CRs of 10 K and 11 K, model performance decreased (average AUROC of 0.79 and AUPRC of 0.49). In saliency maps, which visualize the areas each model views as significant for prediction, models trained on less compressed images (CR ≤ 5 K) produced maps encapsulating a radiologist’s label, while models trained on more heavily compressed images produced maps that missed the ground truth completely.
In addition, base ResNet18 models pre-trained on ImageNet and trained using compressed mammograms did not show performance improvements over our CNN model, with AUROC and AUPRC values ranging from 0.77 to 0.87 and from 0.52 to 0.71, respectively, when trained and tested on images with maximum CRs of 5 K. This paper finds that while training models on compressed images increased the robustness of the models when tested on compressed data, moderate image compression did not substantially impact the classification performance of DL-based models.
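
As an illustration of the two metrics reported above, the following is a minimal sketch of how AUROC and AUPRC could be computed per fold and then averaged over five folds. It is not the authors' evaluation code. The function names (`auroc`, `average_precision`, `summarize_folds`) are hypothetical, AUPRC is approximated here by average precision (one common estimate, assumed rather than confirmed by the abstract), and the label convention 1 = malignant, 0 = not malignant is an assumption.

```python
from statistics import mean


def auroc(labels, scores):
    """AUROC as the probability that a random positive case outscores
    a random negative case (tied scores count as half a win)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Average precision, a step-wise estimate of AUPRC.
    Assumes no tied scores; ties would make the ranking order arbitrary."""
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive label")
    ranked = sorted(zip(scores, labels), key=lambda t: t[0], reverse=True)
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank  # precision at this positive's rank
    return total / n_pos


def summarize_folds(folds):
    """Mean AUROC and mean average precision over (labels, scores) folds."""
    return (
        mean(auroc(y, s) for y, s in folds),
        mean(average_precision(y, s) for y, s in folds),
    )


if __name__ == "__main__":
    # Toy data only: 1 = malignant, 0 = not malignant (assumed convention).
    fold = ([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1])
    print(summarize_folds([fold] * 5))
```

Because AUPRC depends on the fraction of positive cases, it is more sensitive than AUROC to class imbalance, which is one reason the two values are reported side by side.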

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
