Evaluating Uncertainty Quantification in Medical Image Segmentation: A Multi-Dataset, Multi-Algorithm Study

Nyaz Jalal; Małgorzata Śliwińska; Wadim Wojciechowski; Iwona Kucybała; Miłosz Rozynek; Kamil Krupa; Patrycja Matusik; Jarosław Jarczewski; Zbisław Tabor

doi:10.3390/app142110020

Applied Sciences (Nov 2024)

Evaluating Uncertainty Quantification in Medical Image Segmentation: A Multi-Dataset, Multi-Algorithm Study

Nyaz Jalal,
Małgorzata Śliwińska,
Wadim Wojciechowski,
Iwona Kucybała,
Miłosz Rozynek,
Kamil Krupa,
Patrycja Matusik,
Jarosław Jarczewski,
Zbisław Tabor

Affiliations

Nyaz Jalal: Department of Biocybernetics and Biomedical Engineering, AGH University, al. Mickiewicza 30, 30-059 Kraków, Poland
Małgorzata Śliwińska: Department of Biocybernetics and Biomedical Engineering, AGH University, al. Mickiewicza 30, 30-059 Kraków, Poland
Wadim Wojciechowski: Department of Radiology, Jagiellonian University Medical College, ul. Botaniczna 3, 31-503 Kraków, Poland
Iwona Kucybała: Department of Diagnostic Imaging, University Hospital, ul. Jakubowskiego 2, 30-688 Kraków, Poland
Miłosz Rozynek: Department of Radiology, Jagiellonian University Medical College, ul. Botaniczna 3, 31-503 Kraków, Poland
Kamil Krupa: Department of Radiology, Jagiellonian University Medical College, ul. Botaniczna 3, 31-503 Kraków, Poland
Patrycja Matusik: Department of Radiology, Jagiellonian University Medical College, ul. Botaniczna 3, 31-503 Kraków, Poland
Jarosław Jarczewski: Department of Diagnostic Imaging, University Hospital, ul. Jakubowskiego 2, 30-688 Kraków, Poland
Zbisław Tabor: Department of Biocybernetics and Biomedical Engineering, AGH University, al. Mickiewicza 30, 30-059 Kraków, Poland

DOI: https://doi.org/10.3390/app142110020
Journal volume & issue: Vol. 14, no. 21
p. 10020

Abstract

Read online

Deep learning is revolutionizing various scientific fields, with medical applications at the forefront. One key focus is automating image segmentation, a process crucial in many clinical services. However, medical images are often ambiguous and challenging even for experts. To address this, reliable models need to quantify their uncertainty, allowing physicians to understand the model’s confidence in its segmentation. This paper explores how the performance and uncertainty of a model are influenced by the number of annotations per input sample. We examine the effects of both single and multiple manual annotations on various deep learning architectures. To tackle this question, we employ three widely recognized deep learning architectures and evaluate them across four publicly available datasets. Furthermore, we explore the effects of dropout rates on Monte Carlo models by examining uncertainty models with dropout rates of 20%, 40%, 60%, and 80%. Subsequently, we evaluate the models using various measurement metrics. The findings reveal that the influence of multiple annotations varies significantly depending on the datasets. Additionally, we observe that the dropout rate has minimal or no impact on the model’s performance unless there is a substantial loss of training data, primarily evident in the 80% dropout rate scenario.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords