A loss-based patch label denoising method for improving whole-slide image analysis using a convolutional neural network

Murtaza Ashraf; Willmer Rafell Quiñones Robles; Mujin Kim; Young Sin Ko; Mun Yong Yi

doi:10.1038/s41598-022-05001-8

Scientific Reports (Jan 2022)

A loss-based patch label denoising method for improving whole-slide image analysis using a convolutional neural network

Murtaza Ashraf,
Willmer Rafell Quiñones Robles,
Mujin Kim,
Young Sin Ko,
Mun Yong Yi

Affiliations

Murtaza Ashraf: Department of Industrial and Systems Engineering, Graduate School of Knowledge Service Engineering, Korea Advanced Institute of Science and Technology
Willmer Rafell Quiñones Robles: Department of Industrial and Systems Engineering, Graduate School of Knowledge Service Engineering, Korea Advanced Institute of Science and Technology
Mujin Kim: Department of Industrial and Systems Engineering, Graduate School of Knowledge Service Engineering, Korea Advanced Institute of Science and Technology
Young Sin Ko: Pathology Center, Seegene Medical Foundation
Mun Yong Yi: Department of Industrial and Systems Engineering, Graduate School of Knowledge Service Engineering, Korea Advanced Institute of Science and Technology

DOI: https://doi.org/10.1038/s41598-022-05001-8
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 18

Abstract

Read online

Abstract This paper proposes a deep learning-based patch label denoising method (LossDiff) for improving the classification of whole-slide images of cancer using a convolutional neural network (CNN). Automated whole-slide image classification is often challenging, requiring a large amount of labeled data. Pathologists annotate the region of interest by marking malignant areas, which pose a high risk of introducing patch-based label noise by involving benign regions that are typically small in size within the malignant annotations, resulting in low classification accuracy with many Type-II errors. To overcome this critical problem, this paper presents a simple yet effective method for noisy patch classification. The proposed method, validated using stomach cancer images, provides a significant improvement compared to other existing methods in patch-based cancer classification, with accuracies of 98.81%, 97.30% and 89.47% for binary, ternary, and quaternary classes, respectively. Moreover, we conduct several experiments at different noise levels using a publicly available dataset to further demonstrate the robustness of the proposed method. Given the high cost of producing explicit annotations for whole-slide images and the unavoidable error-prone nature of the human annotation of medical images, the proposed method has practical implications for whole-slide image annotation and automated cancer diagnosis.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal