Scientific Reports (Jan 2022)

A loss-based patch label denoising method for improving whole-slide image analysis using a convolutional neural network

  • Murtaza Ashraf,
  • Willmer Rafell Quiñones Robles,
  • Mujin Kim,
  • Young Sin Ko,
  • Mun Yong Yi

DOI
https://doi.org/10.1038/s41598-022-05001-8
Journal volume & issue
Vol. 12, no. 1
pp. 1–18

Abstract

This paper proposes a deep learning-based patch label denoising method (LossDiff) for improving the classification of whole-slide cancer images using a convolutional neural network (CNN). Automated whole-slide image classification is challenging, requiring a large amount of labeled data. Pathologists annotate regions of interest by marking malignant areas, which risks introducing patch-based label noise when small benign regions are enclosed within the malignant annotations; this noise lowers classification accuracy and produces many Type-II errors. To overcome this critical problem, this paper presents a simple yet effective method for classifying noisy patches. The proposed method, validated on stomach cancer images, provides a significant improvement over existing methods in patch-based cancer classification, with accuracies of 98.81%, 97.30%, and 89.47% for binary, ternary, and quaternary classes, respectively. Moreover, we conduct several experiments at different noise levels using a publicly available dataset to further demonstrate the robustness of the proposed method. Given the high cost of producing explicit annotations for whole-slide images and the unavoidably error-prone nature of human annotation of medical images, the proposed method has practical implications for whole-slide image annotation and automated cancer diagnosis.
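The abstract names the technique (LossDiff) but not its mechanics. As a rough illustration of loss-based patch label denoising in general, not necessarily the authors' exact LossDiff procedure, one common heuristic is to flag patches whose per-sample training loss is anomalously high as likely mislabeled and exclude them. The PyTorch sketch below illustrates this idea; the function name, the standard-deviation threshold, and the filtering rule are all assumptions for illustration only.

```python
import torch
import torch.nn as nn

# Hypothetical sketch of loss-based patch label denoising, NOT the
# published LossDiff algorithm. Assumption: patches whose per-sample
# loss lies far above the mean are treated as likely mislabeled.

def filter_noisy_patches(model, patches, labels, threshold=2.0):
    """Return a boolean mask of patches to keep.

    patches:   (N, C, H, W) tensor of image patches
    labels:    (N,) tensor of (possibly noisy) patch labels
    threshold: keep patches whose loss is within `threshold`
               standard deviations of the mean loss (assumed heuristic)
    """
    criterion = nn.CrossEntropyLoss(reduction="none")  # per-sample loss
    model.eval()
    with torch.no_grad():
        logits = model(patches)
        losses = criterion(logits, labels)  # (N,) loss for each patch
    keep = losses < losses.mean() + threshold * losses.std()
    return keep

# Usage sketch (model and tensors are placeholders):
# mask = filter_noisy_patches(cnn, patch_batch, patch_labels)
# clean_patches = patch_batch[mask]
# clean_labels = patch_labels[mask]
```

A filter of this kind would typically be applied between training rounds, so that the CNN is retrained on the retained patches; the paper itself should be consulted for how LossDiff actually selects and reuses patches.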