Diagnostic Pathology (Jun 2016)

Application Of Ki-67 Analysis In A Distributed Computing Infrastructure

  • Marco Strutz,
  • B. Lindequist,
  • M. Witt,
  • H. Heßling,
  • P. Hufnagl,
  • D. Krefting

DOI
https://doi.org/10.17629/www.diagnosticpathology.eu-2016-8:135
Journal volume & issue
Vol. 1, no. 8

Abstract

Read online

Introduction/ Background Over the last few years, the protein Ki-67 [1] has been established as one of the most important biomarkers for cell proliferation in breast cancer. High Ki-67 values indicate high tumor growth and have direct impact on the patient’s treatment. Several automated image anal- ysis methods for identifying Ki-67-positive and negative tumor cells have been presented. Aims For small regions of a virtual slide, the Ki-67 analysis can be realized within an acceptable period of time. However, to analyse an entire whole slide image (WSI [2])most of the current methods are not sufficient yet. On a typical office computer, the processing time of 3,752 tiles, which were extracted from a H–DAB stained WSI, exceeded 24 hours. Therefore, we propose an approach to significantly speed up the process of analysing entire WSIs by using a distributed computing infrastructure. Methods To evaluate the approach, an unmodified and validated [3] [4] analysis software for Ki-67 was deployed on a six node setup supporting two different software engines: Hadoop Streaming [5] and Apache Spark [6] . Both tools support the MapReduce methodology whereas Apache Spark offers alternative programing models. In addition, heat maps visualizing the Ki-67 scores for an entire slide were generated which can provide additional informa- tion for clinical research. Results First results on automated and reproducible tests have been produced. By processing 3,752 tiles the speedup turned out to increase linearly with the number of tiles. The overall processing time was improved by a factor of 10, more precisely from 28 hours on a typical office computer to three hours on a distributed environment. Further optimization strategies besides WSI partitioning will be considered. To achieve additional improvements in processing speed, the underlying algorithm of a Ki-67 analysis can be examined with focus on how to adapt it towards distributed processing workflows.

Keywords