Training nuclei detection algorithms with simple annotations

Henning Kost; André Homeyer; Jesper Molin; Claes Lundström; Horst Karl Hahn

doi:10.4103/jpi.jpi_3_17

Journal of Pathology Informatics (Jan 2017)

Journal volume & issue: Vol. 8, no. 1
p. 21

Abstract

Background: Generating good training datasets is essential for machine learning-based nuclei detection methods. However, creating the exhaustive nuclei contour annotations from which optimal training data could be derived is often infeasible.

Methods: We compared different approaches for training nuclei detection methods based solely on nucleus center markers. Such markers contain less accurate information, especially with regard to nuclear boundaries, but can be produced much more easily and in greater quantities. The approaches use different automated sample extraction methods to derive image positions and class labels from nucleus center markers. In addition, the approaches use different automated sample selection methods to improve the detection quality of the classification algorithm and to reduce the run time of the training process. We evaluated the approaches using a previously published generic nuclei detection algorithm and a set of Ki-67-stained breast cancer images.

Results: A Voronoi tessellation-based sample extraction method produced the best-performing training sets. However, subsampling of the extracted training samples was crucial. Even simple class balancing improved the detection quality considerably. Incorporating active learning led to a further increase in detection quality.

Conclusions: With appropriate sample extraction and selection methods, nuclei detection algorithms trained on simple center marker annotations can achieve detection quality comparable to that of algorithms trained on conventionally created training sets.
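
The general idea of deriving labeled samples from center markers with a Voronoi tessellation, followed by simple class balancing, can be illustrated with a minimal sketch. It is not the authors' implementation: the labeling rule (positive near a marker, negative near a Voronoi cell boundary), the function names `extract_samples` and `balance_classes`, and the parameters `pos_radius` and `edge_tol` are assumptions made here for illustration only.

```python
import math
import random


def extract_samples(markers, width, height, pos_radius=1.5, edge_tol=1.0):
    """Label pixels from nucleus center markers via a discrete Voronoi tessellation.

    Assumed rule (not taken from the paper): a pixel within pos_radius of its
    nearest marker is positive (label 1); a pixel roughly equidistant to its two
    nearest markers, i.e. near a Voronoi cell boundary, is negative (label 0);
    every other pixel is left unlabeled.
    """
    if not markers:
        return []
    samples = []  # list of ((x, y), label)
    for y in range(height):
        for x in range(width):
            dists = sorted(math.hypot(x - mx, y - my) for mx, my in markers)
            d1 = dists[0]
            d2 = dists[1] if len(dists) > 1 else math.inf
            if d1 <= pos_radius:
                samples.append(((x, y), 1))
            elif d2 - d1 <= edge_tol:
                samples.append(((x, y), 0))
    return samples


def balance_classes(samples, seed=0):
    """Randomly undersample the larger class so both classes have equal size."""
    rng = random.Random(seed)
    pos = [s for s in samples if s[1] == 1]
    neg = [s for s in samples if s[1] == 0]
    n = min(len(pos), len(neg))
    return rng.sample(pos, n) + rng.sample(neg, n)


# Hypothetical center markers on a small 20 x 20 image
markers = [(5, 5), (15, 5), (10, 15)]
samples = extract_samples(markers, width=20, height=20)
balanced = balance_classes(samples)
```

In this sketch, undersampling stands in for the "simple class balancing" mentioned in the results. Active learning would go further by selecting samples according to the current classifier's errors or uncertainty, which is not shown here.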

Published in Journal of Pathology Informatics

ISSN: 2229-5089 (Print); 2153-3539 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Medicine: Pathology
Website: https://www.journals.elsevier.com/journal-of-pathology-informatics
